팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

source

팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

factcode 2023. 10. 11. 21:02

팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

있습니다pandas제가 작성한 DataFrameconcat. 하나의 행은 96개의 값으로 구성되어 있는데, 저는 72개의 값에서 DataFrame을 나누어 보고 싶습니다.

행의 처음 72 값은 Dataframe1에 저장되고, 다음 24 값은 Dataframe2에 저장됩니다.

다음과 같이 DF를 만듭니다.

temps = DataFrame(myData)
datasX = concat(
[temps.shift(72), temps.shift(71), temps.shift(70), temps.shift(69), temps.shift(68), temps.shift(67),
 temps.shift(66), temps.shift(65), temps.shift(64), temps.shift(63), temps.shift(62), temps.shift(61),
 temps.shift(60), temps.shift(59), temps.shift(58), temps.shift(57), temps.shift(56), temps.shift(55),
 temps.shift(54), temps.shift(53), temps.shift(52), temps.shift(51), temps.shift(50), temps.shift(49),
 temps.shift(48), temps.shift(47), temps.shift(46), temps.shift(45), temps.shift(44), temps.shift(43),
 temps.shift(42), temps.shift(41), temps.shift(40), temps.shift(39), temps.shift(38), temps.shift(37),
 temps.shift(36), temps.shift(35), temps.shift(34), temps.shift(33), temps.shift(32), temps.shift(31),
 temps.shift(30), temps.shift(29), temps.shift(28), temps.shift(27), temps.shift(26), temps.shift(25),
 temps.shift(24), temps.shift(23), temps.shift(22), temps.shift(21), temps.shift(20), temps.shift(19),
 temps.shift(18), temps.shift(17), temps.shift(16), temps.shift(15), temps.shift(14), temps.shift(13),
 temps.shift(12), temps.shift(11), temps.shift(10), temps.shift(9), temps.shift(8), temps.shift(7),
 temps.shift(6), temps.shift(5), temps.shift(4), temps.shift(3), temps.shift(2), temps.shift(1), temps,
 temps.shift(-1), temps.shift(-2), temps.shift(-3), temps.shift(-4), temps.shift(-5), temps.shift(-6),
 temps.shift(-7), temps.shift(-8), temps.shift(-9), temps.shift(-10), temps.shift(-11), temps.shift(-12),
 temps.shift(-13), temps.shift(-14), temps.shift(-15), temps.shift(-16), temps.shift(-17), temps.shift(-18),
 temps.shift(-19), temps.shift(-20), temps.shift(-21), temps.shift(-22), temps.shift(-23)], axis=1)

질문:어떻게 그것들을 나눌 수 있습니까? :)

`iloc`

df1 = datasX.iloc[:, :72]
df2 = datasX.iloc[:, 72:]

(iloc 문서)

np. split(..., 축=1)을 사용합니다.

데모:

In [255]: df = pd.DataFrame(np.random.rand(5, 6), columns=list('abcdef'))

In [256]: df
Out[256]:
          a         b         c         d         e         f
0  0.823638  0.767999  0.460358  0.034578  0.592420  0.776803
1  0.344320  0.754412  0.274944  0.545039  0.031752  0.784564
2  0.238826  0.610893  0.861127  0.189441  0.294646  0.557034
3  0.478562  0.571750  0.116209  0.534039  0.869545  0.855520
4  0.130601  0.678583  0.157052  0.899672  0.093976  0.268974

In [257]: dfs = np.split(df, [4], axis=1)

In [258]: dfs[0]
Out[258]:
          a         b         c         d
0  0.823638  0.767999  0.460358  0.034578
1  0.344320  0.754412  0.274944  0.545039
2  0.238826  0.610893  0.861127  0.189441
3  0.478562  0.571750  0.116209  0.534039
4  0.130601  0.678583  0.157052  0.899672

In [259]: dfs[1]
Out[259]:
          e         f
0  0.592420  0.776803
1  0.031752  0.784564
2  0.294646  0.557034
3  0.869545  0.855520
4  0.093976  0.268974

np.split()상당히 유연합니다. 인덱스가 있는 열에서 원래 DF를 3개의 DF로 나누어 보겠습니다.[2,3]:

In [260]: dfs = np.split(df, [2,3], axis=1)

In [261]: dfs[0]
Out[261]:
          a         b
0  0.823638  0.767999
1  0.344320  0.754412
2  0.238826  0.610893
3  0.478562  0.571750
4  0.130601  0.678583

In [262]: dfs[1]
Out[262]:
          c
0  0.460358
1  0.274944
2  0.861127
3  0.116209
4  0.157052

In [263]: dfs[2]
Out[263]:
          d         e         f
0  0.034578  0.592420  0.776803
1  0.545039  0.031752  0.784564
2  0.189441  0.294646  0.557034
3  0.534039  0.869545  0.855520
4  0.899672  0.093976  0.268974

저는 일반적으로 배열 분할을 사용합니다. 왜냐하면 단순 구문이 더 쉽고 2개 이상의 파티션으로 확장이 더 잘되기 때문입니다.

import numpy as np
partitions = 2
dfs = np.array_split(df, partitions)

np.split(df, [100,200,300], axis=0] 바람직하거나 바람직하지 않은 명시적인 인덱스 번호를 원합니다.

언급URL : https://stackoverflow.com/questions/41624241/pandas-split-dataframe-into-two-dataframes-at-a-specific-row

'source' 카테고리의 다른 글

BYTE 데이터 유형을 정의하는 C/C++ 헤더 파일은 무엇입니까? (0)	2023.10.11
피토치 텐서와 눔피 배열 (0)	2023.10.11
jQuery: 데이터 특성 가져오기 (0)	2023.10.11
개체의 고정 크기 배열을 만드는 방법 (0)	2023.10.11
파이썬에서 테스트/유니트 테스트 없이 출력을 주장하는 방법? (0)	2023.10.11

현재글팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

AngularJS, spring-boot, Vuex, Wordpress, oracle, C, Excel, git, vuejs2, jQuery, MySQL, JavaScript, php, REACTJS, asp.net, Python, java, MariaDB, json, sql-server,

Today :
Yesterday :

factcode

팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

`iloc`

'source' 카테고리의 다른 글

'source'의 다른글

티스토리툴바

« 2025/12 »
일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

팬더 데이터 프레임을 특정 행에서 두 개의 데이터 프레임으로 분할

iloc

'source' 카테고리의 다른 글

'source'의 다른글

관련글

티스토리툴바

`iloc`