pandas遍历每一行|极客笔记

pandas遍历每一行

在数据分析和处理的过程中，经常需要遍历DataFrame中的每一行数据。在Python中，使用pandas库可以非常方便地对DataFrame进行操作。本文将详细介绍如何使用pandas遍历每一行数据，并提供一些实用的示例代码。

1. 使用iterrows方法遍历每一行

pandas中的DataFrame对象提供了一个iterrows方法，可以用来遍历DataFrame中的每一行数据。具体使用方法如下：

import pandas as pd

# 创建一个示例DataFrame
data = {'A': [1, 2, 3], 'B': [4, 5, 6]}
df = pd.DataFrame(data)

# 使用iterrows方法遍历每一行
for index, row in df.iterrows():
    print(f'Row index: {index}')
    print(row)
    print('-----------------------------------')

上述代码会输出DataFrame中每一行的索引和数据内容。在循环中，可以使用row[‘列名’]来获取每一行中指定列的数据。

运行结果：

Row index: 0
A    1
B    4
Name: 0, dtype: int64
-----------------------------------
Row index: 1
A    2
B    5
Name: 1, dtype: int64
-----------------------------------
Row index: 2
A    3
B    6
Name: 2, dtype: int64
-----------------------------------

2. 使用apply方法遍历每一行

除了iterrows方法外，还可以使用apply方法来遍历每一行数据。apply方法可以接收一个函数作为参数，并对DataFrame中的每一行应用该函数。

import pandas as pd

# 创建一个示例DataFrame
data = {'A': [1, 2, 3], 'B': [4, 5, 6]}
df = pd.DataFrame(data)

# 定义一个处理每一行数据的函数
def process_row(row):
    return row['A'] * row['B']

# 使用apply方法遍历每一行
result = df.apply(process_row, axis=1)
print(result)

上述代码定义了一个process_row函数，对每一行数据进行处理并返回结果。然后使用apply方法遍历每一行数据，并将处理结果存储在result变量中。