Supported pandas API

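The following tables show which pandas APIs are implemented or not implemented in pandas API on Spark. Some pandas APIs do not implement all of their parameters, so the third column lists the parameters missing from each API.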

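  • "Y" in the second column means the API is implemented, including all of its parameters.

  • "N" means the API is not implemented yet.

  • "P" means the API is partially implemented, with some parameters missing.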
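The three status codes can be read as a comparison between two function signatures. The sketch below illustrates that reading with Python's standard `inspect` module. It is not necessarily how the tables on this page are produced. The helper names `missing_parameters` and `status`, and the two stand-in functions, are hypothetical and exist only for this example. The stand-ins mimic the `sort_values()` row of the Index tables, where `key` and `na_position` are listed as missing.

```python
import inspect


def missing_parameters(reference, candidate):
    """Return the sorted names of parameters that `reference` accepts but `candidate` does not."""
    reference_names = set(inspect.signature(reference).parameters)
    candidate_names = set(inspect.signature(candidate).parameters)
    return sorted(reference_names - candidate_names)


def status(reference, candidate):
    """Classify `candidate` as "Y", "P" or "N" relative to `reference`."""
    if candidate is None:
        # No counterpart exists at all.
        return "N"
    # A counterpart exists; it is partial if any reference parameter is absent.
    return "P" if missing_parameters(reference, candidate) else "Y"


# Hypothetical stand-ins, NOT the real pandas or pyspark.pandas functions.
def reference_sort_values(self, return_indexer=False, ascending=True,
                          na_position="last", key=None):
    ...


def candidate_sort_values(self, return_indexer=False, ascending=True):
    ...


print(status(reference_sort_values, candidate_sort_values))              # P
print(missing_parameters(reference_sort_values, candidate_sort_values))  # ['key', 'na_position']
print(status(reference_sort_values, None))                               # N
```

A "Y" row corresponds to an empty difference, which is why those rows leave the third column blank.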

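All APIs in the lists below compute data with distributed execution, except those that require local execution by design. For example, DataFrame.to_numpy() requires collecting the data to the driver side.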

If you need a pandas API or parameter that is not implemented yet, you can create an Apache Spark JIRA to request it, or contribute it yourself.

The API list is updated based on the latest pandas official API reference.

CategoricalIndex API

API

Implemented

Missing parameters

add_categories()

Y

all()

Y

any()

Y

append()

Y

argmax()

P

axis , skipna

argmin()

P

axis , skipna

argsort

N

as_ordered()

Y

as_unordered()

Y

asof()

Y

asof_locs

N

astype()

P

copy

copy()

Y

delete()

Y

difference()

Y

drop()

P

errors

drop_duplicates()

Y

droplevel()

Y

dropna()

Y

duplicated

N

equals()

Y

factorize()

P

use_na_sentinel

fillna()

P

downcast

format

N

get_indexer

N

get_indexer_for

N

get_indexer_non_unique

N

get_level_values()

Y

get_loc

N

get_slice_bound

N

groupby

N

holds_integer()

Y

identical()

Y

infer_objects

N

insert()

Y

intersection()

P

sort

is_

N

is_boolean()

Y

is_categorical()

Y

is_floating()

Y

is_integer()

Y

is_interval()

Y

is_numeric()

Y

is_object()

Y

isin()

P

level

isna()

Y

isnull()

Y

item()

Y

join

N

map()

Y

max()

Y

memory_usage

N

min()

Y

notna()

Y

notnull()

Y

nunique()

Y

putmask

N

ravel

N

reindex

N

remove_categories()

Y

remove_unused_categories()

Y

rename()

Y

rename_categories()

Y

reorder_categories()

Y

repeat()

P

axis

searchsorted

N

set_categories()

Y

set_names()

Y

shift()

P

freq

slice_indexer

N

slice_locs

N

sort()

Y

sort_values()

P

key , na_position

sortlevel

N

symmetric_difference()

Y

take()

P

allow_fill , axis , fill_value

to_flat_index

N

to_frame()

Y

to_list()

Y

to_numpy()

P

na_value

to_series()

P

index

tolist()

Y

transpose()

Y

union()

Y

unique()

Y

value_counts()

Y

view()

Y

where

N

DataFrame API

API

Implemented

Missing parameters

abs()

Y

add()

P

axis , fill_value , level

add_prefix()

P

axis

add_suffix()

P

axis

agg()

P

axis

aggregate()

P

axis

align()

P

broadcast_axis , fill_axis , fill_value , level , limit and more. See pandas.DataFrame.align and pyspark.pandas.DataFrame.align for details.

all()

Y

any()

P

skipna

apply()

P

raw , result_type

applymap()

P

na_action

asfreq

N

asof

N

assign()

Y

astype()

P

copy , errors

at_time()

Y

backfill()

P

downcast

between_time()

P

inclusive

bfill()

P

downcast

bool()

Y

boxplot()

P

ax , backend , by , column , figsize and more. See pandas.DataFrame.boxplot and pyspark.pandas.DataFrame.boxplot for details.

clip()

P

axis , inplace

combine

N

combine_first()

Y

compare

N

convert_dtypes

N

copy()

Y

corr()

P

numeric_only

corrwith()

P

numeric_only

count()

Y

cov()

P

numeric_only

cummax()

P

axis

cummin()

P

axis

cumprod()

P

axis

cumsum()

P

axis

describe()

P

exclude , include

diff()

Y

div()

P

axis , fill_value , level

divide()

P

axis , fill_value , level

dot()

Y

drop()

P

errors , inplace , level

drop_duplicates()

Y

droplevel()

Y

dropna()

P

ignore_index

duplicated()

Y

eq()

P

axis , level

equals()

Y

eval()

Y

ewm()

P

adjust , axis , method , times

expanding()

P

axis , method

explode()

Y

ffill()

P

downcast

fillna()

P

downcast

filter()

Y

first()

Y

first_valid_index()

Y

floordiv()

P

axis , fill_value , level

ge()

P

axis , level

get()

Y

groupby()

P

group_keys , level , observed , sort

gt()

P

axis , level

head()

Y

hist()

P

ax , backend , by , column , data and more. See pandas.DataFrame.hist and pyspark.pandas.DataFrame.hist for details.

idxmax()

P

numeric_only , skipna

idxmin()

P

numeric_only , skipna

infer_objects

N

info()

P

memory_usage , show_counts

insert()

Y

interpolate()

P

axis , downcast , inplace

isetitem

N

isin()

Y

isna()

Y

isnull()

Y

items()

Y

iterrows()

Y

itertuples()

Y

join()

P

other , sort , validate

keys()

Y

kurt()

Y

kurtosis()

Y

last()

Y

last_valid_index()

Y

le()

P

axis , level

lt()

P

axis , level

mask()

P

axis , inplace , level

max()

Y

mean()

Y

median()

Y

melt()

P

col_level , ignore_index

memory_usage

N

merge()

P

copy , indicator , sort , validate

min()

Y

mod()

P

axis , fill_value , level

mode()

Y

mul()

P

axis , fill_value , level

multiply()

P

axis , fill_value , level

ne()

P

axis , level

nlargest()

Y

notna()

Y

notnull()

Y

nsmallest()

Y

nunique()

Y

pad()

P

downcast

pct_change()

P

fill_method , freq , limit

pipe()

Y

pivot()

Y

pivot_table()

P

dropna , margins , margins_name , observed , sort

pop()

Y

pow()

P

axis , fill_value , level

prod()

Y

product()

Y

quantile()

P

interpolation , method

query()

Y

radd()

P

axis , fill_value , level

rank()

P

axis , na_option , pct

rdiv()

P

axis , fill_value , level

reindex()

P

level , limit , method , tolerance

reindex_like()

P

limit , method , tolerance

rename()

P

copy

rename_axis()

P

copy

reorder_levels

N

replace()

Y

resample()

P

axis , convention , group_keys , kind , level and more. See pandas.DataFrame.resample and pyspark.pandas.DataFrame.resample for details.

reset_index()

P

allow_duplicates , names

rfloordiv()

P

axis , fill_value , level

rmod()

P

axis , fill_value , level

rmul()

P

axis , fill_value , level

rolling()

P

axis , center , closed , method , on and more. See pandas.DataFrame.rolling and pyspark.pandas.DataFrame.rolling for details.

round()

Y

rpow()

P

axis , fill_value , level

rsub()

P

axis , fill_value , level

rtruediv()

P

axis , fill_value , level

sample()

P

axis , weights

select_dtypes()

Y

sem()

Y

set_axis

N

set_flags

N

set_index()

P

verify_integrity

shift()

P

axis , freq

skew()

Y

sort_index()

P

key , sort_remaining

sort_values()

P

axis , key , kind

squeeze()

Y

stack()

P

dropna , level

std()

Y

sub()

P

axis , fill_value , level

subtract()

P

axis , fill_value , level

sum()

Y

swapaxes()

P

axis1 , axis2

swaplevel()

Y

tail()

Y

take()

Y

to_clipboard()

Y

to_csv()

P

chunksize , compression , decimal , doublequote , encoding and more. See pandas.DataFrame.to_csv and pyspark.pandas.DataFrame.to_csv for details.

to_dict()

P

index

to_excel()

P

storage_options

to_feather

N

to_gbq

N

to_hdf

N

to_html()

P

encoding

to_json()

P

date_format , date_unit , default_handler , double_precision , force_ascii and more. See pandas.DataFrame.to_json and pyspark.pandas.DataFrame.to_json for details.

to_latex()

P

caption , label , position

to_markdown()

P

index , storage_options

to_numpy()

P

copy , dtype , na_value

to_orc()

P

engine , engine_kwargs , index

to_parquet()

P

engine , index , storage_options

to_period

N

to_pickle

N

to_records()

Y

to_sql

N

to_stata

N

to_string()

P

encoding , max_colwidth , min_rows

to_timestamp

N

to_xarray

N

to_xml

N

transform()

Y

transpose()

P

copy

truediv()

P

axis , fill_value , level

truncate()

Y

tz_convert

N

tz_localize

N

unstack()

P

fill_value , level

update()

P

errors , filter_func

value_counts

N

var()

P

skipna

where()

P

inplace , level

xs()

P

drop_level

DatetimeIndex API

API

Implemented

Missing parameters

all()

Y

any()

Y

append()

Y

argmax()

P

axis , skipna

argmin()

P

axis , skipna

argsort

N

as_unit

N

asof()

Y

asof_locs

N

astype()

P

copy

ceil()

Y

copy()

Y

day_name()

Y

delete()

Y

difference()

Y

drop()

P

errors

drop_duplicates()

Y

droplevel()

Y

dropna()

Y

duplicated

N

equals()

Y

factorize()

P

use_na_sentinel

fillna()

P

downcast

floor()

Y

format

N

get_indexer

N

get_indexer_for

N

get_indexer_non_unique

N

get_level_values()

Y

get_loc

N

get_slice_bound

N

groupby

N

holds_integer()

Y

identical()

Y

indexer_at_time()

Y

indexer_between_time()

Y

infer_objects

N

insert()

Y

intersection()

P

sort

is_

N

is_boolean()

Y

is_categorical()

Y

is_floating()

Y

is_integer()

Y

is_interval()

Y

is_numeric()

Y

is_object()

Y

isin()

P

level

isna()

Y

isnull()

Y

isocalendar

N

item()

Y

join

N

map()

Y

max()

P

axis , skipna

mean

N

memory_usage

N

min()

P

axis , skipna

month_name()

Y

normalize()

Y

notna()

Y

notnull()

Y

nunique()

Y

putmask

N

ravel

N

reindex

N

rename()

Y

repeat()

P

axis

round()

Y

searchsorted

N

set_names()

Y

shift()

P

freq

slice_indexer

N

slice_locs

N

snap

N

sort()

Y

sort_values()

P

key , na_position

sortlevel

N

std

N

strftime()

Y

symmetric_difference()

Y

take()

P

allow_fill , axis , fill_value

to_flat_index

N

to_frame()

Y

to_julian_date

N

to_list()

Y

to_numpy()

P

na_value

to_period

N

to_pydatetime

N

to_series()

P

index

tolist()

Y

transpose()

Y

tz_convert

N

tz_localize

N

union()

Y

unique()

Y

value_counts()

Y

view()

Y

where

N

Index API

API

Implemented

Missing parameters

all()

Y

any()

Y

append()

Y

argmax()

P

axis , skipna

argmin()

P

axis , skipna

argsort

N

asof()

Y

asof_locs

N

astype()

P

copy

copy()

Y

delete()

Y

difference()

Y

drop()

P

errors

drop_duplicates()

Y

droplevel()

Y

dropna()

Y

duplicated

N

equals()

Y

factorize()

P

use_na_sentinel

fillna()

P

downcast

format

N

get_indexer

N

get_indexer_for

N

get_indexer_non_unique

N

get_level_values()

Y

get_loc

N

get_slice_bound

N

groupby

N

holds_integer()

Y

identical()

Y

infer_objects

N

insert()

Y

intersection()

P

sort

is_

N

is_boolean()

Y

is_categorical()

Y

is_floating()

Y

is_integer()

Y

is_interval()

Y

is_numeric()

Y

is_object()

Y

isin()

P

level

isna()

Y

isnull()

Y

item()

Y

join

N

map()

Y

max()

P

axis , skipna

memory_usage

N

min()

P

axis , skipna

notna()

Y

notnull()

Y

nunique()

Y

putmask

N

ravel

N

reindex

N

rename()

Y

repeat()

P

axis

searchsorted

N

set_names()

Y

shift()

P

freq

slice_indexer

N

slice_locs

N

sort()

Y

sort_values()

P

key , na_position

sortlevel

N

symmetric_difference()

Y

take()

P

allow_fill , axis , fill_value

to_flat_index

N

to_frame()

Y

to_list()

Y

to_numpy()

P

na_value

to_series()

P

index

tolist()

Y

transpose()

Y

union()

Y

unique()

Y

value_counts()

Y

view()

Y

where

N

MultiIndex API

API

Implemented

Missing parameters

all()

Y

any()

Y

append()

Y

argmax()

P

axis , skipna

argmin()

P

axis , skipna

argsort

N

asof()

Y

asof_locs

N

astype()

P

copy

copy()

P

name , names

delete()

Y

difference()

Y

drop()

P

errors

drop_duplicates()

Y

droplevel()

Y

dropna()

Y

duplicated

N

equal_levels()

Y

equals()

Y

factorize()

P

use_na_sentinel

fillna()

P

downcast

format

N

get_indexer

N

get_indexer_for

N

get_indexer_non_unique

N

get_level_values()

Y

get_loc

N

get_loc_level

N

get_locs

N

get_slice_bound

N

groupby

N

holds_integer()

Y

identical()

Y

infer_objects

N

insert()

Y

intersection()

P

sort

is_

N

is_boolean()

Y

is_categorical()

Y

is_floating()

Y

is_integer()

Y

is_interval()

Y

is_numeric()

Y

is_object()

Y

isin()

P

level

isna()

Y

isnull()

Y

item()

Y

join

N

map()

Y

max()

P

axis , skipna

memory_usage

N

min()

P

axis , skipna

notna()

Y

notnull()

Y

nunique()

Y

putmask

N

ravel

N

reindex

N

remove_unused_levels

N

rename()

P

level , names

reorder_levels

N

repeat()

P

axis

searchsorted

N

set_codes

N

set_levels

N

set_names()

Y

shift()

P

freq

slice_indexer

N

slice_locs

N

sort()

Y

sort_values()

P

key , na_position

sortlevel

N

swaplevel()

Y

symmetric_difference()

Y

take()

P

allow_fill , axis , fill_value

to_flat_index

N

to_frame()

P

allow_duplicates

to_list()

Y

to_numpy()

P

na_value

to_series()

P

index

tolist()

Y

transpose()

Y

truncate

N

union()

Y

unique()

Y

value_counts()

Y

view()

Y

where

N

Series API

API

Implemented

Missing parameters

abs()

Y

add()

P

axis , level

add_prefix()

P

axis

add_suffix()

P

axis

agg()

P

axis

aggregate()

P

axis

align()

P

broadcast_axis , fill_axis , fill_value , level , limit and more. See pandas.Series.align and pyspark.pandas.Series.align for details.

all()

P

bool_only

any()

P

bool_only , skipna

apply()

P

convert_dtype

argmax()

Y

argmin()

Y

argsort()

P

axis , kind , order

asfreq

N

asof()

P

subset

astype()

P

copy , errors

at_time()

Y

autocorr()

Y

backfill()

P

downcast

between()

Y

between_time()

P

inclusive

bfill()

P

downcast

bool()

Y

clip()

P

axis

combine

N

combine_first()

Y

compare()

P

align_axis , result_names

convert_dtypes

N

copy()

Y

corr()

Y

count()

Y

cov()

Y

cummax()

P

axis

cummin()

P

axis

cumprod()

P

axis

cumsum()

P

axis

describe()

P

exclude , include

diff()

Y

div()

P

axis , fill_value , level

divide()

P

axis , fill_value , level

divmod()

P

axis , fill_value , level

dot()

Y

drop()

P

axis , errors

drop_duplicates()

P

ignore_index

droplevel()

P

axis

dropna()

P

how , ignore_index

duplicated()

Y

eq()

P

axis , fill_value , level

equals()

Y

ewm()

P

adjust , axis , method , times

expanding()

P

axis , method

explode()

P

ignore_index

factorize()

P

use_na_sentinel

ffill()

P

downcast

fillna()

P

downcast

filter()

Y

first()

Y

first_valid_index()

Y

floordiv()

P

axis , fill_value , level

ge()

P

axis , fill_value , level

get()

Y

groupby()

P

group_keys , level , observed , sort

gt()

P

axis , fill_value , level

head()

Y

hist()

P

ax , backend , by , figsize , grid and more. See pandas.Series.hist and pyspark.pandas.Series.hist for details.

idxmax()

P

axis

idxmin()

P

axis

infer_objects

N

info

N

interpolate()

P

axis , downcast , inplace

isin()

Y

isna()

Y

isnull()

Y

item()

Y

items()

Y

keys()

Y

kurt()

Y

kurtosis()

Y

last()

Y

last_valid_index()

Y

le()

P

axis , fill_value , level

lt()

P

axis , fill_value , level

map()

Y

mask()

P

axis , inplace , level

max()

Y

mean()

Y

median()

Y

memory_usage

N

min()

Y

mod()

P

axis , fill_value , level

mode()

Y

mul()

P

axis , fill_value , level

multiply()

P

axis , fill_value , level

ne()

P

axis , fill_value , level

nlargest()

P

keep

notna()

Y

notnull()

Y

nsmallest()

P

keep

nunique()

Y

pad()

P

downcast

pct_change()

P

fill_method , freq , limit

pipe()

Y

pop()

Y

pow()

P

axis , fill_value , level

prod()

Y

product()

Y

quantile()

P

interpolation

radd()

P

axis , level

rank()

P

axis , na_option , pct

ravel

N

rdiv()

P

axis , fill_value , level

rdivmod()

P

axis , fill_value , level

reindex()

P

axis , copy , level , limit , method and more. See pandas.Series.reindex and pyspark.pandas.Series.reindex for details.

reindex_like()

P

copy , limit , method , tolerance

rename()

P

axis , copy , errors , inplace , level

rename_axis()

P

axis , copy

reorder_levels

N

repeat()

P

axis

replace()

P

inplace , limit , method

resample()

P

axis , convention , group_keys , kind , level and more. See pandas.Series.resample and pyspark.pandas.Series.resample for details.

reset_index()

P

allow_duplicates

rfloordiv()

P

axis , fill_value , level

rmod()

P

axis , fill_value , level

rmul()

P

axis , fill_value , level

rolling()

P

axis , center , closed , method , on and more. See pandas.Series.rolling and pyspark.pandas.Series.rolling for details.

round()

Y

rpow()

P

axis , fill_value , level

rsub()

P

axis , fill_value , level

rtruediv()

P

axis , fill_value , level

sample()

P

axis , weights

searchsorted()

P

sorter

sem()

Y

set_axis

N

set_flags

N

shift()

P

axis , freq

skew()

Y

sort_index()

P

key , sort_remaining

sort_values()

P

axis , key , kind

squeeze()

Y

std()

Y

sub()

P

axis , fill_value , level

subtract()

P

axis , fill_value , level

sum()

Y

swapaxes()

P

axis1 , axis2

swaplevel()

Y

tail()

Y

take()

P

axis

to_clipboard()

Y

to_csv()

P

chunksize , compression , decimal , doublequote , encoding and more. See pandas.Series.to_csv and pyspark.pandas.Series.to_csv for details.

to_dict()

Y

to_excel()

P

storage_options

to_frame()

Y

to_hdf

N

to_json()

P

date_format , date_unit , default_handler , double_precision , force_ascii and more. See pandas.Series.to_json and pyspark.pandas.Series.to_json for details.

to_latex()

P

caption , label , position

to_list()

Y

to_markdown()

P

index , storage_options

to_numpy()

P

copy , dtype , na_value

to_period

N

to_pickle

N

to_sql

N

to_string()

P

min_rows

to_timestamp

N

to_xarray

N

tolist()

Y

transform()

Y

transpose()

Y

truediv()

P

axis , fill_value , level

truncate()

Y

tz_convert

N

tz_localize

N

unique()

Y

unstack()

P

fill_value

update()

Y

value_counts()

Y

var()

P

skipna

view

N

where()

P

axis , inplace , level

xs()

P

axis , drop_level

TimedeltaIndex API

API

Implemented

Missing parameters

all()

Y

any()

Y

append()

Y

argmax()

P

axis , skipna

argmin()

P

axis , skipna

argsort

N

as_unit

N

asof()

Y

asof_locs

N

astype()

P

copy

ceil

N

copy()

Y

delete()

Y

difference()

Y

drop()

P

errors

drop_duplicates()

Y

droplevel()

Y

dropna()

Y

duplicated

N

equals()

Y

factorize()

P

use_na_sentinel

fillna()

P

downcast

floor

N

format

N

get_indexer

N

get_indexer_for

N

get_indexer_non_unique

N

get_level_values()

Y

get_loc

N

get_slice_bound

N

groupby

N

holds_integer()

Y

identical()

Y

infer_objects

N

insert()

Y

intersection()

P

sort

is_

N

is_boolean()

Y

is_categorical()

Y

is_floating()

Y

is_integer()

Y

is_interval()

Y

is_numeric()

Y

is_object()

Y

isin()

P

level

isna()

Y

isnull()

Y

item()

Y

join

N

map()

Y

max()

P

axis , skipna

mean

N

median

N

memory_usage

N

min()

P

axis , skipna

notna()

Y

notnull()

Y

nunique()

Y

putmask

N

ravel

N

reindex

N

rename()

Y

repeat()

P

axis

round

N

searchsorted

N

set_names()

Y

shift()

P

freq

slice_indexer

N

slice_locs

N

sort()

Y

sort_values()

P

key , na_position

sortlevel

N

std

N

sum

N

symmetric_difference()

Y

take()

P

allow_fill , axis , fill_value

to_flat_index

N

to_frame()

Y

to_list()

Y

to_numpy()

P

na_value

to_pytimedelta

N

to_series()

P

index

tolist()

Y

total_seconds

N

transpose()

Y

union()

Y

unique()

Y

value_counts()

Y

view()

Y

where

N

General Function API

API

Implemented

Missing parameters

array

N

bdate_range

N

concat()

P

copy , keys , levels , names , verify_integrity

crosstab

N

cut

N

date_range()

P

inclusive , unit

eval

N

factorize

N

from_dummies

N

get_dummies()

Y

infer_freq

N

interval_range

N

isna()

Y

isnull()

Y

json_normalize

N

lreshape

N

melt()

P

col_level , ignore_index

merge()

P

copy , indicator , left , sort , validate

merge_asof()

Y

merge_ordered

N

notna()

Y

notnull()

Y

period_range

N

pivot

N

pivot_table

N

qcut

N

read_clipboard()

P

dtype_backend

read_csv()

P

cache_dates , chunksize , compression , converters , date_format and more. See pandas.read_csv and pyspark.pandas.read_csv for details.

read_excel()

P

date_format , decimal , dtype_backend , na_filter , storage_options

read_feather

N

read_fwf

N

read_gbq

N

read_hdf

N

read_html()

P

dtype_backend , extract_links

read_json()

P

chunksize , compression , convert_axes , convert_dates , date_unit and more. See pandas.read_json and pyspark.pandas.read_json for details.

read_orc()

P

dtype_backend

read_parquet()

P

dtype_backend , engine , storage_options , use_nullable_dtypes

read_pickle

N

read_sas

N

read_spss

N

read_sql()

P

chunksize , coerce_float , dtype , dtype_backend , params and more. See pandas.read_sql and pyspark.pandas.read_sql for details.

read_sql_query()

P

chunksize , coerce_float , dtype , dtype_backend , params and more. See pandas.read_sql_query and pyspark.pandas.read_sql_query for details.

read_sql_table()

P

chunksize , coerce_float , dtype_backend , parse_dates

read_stata

N

read_table()

P

cache_dates , chunksize , comment , compression , converters and more. See pandas.read_table and pyspark.pandas.read_table for details.

read_xml

N

set_eng_float_format

N

show_versions

N

test

N

timedelta_range()

P

unit

to_datetime()

P

cache , dayfirst , exact , utc , yearfirst

to_numeric()

P

downcast , dtype_backend

to_pickle

N

to_timedelta()

Y

unique

N

value_counts

N

wide_to_long

N

Expanding API

API

Implemented

Missing parameters

agg

N

aggregate

N

apply

N

corr

N

count()

P

numeric_only

cov

N

kurt()

P

numeric_only

max()

P

engine , engine_kwargs , numeric_only

mean()

P

engine , engine_kwargs , numeric_only

median

N

min()

P

engine , engine_kwargs , numeric_only

quantile()

P

interpolation , numeric_only

rank

N

sem

N

skew()

P

numeric_only

std()

P

ddof , engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs , numeric_only

var()

P

ddof , engine , engine_kwargs , numeric_only

ExpandingGroupby API

API

Implemented

Missing parameters

agg

N

aggregate

N

apply

N

corr

N

count()

P

numeric_only

cov

N

kurt()

P

numeric_only

max()

P

engine , engine_kwargs , numeric_only

mean()

P

engine , engine_kwargs , numeric_only

median

N

min()

P

engine , engine_kwargs , numeric_only

quantile()

P

interpolation , numeric_only

rank

N

sem

N

skew()

P

numeric_only

std()

P

ddof , engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs , numeric_only

var()

P

ddof , engine , engine_kwargs , numeric_only

Rolling API

API

Implemented

Missing parameters

agg

N

aggregate

N

apply

N

corr

N

count()

P

numeric_only

cov

N

kurt()

P

numeric_only

max()

P

engine , engine_kwargs , numeric_only

mean()

P

engine , engine_kwargs , numeric_only

median

N

min()

P

engine , engine_kwargs , numeric_only

quantile()

P

interpolation , numeric_only

rank

N

sem

N

skew()

P

numeric_only

std()

P

ddof , engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs , numeric_only

var()

P

ddof , engine , engine_kwargs , numeric_only

RollingGroupby API

API

Implemented

Missing parameters

agg

N

aggregate

N

apply

N

corr

N

count()

P

numeric_only

cov

N

kurt()

P

numeric_only

max()

P

engine , engine_kwargs , numeric_only

mean()

P

engine , engine_kwargs , numeric_only

median

N

min()

P

engine , engine_kwargs , numeric_only

quantile()

P

interpolation , numeric_only

rank

N

sem

N

skew()

P

numeric_only

std()

P

ddof , engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs , numeric_only

var()

P

ddof , engine , engine_kwargs , numeric_only

Window API

API

Implemented

Missing parameters

agg

N

aggregate

N

mean

N

std

N

sum

N

var

N

DataFrameGroupBy API

API

Implemented

Missing parameters

agg()

P

engine , engine_kwargs , func

aggregate()

P

engine , engine_kwargs , func

all()

Y

any()

P

skipna

apply()

Y

bfill()

Y

boxplot

N

corr

N

corrwith

N

count()

Y

cov

N

cumcount()

Y

cummax()

P

axis , numeric_only

cummin()

P

axis , numeric_only

cumprod()

P

axis

cumsum()

P

axis

describe()

P

exclude , include , percentiles

diff()

P

axis

ewm()

Y

expanding()

Y

ffill()

Y

fillna()

P

downcast

filter()

P

dropna

first()

Y

get_group()

P

obj

head()

Y

hist

N

idxmax()

P

axis , numeric_only

idxmin()

P

axis , numeric_only

last()

Y

max()

P

engine , engine_kwargs

mean()

P

engine , engine_kwargs

median()

Y

min()

P

engine , engine_kwargs

ngroup

N

nunique()

Y

ohlc

N

pct_change

N

pipe

N

prod()

Y

quantile()

P

interpolation , numeric_only

rank()

P

axis , na_option , pct

resample

N

rolling()

Y

sample

N

sem()

P

numeric_only

shift()

P

axis , freq

size()

Y

skew()

P

axis , numeric_only , skipna

std()

P

engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs

tail()

Y

take

N

transform()

P

engine , engine_kwargs

value_counts

N

var()

P

engine , engine_kwargs , numeric_only

GroupBy API

API

Implemented

Missing parameters

agg()

P

func

aggregate()

P

func

all()

Y

any()

P

skipna

apply()

Y

bfill()

Y

count()

Y

cumcount()

Y

cummax()

P

axis , numeric_only

cummin()

P

axis , numeric_only

cumprod()

P

axis

cumsum()

P

axis

describe

N

diff()

P

axis

ewm()

Y

expanding()

Y

ffill()

Y

first()

Y

get_group()

P

obj

head()

Y

last()

Y

max()

P

engine , engine_kwargs

mean()

P

engine , engine_kwargs

median()

Y

min()

P

engine , engine_kwargs

ngroup

N

ohlc

N

pct_change

N

pipe

N

prod()

Y

quantile()

P

interpolation , numeric_only

rank()

P

axis , na_option , pct

resample

N

rolling()

Y

sample

N

sem()

P

numeric_only

shift()

P

axis , freq

size()

Y

std()

P

engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs

tail()

Y

var()

P

engine , engine_kwargs , numeric_only

SeriesGroupBy API

API

Implemented

Missing parameters

agg()

P

engine , engine_kwargs , func

aggregate()

P

engine , engine_kwargs , func

all()

Y

any()

P

skipna

apply()

Y

bfill()

Y

corr

N

count()

Y

cov

N

cumcount()

Y

cummax()

P

axis , numeric_only

cummin()

P

axis , numeric_only

cumprod()

P

axis

cumsum()

P

axis

describe

N

diff()

P

axis

ewm()

Y

expanding()

Y

ffill()

Y

fillna()

P

downcast

filter()

P

dropna

first()

Y

get_group()

P

obj

head()

Y

hist

N

idxmax()

P

axis

idxmin()

P

axis

last()

Y

max()

P

engine , engine_kwargs

mean()

P

engine , engine_kwargs

median()

Y

min()

P

engine , engine_kwargs

ngroup

N

nlargest()

P

keep

nsmallest()

P

keep

nunique()

Y

ohlc

N

pct_change

N

pipe

N

prod()

Y

quantile()

P

interpolation , numeric_only

rank()

P

axis , na_option , pct

resample

N

rolling()

Y

sample

N

sem()

P

numeric_only

shift()

P

axis , freq

size()

Y

skew()

P

axis , numeric_only , skipna

std()

P

engine , engine_kwargs , numeric_only

sum()

P

engine , engine_kwargs

tail()

Y

take

N

transform()

P

engine , engine_kwargs

unique()

Y

value_counts()

P

bins , normalize

var()

P

engine , engine_kwargs , numeric_only