DataFrame的apply()、applymap()、map()方法[通俗易懂]

全栈程序员-站长 • 2022年5月17日上午7:00 • 未分类 • 阅读 30

DataFrame的apply()、applymap()、map()方法[通俗易懂]对DataFrame对象中的某些行或列，或者对DataFrame对象中的所有元素进行某种运算或操作，我们无需利用低效笨拙的循环，DataFrame给我们分别提供了相应的直接而简单的方法，apply()和applymap()。其中apply()方法是针对某些行或列进行操作的，而applymap()方法则是针对所有元素进行操作的。1map()方法Themapmethod…

大家好，又见面了，我是你们的朋友全栈君。

对DataFrame对象中的某些行或列，或者对DataFrame对象中的所有元素进行某种运算或操作，我们无需利用低效笨拙的循环，DataFrame给我们分别提供了相应的直接而简单的方法，apply()和applymap()。其中apply()方法是针对某些行或列进行操作的，而applymap()方法则是针对所有元素进行操作的。

1 map()方法

The map method works on series, so in our case, we will use it to transform a column of our DataFrame, which remember is just a pandas Series. Suppose that we decide that the class names are a bit long for our taste and we would like to code them using our special threeletter coding system. We’ll use the map method with a Python dictionary as the argument toaccomplish this. We’ll pass in a replacement for each of the unique iris types:

df[‘class’] = df[‘class’].map({‘Iris-setosa’: ‘SET’, ‘Iris-virginica’:’VIR’, ‘Iris-versicolor’: ‘VER’})
df

DataFrame的apply()、applymap()、map()方法[通俗易懂]

2 Apply（）方法

The apply method allows us to work with both DataFrames and Series. We’ll start with an example that would work equally well with map, then we’ll move on to examples that would work only with apply.

Using the iris DataFrame, let’s make a new column based on the petal width. We previously saw that the mean for the petal width was 1.3. Let’s now create a new column in our DataFrame, wide petal, that contains binary values based on the value in the petal width column. If the petal width is equal to or wider than the median, we will code it with a 1, and if it is less than the median, we will code it 0. We’ll do this using the apply method on the petal width column:

df[‘wide petal’] = df[‘petal width’].apply(lambda v: 1 if v >= 1.3 else 0)
df

DataFrame的apply()、applymap()、map()方法[通俗易懂]

df[‘petal area’] = df.apply(lambda r: r[‘petal length’] * r[‘petal width’],axis=1)
df

DataFrame的apply()、applymap()、map()方法[通俗易懂]

3 Applymap（）方法

We’ve looked at manipulating columns and explained how to work with rows, but suppose that you’d like to perform a function across all data cells in your DataFrame; this is where applymap is the right tool. Let’s take a look at an example:

df.applymap(lambda v: np.log(v) if isinstance(v, float) else v)

DataFrame的apply()、applymap()、map()方法[通俗易懂]

4 Groupby方法

df.groupby(‘class’).mean()

df.groupby(‘petalwidth’)[‘class’].unique().to_frame()

df.groupby(‘petalwidth’)[‘class’].unique().to_frame()

DataFrame的apply()、applymap()、map()方法[通俗易懂]

df.groupby(‘petal width’)[‘class’].unique().to_frame()

df.groupby(‘class’).describe()

df.groupby(‘class’)[‘petal width’].agg({‘delta’: lambda x: x.max() – x.min(), ‘max’: np.max, ‘min’: np.min})

简单来说，apply()方法可以作用于DataFrame 还有Series，作用于一行或者一列时，我们不妨可以采用，因为可以通过设置axis=0/1 来把握，demo如下：

DataFrame的apply()、applymap()、map()方法[通俗易懂]

applymap() 作用于每一个元素

DataFrame的apply()、applymap()、map()方法[通俗易懂]

map可以作用于Series每一个元素的

DataFrame的apply()、applymap()、map()方法[通俗易懂]

总的来说，map()、aply()、applymap()方法是一种对series、dataframe极其方便的应用与映射函数。

最后，非常重要的一点，这些映射函数，里面都是可以放入自定义函数的。

tips.head()

Out[34]:

	total_bill	tip	smoker	day	time	size	tip_pct
0	16.99	1.01	No	Sun	Dinner	2	0.059447
1	10.34	1.66	No	Sun	Dinner	3	0.160542
2	21.01	3.50	No	Sun	Dinner	3	0.166587
3	23.68	3.31	No	Sun	Dinner	2	0.139780
4	24.59	3.61	No	Sun	Dinner	4	0.146808

def top(df,n=5,column=’tip_pct’):
return df.sort_values(by=column)[-n:]

tips.groupby(‘smoker’).apply(top)

Out[38]:

		total_bill	tip	smoker	day	time	size	tip_pct
smoker
No	88	24.71	5.85	No	Thur	Lunch	2	0.236746
	185	20.69	5.00	No	Sun	Dinner	5	0.241663
	51	10.29	2.60	No	Sun	Dinner	2	0.252672
	149	7.51	2.00	No	Thur	Lunch	2	0.266312
	232	11.61	3.39	No	Sat	Dinner	2	0.291990
Yes	109	14.31	4.00	Yes	Sat	Dinner	2	0.279525
	183	23.17	6.50	Yes	Sun	Dinner	4	0.280535
	67	3.07	1.00	Yes	Sat	Dinner	1	0.325733
	178	9.60	4.00	Yes	Sun	Dinner	2	0.416667
	172	7.25	5.15	Yes	Sun	Dinner	2	0.710345

版权声明：本文内容由互联网用户自发贡献，该文观点仅代表作者本人。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容，请联系我们举报，一经查实，本站将立刻删除。

发布者：全栈程序员-站长，转载请注明出处：https://javaforall.net/145590.html原文链接：https://javaforall.net

赞 (0)

全栈程序员-站长

0 0

ElasticSearch 简单的搜索聚合分析

ElasticSearch 简单的搜索聚合分析一、搜索1.DSL搜索全部数据没有任何条件查询名称包含xxx的商品，同时按照价格降序排序分页查询商品from第几条开始size获取几条查询结果中返回的字段设置2、query

全栈程序员-站长
2022年7月2日
34
怎样对ListView的项进行排序

怎样对ListView的项进行排序

全栈程序员-站长
2021年11月24日
51
Code Coverage API plugin 一个新的代码覆盖率插件

Code Coverage API plugin 一个新的代码覆盖率插件

全栈程序员-站长
2021年6月19日
94
模糊PID基本原理及matlab仿真实现（新手！新手！新手！）「建议收藏」

模糊PID基本原理及matlab仿真实现（新手！新手！新手！）「建议收藏」有关模糊pid的相关知识就把自己从刚接触到仿真出结果看到的大部分资料总结一下，以及一些自己的ps 以下未说明的都为转载内容 1.转自 https://blog.csdn.net/weixin_36340979/article/details/79168052在讲解模糊PID前，我们先要了解PID控制器的原理（本文主要介绍模糊PID的运用,对PID控制器的原理不做详细介绍）。P…

全栈程序员-站长
2022年6月4日
32
mongoDB安装和服务配置过程「建议收藏」

mongoDB安装和服务配置过程「建议收藏」mongoDB安装和服务配置过程

全栈程序员-站长
2022年4月24日
36
廖雪峰git学习资料-涂改笔记

廖雪峰git学习资料-涂改笔记注意：本文章是看廖雪峰官网资料整理而来原地址如下：http://www.liaoxuefeng.com/附件为git常用命令前言：注意的问题如果是首次提交会第一步：先在本地建立一个一样的仓库，称本地仓库。第二步：在本地进行commit操作将把更新提交到本地仓库；第三步：将服务器端的更新pull到本地仓库进行合并，最后将合并好的本地仓库push到服务…

全栈程序员-站长
2025年9月27日
6

发表回复

关注全栈程序员社区公众号