Pandas入门到精通（一）

2023-06-24 06:07:12

关于Pandas是干嘛的，功能有多强大我就不多说看了，我直接上干货。

1、导入pandas包

import pandas as pd

2、创建一个Series序列

pd.Series([1, 3, 5, np.nan, np.nan, 6, 8])
print(s)
0    1.0
1    3.0
2    5.0
3    NaN
4    6.0
5    8.0
dtype: float64

3、根据日期创建一个序列（DatetimeIndex）

Pandas入门到精通（一）

periods 表示的时期

freq=“D” 表示的是以 Day 增加也就是天数目，所以一直是日在增加,当然也可以改成“M”，具体可以参考

freq参数可选项目

还有一些参数，比如开始、结束、时区等相关参数。

4 、设置index的Series

s = pd.Series(np.random.randn(5), index=["a", "b", "c", "d", "e"])
a    0.114858
b    0.872704
c    0.284864
d    0.884343
e    0.249049
dtype: float64

5、从字典里生成Series

d = {"a": 0.0, "b": 1.0, "c": 2.0}
pd.Series(d)

6、Series切片

data2 = pd.Series(np.random.randint(1, 5, 5), index=["a", "b", "c", "d", "e"], dtype="uint32")
    print(data2)
    print("-" * 32)
    print("median", data2.median()) # 取平均数
    print("-" * 32)
    print(data2[:2])  # 取前两个
    print("-" * 32)
    print(data2[data2 > data2.median()])  # 取大于平均数的
    print(data2[[4, 3, 1]])  # 取列表中的索引为 4、3、1的数
    print(np.exp(data2)) # np.exp()函数是求e x e^{x}e
a    1
b    3
c    2
d    4
e    2
dtype: uint32
--------------------------------
median 2.0
--------------------------------
a    1
b    3
dtype: uint32
--------------------------------
b    3
d    4
dtype: uint32
e    2
d    4
b    3
dtype: uint32
a     2.718282
b    20.085537
c     7.389056
d    54.598150
e     7.389056
dtype: float64

Process finished with exit code 0

6、Series类似字典的操作

data2 = pd.Series([1, 2, 3, 4, 5], index=["a", "b", "c", "d", "e"], dtype="uint32")
    print(data2['a']) # 取出索引是 a 的数据
    print('b' in data2) # 判断是否有 b 这个索引
    print(data2.get("d"))

7、Series类似 narray的操作

data2 = pd.Series([1, 2, 3, 4, 5], index=["a", "b", "c", "d", "e"], dtype="uint32")
    print(data2+data2) 
    print(2*data2) 
    print(np.exp(data2))

8、Series 设置name和修改

data2 = pd.Series([1, 2, 3, 4, 5], index=["a", "b", "c", "d", "e"], dtype="uint32")
    data2.name = "hongbiao"
    print(data2.name)
    data = data2.rename("zhangsan")
    # 注意这里rename之后就是另外一个对象了
    print(data.name)

Pandas入门到精通（一）

1、导入pandas包

2、创建一个Series序列

3、根据日期创建一个序列（DatetimeIndex）

4 、设置index的Series

5、从字典里生成Series

6、Series切片

6、Series类似字典的操作

7、Series类似 narray的操作

8、Series 设置name和修改

好了，上面就是Series的一些基本知识，当然还有Series的一些索引，我这里就不多说了，后面在索引的时候再说

继续阅读

2021-2025年中国运动疗法（KT）带行业市场供需与战略研究报告

cs231n斯坦福基于卷积神经网络的CV学习笔记（一）KNN和线性分类器/分类器损失/反向传播一，KNN图像分类算法二，线性分类器三，线性分类器损失四，反向传播五，神经网络

Small tricks

libsvm for python 安装

2021年危险化学品经营单位安全管理人员考试题库及危险化学品经营单位安全管理人员考试技巧

学习软件测试基础测试第七天

Zeppelin 配置访问 REST APIApache Zeppelin Configuration REST API

【Torch】最简洁logging使用指南

27. Remove Element(列表)题目代码

无人机--飞控科普

Cloud Studio初体验

使用 ctypes 进行 Python 和 C 的混合编程

【python】【数据处理】画多维数据分布图

【python】netconf协议对接管理设备

「Python 网络自动化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 网络设备

在python中创建excel并写入