Python libraries and frameworks Pandas

Pandas is a popular Python library for data manipulation and analysis. Pandas provides a set of data structures and tools for working with tabular data, including the `Series` and `DataFrame` objects, and a variety of functions for manipulating, filtering, aggregating, and visualizing data.

Here are some key features and examples of how to use Pandas:

## Data Structures

The core of Pandas is its data structures, including the `Series` and `DataFrame` objects. A `Series` is a one-dimensional array-like object that can hold any data type, while a `DataFrame` is a two-dimensional tabular data structure that can hold multiple `Series` objects.

Here’s an example of how to create and manipulate a Pandas `DataFrame`:

python
import pandas as pd

# Create a Pandas DataFrame
data = {"Name": ["Alice", "Bob", "Charlie", "Dave"],
        "Age": [25, 30, 35, 40],
        "City": ["New York", "Paris", "London", "Tokyo"]}
df = pd.DataFrame(data)

# Perform operations on the DataFrame
mean_age = df["Age"].mean()
df_filtered = df[df["Age"] > 30]

# Print the results
print("Original DataFrame:", df)
print("Mean age:", mean_age)
print("Filtered DataFrame:", df_filtered)

## Data Manipulation

Pandas provides a variety of functions for manipulating and transforming data, including merging and joining data, reshaping and pivoting data, grouping and aggregating data, and handling missing data.

Here’s an example of how to group and aggregate data using Pandas:

python
import pandas as pd

# Create a Pandas DataFrame
data = {"Name": ["Alice", "Bob", "Charlie", "Dave"],
        "Age": [25, 30, 35, 40],
        "City": ["New York", "Paris", "London", "Tokyo"]}
df = pd.DataFrame(data)

# Group and aggregate the data
grouped = df.groupby("City")
mean_age = grouped["Age"].mean()

# Print the results
print("Original DataFrame:", df)
print("Mean age by city:", mean_age)

## Data Visualization

Pandas also provides a set of functions for visualizing data, including line plots, scatter plots, bar plots, and histograms. Pandas uses the `matplotlib` library for visualization.

Here’s an example of how to create a bar plot using Pandas:

python
import pandas as pd
import matplotlib.pyplot as plt

# Create a Pandas DataFrame
data = {"Name": ["Alice", "Bob", "Charlie", "Dave"],
        "Age": [25, 30, 35, 40],
        "City": ["New York", "Paris", "London", "Tokyo"]}
df = pd.DataFrame(data)

# Create a bar plot of the age data
df.plot(kind="bar", x="Name", y="Age")

# Show the plot
plt.show()

Overall, Pandas is a powerful and flexible library for data manipulation and analysis in Python, and is widely used in a variety of fields, including finance, economics, social sciences, and data science.