Python for Data Analysis

Table of Contents

Import Conventions

The Python community has adopted a number of naming conventions for commonly- used modules:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

pandas

The pandas name itself is derived from panel data, an econometrics term for multidi- mensional structured data sets, and Python data analysis itself.

CHAPTER 2 Introductory Examples

1.usa.gov data from bit.ly

MovieLens 1M Data Set

GroupLens Research (http://www.grouplens.org/node/73) provides a number of collec- tions of movie ratings data collected from users of MovieLens in the late 1990s and early 2000s.

US Baby Names 1880-2010

The United States Social Security Administration (SSA) has made available data on the frequency of baby names from 1880 through the present.

As of this writing, the US Social Security Administration makes available data files, one per year, containing the total number of births for each sex/name combination. The raw archive of these files can be obtained here: http://www.ssa.gov/oact/babynames/limits.html

CHAPTER 3 IPython

ss



Author: Shi Shougang

Created: 2015-03-05 Thu 23:21

Emacs 24.3.1 (Org mode 8.2.10)

Validate