Python: Learning Statistics and Machine Learning with the Titanic Dataset ver. 2

Last Updated on November 19, 2021 by shibatau

I. Titanic (1997)

Starring: Leonardo DiCaprio, Kate Winslet, Billy Zane and Kathy Bates
Directed by: James Cameron


Is Rose from The Titanic Still Alive? Was She a Real Person?


II. Dataset from seaborn

# import seaborn for getting sample datasets
import seaborn as sb
# import pandas for data frames
import pandas as pd
# load the dataset
df = sb.load_dataset('titanic')
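
Before any analysis, it usually helps to check what the loaded frame contains. The following is a minimal sketch of one way to do that and is not part of the original scripts. `describe_columns` is a hypothetical helper, and the three-row `sample` frame is made-up illustrative data, used so that the block runs without the download that `sb.load_dataset` performs. With the real data, pass `df` instead of `sample`.

```python
import pandas as pd


def describe_columns(frame: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column with its dtype and number of missing values."""
    return pd.DataFrame({
        "dtype": frame.dtypes.astype(str),
        "missing": frame.isna().sum(),
    })


# Made-up rows for illustration only; not taken from the Titanic data.
sample = pd.DataFrame({
    "survived": [0, 1, 1],
    "age": [22.0, None, 26.0],
    "sex": ["male", "female", "female"],
})

summary = describe_columns(sample)
print(summary)
```

Columns with a large `missing` count are the ones to handle first, for example by imputing or dropping them, before fitting any model.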


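Note: the names below appear to follow the Kaggle version of the data. The copy loaded by seaborn uses lowercase column names (survived, pclass, sex, age, sibsp, parch, fare, embarked) and has no Name, Ticket or Cabin column.

Survived: Survived (1) or died (0)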
Pclass: Passenger’s class
Name: Passenger’s name
Sex: Passenger’s sex
Age: Passenger’s age
SibSp: Number of siblings/spouses aboard
Parch: Number of parents/children aboard
Ticket: Ticket number
Fare: Fare
Cabin: Cabin
Embarked: Port of embarkation
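
Because the survival column is coded 1 or 0, its mean within a group is that group's survival rate. The sketch below illustrates the idea under a few assumptions: `survival_rate` is a hypothetical helper, the column names are the lowercase ones used in seaborn's copy (`survived`, `sex`, `pclass`), and the `sample` rows are made up for illustration rather than taken from the dataset.

```python
import pandas as pd


def survival_rate(frame: pd.DataFrame, by: str) -> pd.Series:
    """Mean of the 0/1 `survived` column within each group of `by`."""
    return frame.groupby(by)["survived"].mean()


# Made-up rows for illustration only; not taken from the Titanic data.
sample = pd.DataFrame({
    "survived": [0, 1, 1, 1],
    "sex": ["male", "female", "female", "male"],
    "pclass": [3, 1, 3, 2],
})

rates = survival_rate(sample, "sex")
print(rates)
```

With the real frame, `survival_rate(df, "sex")` or `survival_rate(df, "pclass")` would give the corresponding rates by sex or by passenger class.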


All the scripts are here, though they are not yet complete. After installing AutoViz, restart the runtime before running the remaining cells.

IV. Machine Learning

To be continued.

About shibatau

I was born and raised in Kyoto. I studied Western philosophy at university and specialized in analytic philosophy, especially Ludwig Wittgenstein, in graduate school. I'm interested in new technology, especially machine learning; I have been learning R for two years and began learning Python last summer. Listening to: Paramore, Sia, Amazarashi and Miyuki Nakajima. Favorite movie I've recently seen: "FREEHELD". Favorite actors and actresses: Anthony Hopkins, Denzel Washington, Ellen Page, Meryl Streep, Mia Wasikowska and Robert De Niro. Favorite books: Fyodor Mikhailovich Dostoyevsky, "The Karamazov Brothers"; Shinran, "Lamentations of Divergences". Favorite phrase: Salvation by Faith. Twitter: @shibatau
