Python & R: How to write and read Feather and Parquet, Google Colaboratory ver. 2

Last Updated on July 23, 2022 by shibatau

I. What is Feather?

Most of the time you work with CSV (Comma Separated Values) files, a widely used format for data storage. Compared with binary formats, however, CSV files take up more space and are slower to load, so it is worth looking for an alternative. Feather is one such format: it is fast to read and write and uses storage space efficiently, which can also reduce what companies spend on storage services.

What is the Feather File Format In Python?
Feather was first created in the Arrow project as a proof of concept (POC) for fast data frame storage in Python and R.
It is no longer limited to Python and R; you can use it with all major languages.
It is also known as a portable file format for storing data frames.
There are two versions available, Version 1 and Version 2. If a library does not support one of them, you can pass the version argument to select a specific version.

Quoted from https://www.journaldev.com/53105/feather-file-format-in-python

You can learn how to handle Feather files with Python here:

Stop Using CSVs for Storage — This File Format Is 150 Times Faster

II. What is Parquet?

You can learn how to handle Parquet files with Python here:

CSV Files for Storage? No Thanks. There’s a Better Option

III. Writing and reading Feather and Parquet with Python and R on Google Colaboratory

You can see the sample scripts here on Google Colaboratory:

https://colab.research.google.com/drive/1SZi7R6EIvKfOB2Ph2KkL_UMp5WMaJ-eL?usp=sharing

About shibatau

I was born and raised in Kyoto. I studied Western philosophy at university and specialized in analytic philosophy, especially Ludwig Wittgenstein, in graduate school. I'm interested in new technology, especially machine learning; I have been learning R for two years and began to learn Python last summer. Listening to Paramore, Sia, Amazarashi and Miyuki Nakajima. Favorite movies I've recently seen: "FREEHELD". Favorite actors and actresses: Anthony Hopkins, Denzel Washington, Ellen Page, Meryl Streep, Mia Wasikowska and Robert De Niro. Favorite books: Fyodor Mikhailovich Dostoyevsky, "The Karamazov Brothers", Shinran, "Lamentations of Divergences". Favorite phrase: Salvation by Faith. Twitter: @shibatau
