Lecture: Scraping texts using Beautifulsoup ver. 4

Last Updated on July 11, 2022 by shibatau

III is added.

I. Scrape texts

In my class, I have scraped some texts on a Web page using Web Query in Excel but you can also scrape them using Beautifulsoup in Python.

You can see the scripts here:

https://colab.research.google.com/drive/1HBFDeuxtWFnTZXbQybASbrbc8KTCAqkl?usp=sharing

II. Scrape dialogs and translate them

We copy a short English dialog , paste it on Excel and practice speaking in each class. Some students translate them in Japanese using Google Translate. This script would help them a lot.

You can see there scripts here:

https://colab.research.google.com/drive/11Q1ugM50n6XBD5tEJaewTHtXrkBiobVl?usp=sharing

III. Scrape the BBC News headlines and translate them

You can learn the scripts here:

How to get the Daily News using Python

I have scraped the BBC headlines and translate them in the same way as in II.

You can see the scripts here:

https://colab.research.google.com/drive/19pZAFR5DO1JFbTl2NNgJR96xjCpTpTvw?usp=sharing

About shibatau

I was born and grown up in Kyoto. I studied western philosophy at the University and specialized in analytic philosophy, especially Ludwig Wittgenstein at the postgraduate school. I'm interested in new technology, especially machine learning and have been learning R language for two years and began to learn Python last summer. Listening toParamore, Sia, Amazarashi and MIyuki Nakajima. Favorite movies I've recently seen: "FREEHELD". Favorite actors and actresses: Anthony Hopkins, Denzel Washington, Ellen Page, Meryl Streep, Mia Wasikowska and Robert DeNiro. Favorite books: Fyodor Mikhailovich Dostoyevsky, "The Karamazov Brothers", Shinran, "Lamentations of Divergences". Favorite phrase: Salvation by Faith. Twitter: @shibatau

Leave a Reply

Your email address will not be published.

This site uses Akismet to reduce spam. Learn how your comment data is processed.