Change language

Newspaper: Article scraping and curation (Python)

| |

Installation:

 pip install newspaper 

The newspaper supports the following languages:

  input code   full name  ar Arabic da Danish de German el Greek en English it Italian zh Chinese ......... and many more 

Some useful functions

Create a copy of the article

 article_name = Article (url, language = "language code according to newspaper") 

Download article

 article_name.download () 

Parse article

 article_name.parse () 

Apply nlp (natural language processing) to article

 article_name.nlp () 

Extract text articles

 article_name.text 

Extract article title

 article_name.title 

Extract article summary

 article_name. summary 

Extract article keywords

 article_name.keywords 

 from newspaper import Article

 
# New article from TOI

url = " http:// timesofindia.indiatimes. com / world / china / chinese-expert-warns-of-troops-entering-kashmir / articleshow / 59516912.cms "

  
# For newspapers in different languages, see the table above

toi_article = Article (url, language = "en" ) # en for English

 
# To download the article
toi_ar ticle.download ()

 
# To parse the article
toi_article.parse ()

 
# Perform natural language processing ie..nlp
toi_article.nlp ()

  
# Extract title

print ( "Article’s Title:" )

print (toi_article.title)

print ( "n" )

 
# Extract text

print ( "Article’s Text:" )

print (toi_article.text)

print ( " n " )

  
# Extract summary

print ( " Article’s Summary: " )

print (toi_article.summary)

print ( "n" )

  
# To extract keywords

print ( "Article’s Keywords:" )

print (toi_article.keywords)

Output:

 Article’s Title: India China News: Chinese expert warns of troops entering Kashmir Article’s Text: BEIJING: A Chinese expert has argued that his country’s troops would be entitled to enter the Indian side of Kashmir by extending the logic that has permitted Indian troops to enter an area which is disputed by China and Bhutan This is one of the several arguments made by the scholar in an attempt to blame India for. India has responded to efforts by China to build a road in the Doklam area, which falls next to the trijunction connecting Sikkim with Tibet and Bhutan and "Even if India were requested to defend Bhutan’s territory, this could only be limited to its established territory, not the disputed area, "Long Xingchun, director of the Center for Indian Studies at China West Normal University said in an article. "Otherwise, under India’s logic, if the Pakistani government requests, a third country’s army can enter the area disputed by India and Pakistan, including India-controlled Kashmir". China is not just interfering, it is building roads and other infrastructure projects right inside Pakistan-Occupied Kashmir (PoK), which is claimed by both India and Pakistan. This is one of the facts that the article did not mention the scholar, through his article in the Beijing-based Global Times, suggested that Beijing can internationalize the Doklam controversy without worrying about western countries supporting India because the West has a lot of business to do with China. "China can show the region and the international community or even the UN Security Council its evidence to illustrate China’s position," Long said. At the same time, he complained that "Western governments and media kept silent, ignoring India’s hegemony over the small countries of South Asia" when India imposed a blockade on the flow of goods to Nepal in 2015. Recent actions by US president Donald Trump, which include selling arms to Taiwan and pressuring China on the North Korean issue, shows that the West is not necessarily cowered down by China’s business capabilities.He reiterated the government’s stated line that Doklam belongs to China, and that Indian troops had entered the area under the guise of helping Bhutan protect its territory. "For a long time, India has been talking about international equality and non-interference in the internal affairs of others, but it has pursued hegemonic diplomacy in South Asia, seriously violating the UN Charter and undermining the basic norms of international relations, "he said. Interestingly, Chinese scholars are worrying about India interfering in Bhutan’s" sovereignty and national inter ests "even though it is Chinese troops who have entered the Doklam area claimed by it." Indians have migrated in large numbers to Nepal and Bhutan, interfering with Nepal’s internal affairs. The first challenge for Nepal and Bhutan is to avoid becoming a state of India, like Sikkim, "he said. Article’s Summary: sending its troops to the disputed Doklam area + puts Indian territory at risk + BEIJING: A Chinese expert has argued that his country’s troops would be entitled to enter the Indian side of Kashmir by extending the logic that has permitted Indian troops to enter an area which is disputed by China and Bhutan This is one of the several arguments made by the scholar in an attempt to blame India for. "Otherwise, under India’s logic, if the Pakistani government requests, a third country’s army can enter the area disputed by India and Pakistan, including India-controlled Kashmir". China is not just interfering, it is building roads and other infrastructure projects right inside Pakistan-Occupied Kashmir (PoK), which is claimed by both India and Pakistan. "China can show the region and the international community or even the UN Security Council its evidence to illustrate Chi na’s position, "Long said. "Indians have migrated in large numbers to Nepal and Bhutan, interfering with Nepal’s internal affairs. The first challenge for Nepal and Bhutan is to avoid becoming a state of India, like Sikkim," he said. Article’s Keywords: [’troops’,’ india’, ’china’,’ territory’, ’west’,’ disputed’, ’expert’,’ indian’, ’bhutan’,’ kashmir’, ’chinese’, ’entering’,’ doklam’, ’area’,’ warns’] 

Link: Python newspaper package on GitHub

This article is courtesy of Pratik Chhajer . If you are as Python.Engineering and would like to contribute, you can also write an article using contribute.python.engineering or by posting an article contribute @ python.engineering. See my article appearing on the Python.Engineering homepage and help other geeks.

Please post comments if you find anything wrong or if you’d like to share more information on the topic discussed above.

Shop

Learn programming in R: courses

$

Best Python online courses for 2022

$

Best laptop for Fortnite

$

Best laptop for Excel

$

Best laptop for Solidworks

$

Best laptop for Roblox

$

Best computer for crypto mining

$

Best laptop for Sims 4

$

Latest questions

NUMPYNUMPY

Common xlabel/ylabel for matplotlib subplots

12 answers

NUMPYNUMPY

How to specify multiple return types using type-hints

12 answers

NUMPYNUMPY

Why do I get "Pickle - EOFError: Ran out of input" reading an empty file?

12 answers

NUMPYNUMPY

Flake8: Ignore specific warning for entire file

12 answers

NUMPYNUMPY

glob exclude pattern

12 answers

NUMPYNUMPY

How to avoid HTTP error 429 (Too Many Requests) python

12 answers

NUMPYNUMPY

Python CSV error: line contains NULL byte

12 answers

NUMPYNUMPY

csv.Error: iterator should return strings, not bytes

12 answers

News


Wiki

Python | How to copy data from one Excel sheet to another

Common xlabel/ylabel for matplotlib subplots

Check if one list is a subset of another in Python

sin

How to specify multiple return types using type-hints

exp

Printing words vertically in Python

exp

Python Extract words from a given string

Cyclic redundancy check in Python

Finding mean, median, mode in Python without libraries

cos

Python add suffix / add prefix to strings in a list

Why do I get "Pickle - EOFError: Ran out of input" reading an empty file?

Python - Move item to the end of the list

Python - Print list vertically