Web scraping with Python: how to get the text

Question

I'm trying to get the text from a website but can't find a way to do it. How should I write it?

import requests
from bs4 import BeautifulSoup

link = "https://www.ynet.co.il/articles/0,7340,L-5553905,00.html"
response = requests.get(link)

soup = BeautifulSoup(response.text, 'html.parser')
info = soup.find('div', attrs={'class': 'text14'})
name = info.text.strip()
print(name)

This is how it looks:

I'm getting None every time.

Your screenshot shows the DOM, while BeautifulSoup operates on the page source. They can differ. – Klaus D., Jul 20 '19 at 07:18
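
To illustrate the comment above, here is a minimal sketch of why the lookup can return None. Both HTML strings and the helper name find_text_or_none are made up for the example; they are not taken from the ynet page. The first string stands in for the raw source that requests downloads, the second for the DOM after the browser has run the page's scripts.

```python
from bs4 import BeautifulSoup


def find_text_or_none(html, tag, class_name):
    """Return the stripped text of the first matching element, or None if absent."""
    soup = BeautifulSoup(html, "html.parser")
    element = soup.find(tag, class_=class_name)
    # Guard against None instead of calling .text on a missing element.
    return element.get_text(strip=True) if element is not None else None


# Hypothetical raw source: the div does not exist yet, a script would create it.
raw_source = "<html><body><script>/* builds div.text14 later */</script></body></html>"

# Hypothetical rendered DOM, as the browser's inspector would show it.
rendered_dom = '<html><body><div class="text14">Article text</div></body></html>'

print(find_text_or_none(raw_source, "div", "text14"))
print(find_text_or_none(rendered_dom, "div", "text14"))
```

If the element is missing from response.text, no selector will find it, and the data has to come from somewhere else in the source.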

johnsnow06 · Accepted Answer · 2019-07-21T14:57:20.930

2

import json
import requests
from bs4 import BeautifulSoup

link = "https://www.ynet.co.il/articles/0,7340,L-5553905,00.html"
response = requests.get(link)
soup = BeautifulSoup(response.text, 'html.parser')
# The article data is embedded as JSON in the first ld+json <script> tag.
info = soup.find_all('script', attrs={'type': 'application/ld+json'})[0].text.strip()
jsonDict = json.loads(info)
print(jsonDict['articleBody'])

The page seems to store all the article data as JSON inside a <script type="application/ld+json"> tag, so parse that tag instead of looking for the div. Try this code.
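
A slightly more defensive variant, as a sketch only: it assumes a page may contain several ld+json blocks, that some may not be valid JSON, and that a block may hold either one object or a list of objects. The function name extract_article_body and the sample HTML are invented for the example.

```python
import json

from bs4 import BeautifulSoup


def extract_article_body(html):
    """Return the first 'articleBody' found in any ld+json script, or None."""
    soup = BeautifulSoup(html, "html.parser")
    for script in soup.find_all("script", attrs={"type": "application/ld+json"}):
        raw = script.string
        if not raw:
            continue  # empty script tag
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip blocks that are not valid JSON
        # A block may be a single object or a list of objects.
        items = data if isinstance(data, list) else [data]
        for item in items:
            if isinstance(item, dict) and "articleBody" in item:
                return item["articleBody"]
    return None


# Made-up page: one broken block followed by one valid block.
sample_html = """
<html><head>
<script type="application/ld+json">{not valid json}</script>
<script type="application/ld+json">
{"@type": "NewsArticle", "articleBody": "Hello world"}
</script>
</head><body></body></html>
"""

print(extract_article_body(sample_html))
```

On a real page you would pass response.text instead of sample_html and check for None before using the result.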

edited Jul 21 '19 at 14:57

answered Jul 20 '19 at 09:06


What about this case: https://www.ynetnews.com/articles/0,7340,L-5554655,00.html? Any idea how to get the text? Neither my approach nor yours works there. – Michael Jul 21 '19 at 11:40

score 1 · Answer 2 · answered Jul 20 '19 at 07:45


The solution is:

info = soup.find('meta', attrs={'property':'og:description'})

It gave me the text I needed. The text is in the tag's content attribute: info['content'].
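
As a rough sketch of the same idea with a None check: the helper name extract_og_description and the sample HTML are made up for illustration. Note that og:description is usually a short summary, so it may not contain the whole article.

```python
from bs4 import BeautifulSoup


def extract_og_description(html):
    """Return the content of the og:description meta tag, or None if missing."""
    soup = BeautifulSoup(html, "html.parser")
    meta = soup.find("meta", attrs={"property": "og:description"})
    if meta is None:
        return None
    # The text lives in the 'content' attribute, not in the tag body.
    return meta.get("content")


# Made-up page head for the example.
sample_html = """
<html><head>
<meta property="og:title" content="Some title">
<meta property="og:description" content="Short summary of the article">
</head><body></body></html>
"""

print(extract_og_description(sample_html))
```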


Michael

