beautifulsoup Questions

Html – how to extract the texts after the first h1 Tag?

June 22, 2023
Mostafa Bouzari
2 Answers

i'm trying to write a code to get and clean the text from 100 websites per day. i came across an issue with one website that has More than one h1 tag and when you scroll to the next h1…

VIEW QUESTION

Ubuntu – gathering data from clutch.io : some issues with BS4 while working on colab

June 21, 2023
malaga
2 Answers

update: what bout selenium - support in colab: i have checked this..see below! good day dear experts - well at the moment i am trying to figure out a simple way and method to obtain data from clutch.io note: i…

VIEW QUESTION

Html – Python / BeautifulSoup get an attribute within <option>

June 14, 2023
0range
2 Answers

I am a python / beautifulsoup newbie here. I am trying to get an attribute value within the <option> tag. The HTML snippet is below. Specifically, I am trying to retrieve the value from the first "data-inventory-quantity (in this case,…

VIEW QUESTION

Html – Scraping using BeautifulSoup print an empty output

I'm trying to scrape a website. I want to print all the elements with the following class name, class=product-size-info__main-label The code is the following: from bs4 import BeautifulSoup with open("MadeInItaly.html", "r") as f: doc= BeautifulSoup (f, "html.parser") tags = doc.find_all(class_="product-size-info__main-label")…

VIEW QUESTION

Html – How Can I Pull Specific Links From A Webpage Using Python?

June 2, 2023
Ericander1
2 Answers

I'd like to pull specific links from a webpage using Python. In my example below I'm viewing a form 8-K from the SEC website with several links in it. A link for a press release but also a link to…

VIEW QUESTION

Can Ubuntu scrape a URL address for reviews?

May 31, 2023
Misha
2 Answers

So I need to extract the reviews from the URL of a product on this site, more specifically the username, date, text, and score. However, I have some issues with it because I keep getting an error: failed to retrieve…

VIEW QUESTION

Why is my Google search for “Debian” returning an empty ResultSet?

I've been working on Google Colab developing a script to scrape google search results. It has been working for a long time without any problem but now doesn't. It seems that the code page source its different and the CSS…

VIEW QUESTION

Why does Ubuntu-beautifulsoup give me every output thrice?

May 15, 2023
paulina
2 Answers

I'm trying to write a program that lets me easily scale recipes created using the wordpress recipe maker plugin. I have already been advised to use beautifulsoup instead of parsing HTML with regex, and it does what it's supposed to…

VIEW QUESTION

Html – How to scrape specific element with a certain id in BeautifulSoup?

May 14, 2023
Zinc Cheng
2 Answers

I am trying to scrape the table from baseball reference: https://www.baseball-reference.com/players/b/bondsba01.shtml, and the table I want is the one with id="batting_value", but when I trying to print out what I have scraped, the program returned an empty list instead. Any…

VIEW QUESTION

Can WordPress & BeautifulSoup 4 find the next attribute?

May 13, 2023
Zero
2 Answers

The project: for a list of meta-data of wordpress-plugins: - approx 50 plugins are of interest! but the challenge is: i want to fetch meta-data of all the existing plugins. What i subsequently want to filter out after the fetch…

VIEW QUESTION