Get all <thspan> contents in Python Selenium - Html

fToxicw5916
February 20, 2023
174 views
0 votes
2 Answers

Say that I have a piece of HTML code that looks like this:

<html>
    <body>
        <thspan class="sentence">He</thspan>
        <thspan class="sentence">llo</thspan>
    </body>
</html>

And I wanted to get the content of both and connect them into a string in Python Selenium.

My current code looks like this:

from selenium import webdriver
from selenium.webdriver.common.by import By

browser = webdriver.Chrome()

thspans = browser.find_elements(By.CLASS_NAME, "sentence")
context = ""
for thspan in thspans:
    context.join(thspan.text)

The code can run without any problem, but the context variable doesn’t contain anything. How can I get the content of both and connect them into a string in Python Selenium?

Answers

Chosen as BEST ANSWER
- fToxicw5916
- February 20, 2023 at 9:45 am
- 0 votes
0
context += thspan.text instead of using context.join(thspan.text) just like @Rajagopalan said

(Edit)

- ElielBerra
- February 20, 2023 at 1:32 pm
- 0 votes
0
You were not redirecting the browser to the page you actually want to scrape the data from. And you were misusing the .join method. Here is a code that will work for you:
```
from selenium import webdriver
from selenium.webdriver.common.by import By

browser = webdriver.Chrome()
# Put the absolute path to your html file if you are working locally, or
# the URL of the domain you want to scrap
browser.get('file:///your/absolute/path/to/the/html/code/index.html')

thspans = browser.find_elements(By.CLASS_NAME, "sentence")
context = ''
print('thspans', thspans, end='nn')
for thspan in thspans:
    context += thspan.text
print(context)
```
Login or Signup to reply.

Please signup or login to give your own answer.

Click here to cancel reply.

Get all <thspan> contents in Python Selenium – Html

Answers