Extracting a string between two characters using Python [duplicate]

2024/9/22 12:49:12

I need a Python regex to give me all the strings between ~ and ^ from a string like this:

~~~~ AAA ^ BBB ^ CCC > DDD ^ 

I've tried this:

import re
target = ' ~~~~ AAA ^ BBB ^ CCC > DDD ^  '
matchObj = re.findall(r'~(.*?)\^', target)
print(matchObj)

But the result is:

['~~~ AAA ']

What I expect is:

['AAA', 'BBB', 'CCC', 'DDD']

or

['^AAA', '^BBB', '^CCC', 'DDD']

I want to do this because I am trying to extract text from an HTML page like this:

 <td class="cell-1"><div><span class="value-frame">&nbsp;~~~~ ABC ^ DEF ^ HGK > LMN ^</span></div>
</td>
Answer

As long as you parse the HTML correctly with BeautifulSoup and are left with just the contents of the span, you should be able to use this regex, which matches each run of exactly three word characters:

import re
target = ' ~~~~ AAA ^ BBB ^ CCC > DDD ^  '
matchObj = re.findall(r'(\w{3})', target)
print(matchObj)

Outputs:

['AAA', 'BBB', 'CCC', 'DDD']
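
Note that `\w{3}` only works because every token in the sample happens to be three characters long. If the real data has tokens of other lengths, splitting on the delimiters may be more robust. The following is a minimal sketch, not part of the original answer: the helper name `extract_tokens` is hypothetical, and treating `>` as a delimiter alongside `~` and `^` is an assumption inferred from the expected output in the question.

```python
import re

def extract_tokens(text):
    """Return the non-empty pieces of text separated by '~', '^' or '>'."""
    # Split on runs of delimiter characters. Inside a character class,
    # '^' is literal as long as it is not the first character.
    pieces = re.split(r'[~^>]+', text)
    # Trim surrounding whitespace and drop pieces that end up empty.
    return [p.strip() for p in pieces if p.strip()]

target = ' ~~~~ AAA ^ BBB ^ CCC > DDD ^  '
print(extract_tokens(target))
```

For the sample string this prints the same list as above, and it should also keep tokens that are shorter, longer, or contain internal spaces.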
