skip to Main Content

i search the best regex method for the most functionality.

I search on Google and will extract the Facebook Links. Because Google has no Search API that works 1to1 with the exact Google Results i don’t can use the API.

I send now a normal request to google, extract the html code and will find all Facebook Link without google parameters.

Examples you find on regex debbuger.

I will see only this links if is possible.

Here Example Strings to search:
`

 /url?q=https://www.facebook.com/pageid/about&sa=U&ved=0ahUKEwi27NeDvfTTAhWBfywKHbuDDS4QjBAIHDAB&usg=AFQjCNH7T2JEP5DzGpiiwT_pMt2oGJ10ow

 /url?q=https://www.facebook.com/pageid/%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

 /url?q=https://www.facebook.com/pageid%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

/url?q=https://www.facebook.com/name-name-585606818284844/%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ

/url?q=https://www.facebook.com/name-name-585606818284844%3Fpnref%3Dlhc&sa=U&ved=0ahUKEwiWv8S6vfTTAhUEBiwKHW04AH8Q_BcIyQQoATBu&usg=AFQjCNEZIUb1yqqYtzjPfDEVi4GPHDY5FQ`

Thats my Regex this works but not for all options. Regex Debugger:

https://regex101.com/r/LcYz8c/8

2

Answers


  1. Try this:

    q=(https://www.facebook.com.*?)&
    

    https://regex101.com/r/LcYz8c/11

    Login or Signup to reply.
  2. Something like:

    “q=(https?://.*?facebook.com/)derName-/”

     "q=(https?://.*?facebook.com/)derName(?:%[^%]*%..|[-/])?([^&]‌​+)"
    

    might be what you are looking for. From what I see in your example, it looks like you want:
    everything from the http up to the first / after the domain. Then skip the derName, and then grab everything up to the next &. So this is going to use 2 capture groups. Hope that helps!

    Login or Signup to reply.
Please signup or login to give your own answer.
Back To Top
Search