dom - Capture PHP links without image links -

$url = 'http://www.test.com/'; $dom = new domdocument; @$dom->loadhtmlfile($url);  $links = $dom->getelementsbytagname('a'); foreach ($links $link) {

i using above script capture links on page, found there duplicate links. on page, there picture linked, followed text link goes same link. there easy way capture text link, not image link?

as saying, might take approach of cleaning dupes in result set. not sure on scraping if link only used image?

you count occurrences.

$url = 'http://www.test.com/'; $dom = new domdocument; @$dom->loadhtmlfile($url);  $links = $dom->getelementsbytagname('a'); $distinctlinks = []; foreach ($links $link) {     $distinctlinks[$link] = (int) $distinctlinks[$link] + 1; }

Search This Blog

Mind Blowing Facts

dom - Capture PHP links without image links -

Comments

Post a Comment

Popular posts from this blog

java - Solr query version issue: Invalid version or the data in not in 'javabin' format -

Hard vs. Soft Water: What's The Difference?

The Ten Most Livable Cities In The World