06 Aug 2016 20:45:50 UTC
- Distribution: URI-Find
- Module version: 20160806
- Source (raw)
- Browse (raw)
- How to Contribute
- Issues (14)
- Testers (6284 / 2 / 193)
- KwaliteeBus factor: 0
- 89.33% Coverage
- License: perl_5
- Perl: v5.8.8
- Activity24 month
- Download (36.5KB)
- MetaCPAN Explorer
- Subscribe to distribution
- This version
- Latest version++ed by:4 non-PAUSE usersMSCHWERN Michael G Schwernand 1 contributors
- Michael G Schwern
Take me over?The maintainer of this distribution is looking for someone to take over! If you're interested then please contact them via email.
URI::Find::Schemeless - Find schemeless URIs in arbitrary text.
require URI::Find::Schemeless; my $finder = URI::Find::Schemeless->new(\&callback); The rest is the same as URI::Find.
URI::Find finds absolute URIs in plain text with some weak heuristics for finding schemeless URIs. This subclass is for finding things which might be URIs in free text. Things like "www.foo.com" and "lifes.a.bitch.if.you.aint.got.net".
The heuristics are such that it hopefully finds a minimum of false positives, but there's no easy way for it know if "COMMAND.COM" refers to a web site or a file.
my $tld_re = $self->top_level_domain_re;
Returns the regex for matching top level DNS domains. The regex shouldn't be anchored, it shouldn't do any capturing matches, and it should make itself ignore case.
Original code by Roderick Schertler <firstname.lastname@example.org>, adapted by Michael G Schwern <email@example.com>.
Currently maintained by Roderick Schertler <firstname.lastname@example.org>.