
Signature
Henri Sivonen
hsivonen@iki.fi
http://hsivonen.iki.fi/
> * Check if a string is an IRI per RFC 3987.
http://rfc3987.rfc.8x.ca/
An IRI is an internationalized URI, i.e. a URI with an extended character
set.
Very likely you want only some subset of the IRI spec.
You might see what the Java URL and URI classes do. If they parse for
you, you might then use a regex for fine tuning.
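
A rough sketch of that two-step idea, in case it helps. The class name
IriPrecheck, the method looksLikeAbsoluteIri and the scheme regex are all
made up for illustration; only java.net.URI and java.util.regex are real
JDK classes. This is a pre-filter under those assumptions, not an
RFC 3987 validator, since java.net.URI follows its own grammar.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.regex.Pattern;

// Hypothetical helper: parse with java.net.URI first, then apply a regex.
final class IriPrecheck {
    // Placeholder "fine tuning" rule: the scheme grammar that RFC 3986 and
    // RFC 3987 share. Swap in whatever extra restriction you actually need.
    private static final Pattern SCHEME =
            Pattern.compile("[A-Za-z][A-Za-z0-9+.-]*");

    static boolean looksLikeAbsoluteIri(String s) {
        final URI uri;
        try {
            // Step 1: let the JDK parser reject grossly malformed input.
            uri = new URI(s);
        } catch (URISyntaxException e) {
            return false;
        }
        // Step 2: regex check on the parsed pieces; relative references
        // have no scheme and are rejected here.
        String scheme = uri.getScheme();
        return scheme != null && SCHEME.matcher(scheme).matches();
    }

    public static void main(String[] args) {
        System.out.println(looksLikeAbsoluteIri("http://example.org/a?b#c"));
        System.out.println(looksLikeAbsoluteIri("no scheme here"));
    }
}
```

Anything this accepts would still need a real RFC 3987 check if exact
conformance matters.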

Signature
Canadian Mind Products, Roedy Green.
http://mindprod.com Java custom programming, consulting and coaching.
Henri Sivonen - 24 Apr 2006 22:31 GMT
> > * Check if a string is an IRI per RFC 3987.
>
> http://rfc3987.rfc.8x.ca/
> An IRI is an internationalized URI, i.e. a URI with an extended character
> set.
Yes.
> Very likely you want only some subset of the IRI spec.
No, I want to check exactly for IRIs to the extent the spec is exact.
> You might see what the Java URL and URI classes do. If they parse for
> you, you might then use a regex for fine tuning.
The JDK classes won't be anywhere near up to date until Mustang (Java SE 6).
The library I already mentioned uses JFlex, not just regexps.
