Parse from Stream
Jsoup is an open-source Java library for parsing and manipulating HTML.
Get the latest binaries from http://jsoup.org/.
This example requires jsoup-1.6.1.jar on the classpath.
The following example parses HTML content from an input stream. The HTML input (Simple2.html in the code) is shown first, followed by the Java source.
<title>BE THE CODER</title>
<input type="text" id="user" name="user" value="ABC"></input>
<input type="password" id="password" name="password" value="xyz"></input>
<input type="hidden" id="secret" name="secret"></input>
package com.bethecoder.tutorials.jsoup.tests;

import java.io.IOException;
import java.io.InputStream;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

public class ParseFromStreamTest {

  /**
   * @param args
   * @throws IOException
   */
  public static void main(String[] args) throws IOException {
    // Load Simple2.html from the classpath as an input stream
    InputStream ins = ParseFromStreamTest.class
        .getClassLoader().getResourceAsStream("Simple2.html");

    // Parse the stream as UTF-8; "btc.com" is the base URI used to resolve relative links
    Document doc = Jsoup.parse(ins, "UTF-8", "btc.com");

    // Print the HTML inside the <body> element
    System.out.println(doc.body().html());
  }
}
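The example above passes the stream straight to the parser. Two details are easy to miss: getResourceAsStream returns null when the resource is not on the classpath, and the charset name ("UTF-8" above) decides how the raw bytes become text. The following is a minimal sketch of both points using only the JDK. The class ResourceStreams and its methods openResource and readAll are hypothetical helper names chosen for illustration; they are not part of jsoup.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class for illustration; not part of jsoup.
public class ResourceStreams {

  /** Opens a classpath resource and fails fast instead of returning null. */
  public static InputStream openResource(String name) throws FileNotFoundException {
    InputStream in = ResourceStreams.class.getClassLoader().getResourceAsStream(name);
    if (in == null) {
      throw new FileNotFoundException("Classpath resource not found: " + name);
    }
    return in;
  }

  /** Reads the whole stream and decodes the bytes with the named charset. */
  public static String readAll(InputStream in, String charsetName) throws IOException {
    ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    byte[] chunk = new byte[4096];
    int n;
    while ((n = in.read(chunk)) != -1) {
      buffer.write(chunk, 0, n);
    }
    return buffer.toString(charsetName);
  }

  public static void main(String[] args) throws IOException {
    // An in-memory stream stands in for a real file here.
    byte[] bytes = "<input id=\"user\" value=\"ABC\">".getBytes(StandardCharsets.UTF_8);
    // try-with-resources closes the stream once reading is done.
    try (InputStream in = new ByteArrayInputStream(bytes)) {
      System.out.println(readAll(in, "UTF-8"));
    }
  }
}
```

In the jsoup example, a stream obtained through a helper like openResource could be passed to Jsoup.parse in place of ins, so a missing Simple2.html is reported with a clear message rather than surfacing later as a null stream.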
It gives the following output:
<input type="text" id="user" name="user" value="ABC" />
<input type="password" id="password" name="password" value="xyz" />
<input type="hidden" id="secret" name="secret" />