webCrawling | JeongKeepsCalm

webCrawling

By Jeong full-stack developer

Posted Oct 18, 2023

Web Crawling with the Jsoup library

  
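import java.io.IOException;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public void webCrawlingTest() {

  String url = "https://news.daum.net/";

  try {
      // Fetch the page over HTTP and parse it into a DOM
      Document doc = Jsoup.connect(url).get();
      // Select every <a> nested inside an element with the class "item_issue"
      Elements els = doc.select(".item_issue a");
      for (Element el : els) {
          String href = el.attr("href");
          // Skip links that have no visible text
          if (!el.text().isEmpty()) {
              System.out.println("title : " + el.text() + " news link : " + href);
          }
      }
  } catch (IOException e) {
      // The request failed (network error, HTTP error status, timeout)
      e.printStackTrace();
  }

}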
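
The loop above prints each href exactly as it appears in the page, so a relative link such as /v/1 would be printed without its host. The following is a minimal sketch, using only the standard library, of how a title/link pair could be filtered and resolved before printing. The class LinkFormatter, its methods, and the example.com sample data are all hypothetical and are not part of the original post or of Jsoup.

```java
import java.net.URI;
import java.util.Optional;

// Hypothetical helper, not part of Jsoup or the original post.
class LinkFormatter {

    // Resolves a possibly relative href against the URL of the page it came from.
    // URI.create throws IllegalArgumentException if the href is not a valid URI.
    static String resolve(String baseUrl, String href) {
        return URI.create(baseUrl).resolve(href.trim()).toString();
    }

    // Builds the same "title : ... news link : ..." line as the loop above,
    // or returns an empty Optional when the title is blank.
    static Optional<String> formatLine(String baseUrl, String title, String href) {
        String trimmed = title.trim();
        if (trimmed.isEmpty()) {
            return Optional.empty();
        }
        return Optional.of("title : " + trimmed + " news link : " + resolve(baseUrl, href));
    }

    public static void main(String[] args) {
        // Made-up sample data standing in for the (text, href) pairs read from a page.
        String base = "https://example.com/news/";
        String[][] sample = {
            {"Example headline", "/v/1"},
            {"   ", "https://example.com/empty"},
            {"Another headline", "v/2"}
        };
        for (String[] pair : sample) {
            formatLine(base, pair[0], pair[1]).ifPresent(System.out::println);
        }
        // prints:
        // title : Example headline news link : https://example.com/v/1
        // title : Another headline news link : https://example.com/news/v/2
    }
}
```

When the Document was fetched with Jsoup.connect(...).get(), Jsoup can do the same resolution itself: el.absUrl("href") returns the absolute form of the link, so a separate helper is only needed when the pairs come from somewhere else.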

Java, Data Collection

