The SimpleGraph HTML module makes HTML files accessible via HTML Cleaner parser see HTML Cleaner. The initial issue is 6
Extract links from the agile manifesto
HtmlSystem hs = HtmlSystem.forUrl("http://agilemanifesto.org/");
...
GraphTraversal<Vertex, Vertex> links = hs.g().V().hasLabel("a")