Swoogle

R Karshan
Dec 29, 2020

What is Swoogle?

Swoogle started as a research project of the ebiquity research group at the University of Maryland, Baltimore County (UMBC). It is a search engine for Semantic Web ontologies, documents, terms, and data published on the Web. At its core, it is a crawler-based indexing and retrieval system: it crawls the Web to discover Semantic Web documents (SWDs) written in RDF and OWL, and offers an online repository of them. It provides services to human users through a browser interface and to software agents through RESTful web services.

Objective of Swoogle

  • A growing number of SWDs, both ontologies and instance data, are physically distributed all over the Web.
  • Swoogle aims to be a retrieval system that organizes these documents in a systematic way.
  • Both humans and software agents should be able to search and query this repository easily.

Why do we use Swoogle?

  • To avoid creating new ontologies when a suitable one already exists.
  • To reuse existing ontologies.

Services

  • Search Semantic Web ontologies
  • Search Semantic Web instance data.
  • Search Semantic Web terms, i.e., URIs that have been defined as classes and properties.
  • Provide metadata of Semantic Web documents and support browsing the Semantic Web.
  • Archive different versions of Semantic Web documents

What does Swoogle search?

  • The goal is to find out whether suitable ontologies matching the user's need already exist within the underlying domain.
  • The user inputs a specific term.
  • Swoogle replies with existing ontologies that use the entered term.
  • The user follows a link and checks whether the returned ontology satisfies the need.
  • Users can also query SWDs with constraints on the classes and properties they use.
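
The term lookup described above can be pictured as an inverted index from terms to the ontologies that use them. The following is only a minimal sketch under that assumption, not Swoogle's actual implementation; the function names and the sample ontology URLs are hypothetical.

```python
from collections import defaultdict


def build_term_index(ontologies):
    """Map each lowercased term to the set of ontology URLs that use it."""
    index = defaultdict(set)
    for url, terms in ontologies.items():
        for term in terms:
            index[term.lower()].add(url)
    return index


def search(index, term):
    """Return the ontology URLs that use the term, sorted for stable output."""
    return sorted(index.get(term.lower(), set()))


# Hypothetical sample data: ontology URL -> terms (classes and properties) it uses.
ONTOLOGIES = {
    "http://example.org/people.owl": ["Person", "Organization", "name"],
    "http://example.org/univ.owl": ["Person", "Course", "Student"],
    "http://example.org/geo.owl": ["City", "Country"],
}

index = build_term_index(ONTOLOGIES)
print(search(index, "person"))
```

Here a search for "person" would return the two sample ontologies that define or use that term, and the user could then follow either link to judge whether the ontology fits.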

Swoogle Architecture

  • SWD discovery component — This component has two distinct Web crawlers that discover SWDs distributed all over the Web. The two crawlers can be invoked periodically to keep the information about SWDs up to date.
  • Metadata creation component — This component creates metadata for each SWD.
  • Data analysis component — This component uses the metadata to classify the relationships among the given set of SWDs and then calculates the rank of each SWD.
  • Indexation and retrieval component — Swoogle is, after all, a search engine, so it must index the collected metadata and retrieve matching documents for each query.
  • User interface — This is what the user sees when using the Swoogle search engine.
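
To give a feel for what "calculating the rank of each SWD" in the data analysis component could look like, here is a rough sketch that uses a plain PageRank-style iteration over document-to-document references. This is a stand-in chosen for illustration, not Swoogle's own ranking algorithm; the function name, the damping value, and the sample graph are all assumptions.

```python
def rank_documents(links, damping=0.85, iterations=50):
    """PageRank-style scores for documents in a reference graph.

    `links` maps a document to the list of documents it refers to.
    """
    docs = set(links)
    for targets in links.values():
        docs.update(targets)
    n = len(docs)
    rank = {doc: 1.0 / n for doc in docs}

    for _ in range(iterations):
        new_rank = {doc: (1.0 - damping) / n for doc in docs}
        for doc in docs:
            targets = links.get(doc, [])
            if targets:
                # Split this document's score among the documents it references.
                share = damping * rank[doc] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A document with no outgoing references spreads its score evenly.
                share = damping * rank[doc] / n
                for other in docs:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical graph: two instance documents both reference one ontology.
LINKS = {
    "a.rdf": ["core.owl"],
    "b.rdf": ["core.owl"],
    "core.owl": [],
}

ranks = rank_documents(LINKS)
print(max(ranks, key=ranks.get))
```

In this toy graph the ontology that both instance documents refer to ends up with the highest score, which matches the intuition that widely used ontologies should rank higher.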

Swoogle Crawler

  • The crawler visits the Web to collect SWDs, ignoring all other documents (HTML, PDF, image files).
  • For each SWD discovered, Swoogle extracts metadata from the document and indexes it into an information retrieval system for later searches and queries.
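
The filter-then-extract behaviour above can be sketched with the Python standard library. This sketch assumes that a document counts as an SWD only if it parses as RDF/XML with an `rdf:RDF` root element; other RDF serializations are not handled, and the metadata fields (`node_count`, `namespaces`) are hypothetical choices rather than the fields Swoogle records.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"


def extract_metadata(url, content):
    """Return simple metadata for an RDF/XML document, or None if it is not one."""
    try:
        root = ET.fromstring(content)
    except ET.ParseError:
        return None  # not well-formed XML (for example a PDF or an image)
    if root.tag != f"{{{RDF_NS}}}RDF":
        return None  # well-formed XML, but not RDF/XML (for example XHTML)

    # Collect the namespaces of the top-level nodes as a rough vocabulary summary.
    namespaces = sorted(
        {child.tag[1:].split("}")[0] for child in root if child.tag.startswith("{")}
    )
    return {"url": url, "node_count": len(root), "namespaces": namespaces}


SAMPLE_OWL = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:owl="http://www.w3.org/2002/07/owl#">
  <owl:Class rdf:about="http://example.org/univ#Student"/>
  <owl:Class rdf:about="http://example.org/univ#Course"/>
</rdf:RDF>"""

swd_meta = extract_metadata("http://example.org/univ.owl", SAMPLE_OWL)
html_meta = extract_metadata("http://example.org/", "<html><body>hi</body></html>")
print(swd_meta)
print(html_meta)
```

The HTML page is skipped because its root element is not `rdf:RDF`, while the sample ontology yields a small metadata record that could then be handed to an index.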

How does Swoogle crawl the Semantic Web?

  • Manual submission
  • Google-based meta-crawling
  • Bounded HTML crawling
  • RDF crawling
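
As an illustration of the bounded HTML crawling strategy, the sketch below does a breadth-first walk over HTML pages and stops following links beyond a fixed depth. It assumes that "bounded" means a depth limit and that SWDs can be recognized by a `.rdf` or `.owl` extension; both are simplifications. The in-memory `WEB` dictionary stands in for real HTTP fetching, and all names are hypothetical.

```python
from collections import deque

SWD_EXTENSIONS = (".rdf", ".owl")


def bounded_crawl(seed, get_links, max_depth=2):
    """Breadth-first crawl from `seed`, collecting SWD URLs up to `max_depth` hops away."""
    seen = {seed}
    queue = deque([(seed, 0)])
    found = []
    while queue:
        url, depth = queue.popleft()
        if url.endswith(SWD_EXTENSIONS):
            found.append(url)  # candidate SWD: record it, do not parse it as HTML
            continue
        if depth == max_depth:
            continue  # the bound: do not follow links from pages this deep
        for link in get_links(url):
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return found


# Hypothetical link graph standing in for fetched HTML pages.
WEB = {
    "http://example.org/": [
        "http://example.org/about.html",
        "http://example.org/people.rdf",
    ],
    "http://example.org/about.html": [
        "http://example.org/deep.html",
        "http://example.org/univ.owl",
    ],
    "http://example.org/deep.html": ["http://example.org/far.rdf"],
}

found = bounded_crawl("http://example.org/", lambda u: WEB.get(u, []), max_depth=2)
print(found)
```

With a depth limit of 2, the crawl picks up `people.rdf` and `univ.owl` but never reaches `far.rdf`, because the page linking to it sits at the depth bound and is not expanded.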

Thank you.

R Karshan

Final Year Undergraduate | Software Engineering, University of Kelaniya.