Swoogle

What is Swoogle?

Swoogle started as a research project of the Ebiquity research group at the University of Maryland, Baltimore County. It is a search engine for Semantic Web ontologies, documents, terms, and data published on the Web. Swoogle is a crawler-based indexing and retrieval system: it crawls the Web to discover documents written in RDF and OWL, known as Semantic Web documents (SWDs), and maintains an online repository of them. It serves human users through a browser interface and software agents through RESTful web services.

Objective of Swoogle

  • More and more SWDs, both ontologies and instance data, are physically distributed all over the Web.
  • Swoogle aims to be a retrieval system that organizes these documents in a systematic way.
  • Both humans and software agents should be able to search and query this repository easily.

Why use Swoogle?

  • To avoid creating new ontologies when suitable ones already exist.
  • To reuse existing ontologies.

Services

  • Search Semantic Web ontologies.
  • Search Semantic Web instance data.
  • Search Semantic Web terms, i.e., URIs that have been defined as classes and properties.
  • Provide metadata of Semantic Web documents and support browsing the Semantic Web.
  • Archive different versions of Semantic Web documents.

What does Swoogle search for?

  • Swoogle helps find out whether suitable ontologies matching the user's need already exist in the domain of interest.
  • The user enters a specific term.
  • Swoogle replies with existing ontologies that use that term.
  • The user follows a link to check whether the ontology satisfies the need.
  • Users can also query SWDs with constraints on the classes and properties they use.
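
The term lookup described above can be pictured as an inverted index from terms to the ontologies that use them. The following is only a minimal sketch under stated assumptions: the ontology URLs, the term sets, and the function names `build_term_index` and `search` are hypothetical and are not taken from Swoogle itself.

```python
from collections import defaultdict

# Hypothetical sample data: ontology URL -> terms (classes and properties) it uses.
ONTOLOGIES = {
    "http://example.org/onto/people.owl": {"Person", "name", "knows"},
    "http://example.org/onto/university.owl": {"Person", "Student", "Course", "enrolledIn"},
    "http://example.org/onto/wine.owl": {"Wine", "Winery", "madeFrom"},
}


def build_term_index(ontologies):
    """Map each lowercased term to the set of ontology URLs that use it."""
    index = defaultdict(set)
    for url, terms in ontologies.items():
        for term in terms:
            index[term.lower()].add(url)
    return index


def search(index, *terms):
    """Return the ontologies that use every given term, sorted by URL."""
    if not terms:
        return []
    # One candidate set per term; a term nobody uses yields an empty set.
    matches = [index.get(term.lower(), set()) for term in terms]
    return sorted(set.intersection(*matches))


index = build_term_index(ONTOLOGIES)
print(search(index, "person"))            # single-term search
print(search(index, "Person", "Course"))  # search constrained by two terms
```

Searching for "person" returns both the people and the university ontology, while adding "Course" as a second constraint narrows the result to the university ontology only.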

Swoogle Architecture

  • SWD discovery component: two distinct Web crawlers discover SWDs distributed all over the Web. The crawlers can be invoked periodically to keep the information about SWDs up to date.
  • Metadata creation component: creates metadata for each SWD.
  • Data analysis component: uses the metadata to classify the relationships among the SWDs and then calculates the rank of each SWD.
  • Indexing and retrieval component: Swoogle is, after all, a search engine, so indexing and retrieval are necessary.
  • User interface: what the user sees when using the Swoogle search engine.
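
To give a feel for the data analysis step, here is a small sketch that ranks SWDs from the references between them. This post does not describe how Swoogle actually computes ranks, so the code uses a simplified PageRank-style iteration as a stand-in. The function name `rank_documents`, the damping factor, and the sample links are all assumptions made for illustration.

```python
def rank_documents(links, damping=0.85, iterations=50):
    """Simplified PageRank-style ranking (a stand-in, not Swoogle's own algorithm).

    links maps each document URL to the list of documents it references.
    """
    docs = set(links) | {t for targets in links.values() for t in targets}
    n = len(docs)
    rank = {doc: 1.0 / n for doc in docs}
    for _ in range(iterations):
        new_rank = {doc: (1.0 - damping) / n for doc in docs}
        for doc in docs:
            targets = links.get(doc, [])
            if targets:
                # Split this document's rank among the documents it references.
                share = damping * rank[doc] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A document with no outgoing references spreads its rank evenly.
                share = damping * rank[doc] / n
                for other in docs:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical relationships: two instance documents reference one ontology.
LINKS = {
    "http://example.org/data/a.rdf": ["http://example.org/onto/core.owl"],
    "http://example.org/data/b.rdf": ["http://example.org/onto/core.owl"],
    "http://example.org/onto/core.owl": [],
}

ranks = rank_documents(LINKS)
for url, score in sorted(ranks.items(), key=lambda item: -item[1]):
    print(f"{score:.3f}  {url}")
```

In this toy example the ontology that both instance documents reference ends up with the highest rank, which matches the intuition that widely used ontologies should appear first in search results.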

Swoogle Crawler

  • The crawler visits the Web to collect SWDs, ignoring all other documents (HTML pages, PDFs, image files).
  • For each SWD it discovers, Swoogle extracts metadata from the document and indexes it in an information retrieval system for later searches and queries.
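
The two steps above (keep only SWDs, then extract metadata) might look roughly like the sketch below. It is a simplification under stated assumptions: it only recognizes RDF/XML whose root element is `rdf:RDF`, it counts only top-level class and property definitions, and the function name `extract_metadata` and the metadata fields are hypothetical rather than Swoogle's real schema.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS_NS = "http://www.w3.org/2000/01/rdf-schema#"
OWL_NS = "http://www.w3.org/2002/07/owl#"

CLASS_TAGS = {f"{{{OWL_NS}}}Class", f"{{{RDFS_NS}}}Class"}
PROPERTY_TAGS = {
    f"{{{OWL_NS}}}ObjectProperty",
    f"{{{OWL_NS}}}DatatypeProperty",
    f"{{{RDF_NS}}}Property",
}


def extract_metadata(url, content):
    """Return a metadata dict if content looks like RDF/XML, otherwise None."""
    try:
        root = ET.fromstring(content)
    except ET.ParseError:
        return None  # not well-formed XML, so not an RDF/XML document
    if root.tag != f"{{{RDF_NS}}}RDF":
        return None  # well-formed XML, but not RDF (for example XHTML)
    # Count only the direct children of rdf:RDF (top-level definitions).
    return {
        "url": url,
        "classes": sum(1 for el in root if el.tag in CLASS_TAGS),
        "properties": sum(1 for el in root if el.tag in PROPERTY_TAGS),
        "top_level_nodes": len(root),
    }


# Hypothetical sample documents.
SAMPLE_OWL = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:owl="http://www.w3.org/2002/07/owl#">
  <owl:Class rdf:about="http://example.org/onto#Person"/>
  <owl:Class rdf:about="http://example.org/onto#Student"/>
  <owl:ObjectProperty rdf:about="http://example.org/onto#knows"/>
</rdf:RDF>"""
SAMPLE_HTML = "<html><body><p>Hello</p></body></html>"

print(extract_metadata("http://example.org/onto/people.owl", SAMPLE_OWL))
print(extract_metadata("http://example.org/index.html", SAMPLE_HTML))
```

A real crawler would also need to handle other RDF serializations and fetch documents over HTTP; both are left out here to keep the sketch self-contained.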

How does Swoogle crawl the Semantic Web?

  • Manual submission
  • Google-based meta-crawling
  • Bounded HTML crawling
  • RDF crawling
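
As an illustration of one of these strategies, the sketch below shows a bounded HTML crawl over a small in-memory "web". Two things are assumptions here: "bounded" is interpreted as a limit on link depth from the seed page, and candidate SWDs are recognized by a `.rdf` or `.owl` suffix. The names `bounded_crawl` and `FAKE_WEB` and all URLs are hypothetical.

```python
from collections import deque

# Hypothetical in-memory web: HTML page URL -> links found on that page.
FAKE_WEB = {
    "http://example.org/": [
        "http://example.org/about.html",
        "http://example.org/data/foaf.rdf",
    ],
    "http://example.org/about.html": [
        "http://example.org/deep.html",
        "http://example.org/onto/people.owl",
    ],
    "http://example.org/deep.html": ["http://example.org/data/hidden.rdf"],
}

SWD_SUFFIXES = (".rdf", ".owl")  # assumption: detect candidate SWDs by extension


def bounded_crawl(seed, get_links, max_depth):
    """Breadth-first crawl that only reads HTML pages up to max_depth hops from seed.

    Returns candidate SWD URLs in the order they were discovered.
    """
    seen = {seed}
    found = []
    queue = deque([(seed, 0)])
    while queue:
        url, depth = queue.popleft()
        for link in get_links(url):
            if link in seen:
                continue
            seen.add(link)
            if link.lower().endswith(SWD_SUFFIXES):
                found.append(link)  # candidate SWD: record it, do not parse as HTML
            elif depth + 1 <= max_depth:
                queue.append((link, depth + 1))  # HTML page still within the bound
    return found


def get_links(url):
    """Look up the outgoing links of a page in the fake web."""
    return FAKE_WEB.get(url, [])


print(bounded_crawl("http://example.org/", get_links, max_depth=1))
```

With `max_depth=1` the crawl reads the seed page and `about.html`, so it finds `foaf.rdf` and `people.owl` but never reaches `hidden.rdf`, which is only linked from a page two hops away.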

Thank you.


Final Year Undergraduate | Software Engineering University of Kelaniya.

R Karshan
