Serge Abiteboul
Gael de Chalendar
Ioana Manolescu
Emmanuel Pietriga
Bernd Amann
Patrice Buche
Juliette Dibie-Barthelemy
Patrick Giroux
Bruno Grilheres
Benjamin Nguyen | ABSTRACT
We describe the WebContent platform for the management of content from the Web. The platform is based on a service-oriented architecture and Web standards (notably, Web services, XML and RDF). An enterprise service bus (following the JBI speci?cation) and BEPL may be used to orchestrate service invocations. A peer-to-peer architecture may also be used to facilitate cooperation between independent partners as well as provide scaling.
We brie?y describe services that were developed for supporting
the main functions of the platform: acquisition, e.g., Web crawling, semantic enrichment, e.g., concept annotations, high-scale XML storage and querying (in a centralized or P2P architecture) and exploitation (including Web-based interfaces). Ontologies are pervasive in WebContent applications, supporting the description of the harvested and derived information as well as that of applications.
WebContent brings together a large number of groups. The core of the platform is open-source
...
|