File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/02/w02-1707_metho.xml
Size: 3,384 bytes
Last Modified: 2025-10-06 14:08:11
<?xml version="1.0" standalone="yes"?> <Paper uid="W02-1707"> <Title>Towards a web-based centre on Swedish Language Technology</Title> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> 3 Other related projects and </SectionTitle> <Paragraph position="0"> standards We are not alone in realizing the benefits with meta-data. In fact, there are quite a lot of buzzwords and hype surrounding XMLdevelopment. These are some projects and standards that we should keep in mind, particularly with respect to data-harvesting and future development. Please note that this is in no way a complete list, due to the novelty of the field.</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 3.1Semantic Web </SectionTitle> <Paragraph position="0"> The Semantic Web is an attempt to give information on the web meaning, in order to facilitate searching, automation, etc (W3C, 2001).</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="sub_section"> <SectionTitle> 3.2Resource Description Framework </SectionTitle> <Paragraph position="0"> The Resource Description Framework, RDF is a language for providing metadata to support the Semantic Web. It can be used for describing resources that can be located via a URI or other identifier (W3C, 2001). RDF is developed side by side with the Dublin Core (below), and both standards may be used in one document.</Paragraph> </Section> </Section> <Section position="5" start_page="0" end_page="0" type="metho"> <SectionTitle> 3.3Dublin Core </SectionTitle> <Paragraph position="0"> The Dublin Core Metadata Element Set (Dublin Core, DC) is, as its name implies, a set of elements for metadata. There are fifteen elements, strictly defined using a set of ten attributes (DCMI, 1999). The elements' main uses are for information or service resources, e.g. bibliographies and card catalogs.</Paragraph> <Paragraph position="1"> 3.4DocBook DocBook is an SGML or XML format for technical documentation. It is intended for authoring, and can be converted to other formats for reading (Harold and Means, 2001).</Paragraph> <Paragraph position="2"> 3.5Open Archives Initiative The Open Archives Initiative, OAI is an experimental initiative for efficient dissemination of content. It uses the Dublin Core. Historically, the main intention of OAI was providing a meta-data language for e-prints, but this has been expanded to related domains as well. There is an OAI protocol for dataharvesting. Version 2.0 of that protocol is scheduled for release in June 2002 (OAI, 2001). 3.6OLAC OLAC, the Open Language Archives Community is a community who develop methods for digital archiving of language resources and a network for housing and accessing such services (OLAC, 2001). They use methods very similar to ours, though aimed at language in general rather than language technology. The Dublin Core forms the basis of their meta-data set. Alpha tests have commenced, and the project has recently been launched in Europe (May 2002).</Paragraph> <Paragraph position="3"> 3.7Web services and SOAP Web services is a way to share application methods over the Internet, by means of some standard interface such as Simple Object Access Protocol, SOAP (W3C, 2000). The methods implemented may for example provide access to a site's data.</Paragraph> </Section> class="xml-element"></Paper>