Difference between revisions of "SA-REST"

From Knoesis wiki
(Changed the top to reflect the latest version)
 
(43 intermediate revisions by 4 users not shown)
<span style="color:red">Note : This is a draft version of SA-REST. The latest (and active) version is the [http://www.w3.org/Submission/SA-REST/ W3C Member Submission] </span>
----
SA-REST is a simple and open microformat for enhancing Web resources with additional semantic information.  
In addition to HTML and XHTML, the SA-REST approach can also be used to enrich Atom, RSS, and arbitrary XML. SA-REST is one of several open microformat standards.
==People==
Collaborators in the SA-REST initiative are listed by their affiliations.  
* '''kno.e.sis center, Wright State University, Dayton, OH'''
** Amit P. Sheth
** Karthik Gomadam
** Ajith Ranabahu
==Introduction==
<!-- Replace this with a need for additional semantic annotation for Web resources. Highlight Web APIs as an example, not the only effect. -->
===Current mashup tools and technologies===
<!-- This section changes as well to talk about limitation in APIs and how SA-REST can address this. Add a blog example.-->
Although mashups fully embrace the idea of customization on the Web, read-write is another story. The complexity of application development in JavaScript makes it hard for average developers to create new mashups or customize existing ones. To solve this problem, several companies are developing mashup creation tools that require little or no programming knowledge. These tools, exemplified by Yahoo! Pipes, IBM's QEDWiki and Google's Mashup Editor, let the user select a number of RESTful Web services or other Web resources and chain them together, piping one service's output into the next service's input while filtering content and making slight format changes.
====Limitations====
One of the main drawbacks of the current state of the art is the lack of support for interoperability, especially data interoperability. Most existing tools limit their support to services internal to the vendor that created them (for example, Google Mashup Editor provides a complete abstraction of the Google Maps service) or to services with standard output types such as RSS or Atom (Yahoo! Pipes), so the interoperability problem is not highlighted.
 
Another drawback of the current approach to mashups is the complexity of development itself. Frameworks like Pipes that do not require any programming have limited capabilities. Google Mashup Editor can do much more than Yahoo! Pipes but has a steeper learning curve. However, creating more complex mashups still requires proficiency in languages such as JavaScript. We identify two aspects of this complexity:
* Visual: Complexity arising out of the need to create intuitive visual elements and handle various events relating to them
* Data: Complexity arising due to heterogeneity in data schemas and formats.
 
The primary objective of SA-REST is to address the ''data complexity issue''.
===SA-REST in a nutshell===
<!-- Change this. We can remove this completely, if we can capture a very good description at the beginning. -->
SA-REST is a microformat for adding meta-data to REST API descriptions (among other resources) in HTML and XHTML. Developers can directly embed meta-data from various models, such as an ontology, a taxonomy or a tag cloud, into their API descriptions. The embedded meta-data can be used to improve search (for example, faceted search for APIs) and data mediation (in conjunction with XML annotation), and to ease the integration of services into mashups.
===Background===
Rather than talking only about SWS (not relevant), we should instead talk about other microformats and the synergy therein with SA-REST.  
 
Researchers in the area of Semantic Web Services have proposed various specifications, the most prominent of which are:
* OWL-S
* WSDL-S
* WSMO
In 2005, the W3C initiated a charter to create a standard for adding semantics to WSDL descriptions. The WSDL-S specification (submitted by Services Research Lab at kno.e.sis from LSDIS Lab in GA along with IBM) was taken as the primary input for the charter. This led to the standardization of SAWSDL (Semantic Annotations for WSDL and XML Schema). SAWSDL has had a significant impact on the evolution of SA-REST. The adoption of a microformat-based approach is a key difference between the SAWSDL and SA-REST frameworks. Nevertheless, the principles of schema annotation, lifting and lowering can be used directly from SAWSDL for XML data objects in the RESTful environment.
==Usage Scenarios==
<!-- Need to write this along the style of microformats.org wiki. -->
===Faceted Search===
The number of available APIs is growing fast. In April 2008, we found that about 700 APIs had been added to ProgrammableWeb; by September, that number was over 900. Currently, general purpose search engines like Google are largely used to find these APIs. However, they index and rank API documents like any other document. As a result, a search for APIs (even with a specific query like "Maps API") returns API resources scattered all over the result set. Web API directories like ProgrammableWeb do present a more domain-specific solution. However, they largely rely on user tags for classification and searching.
 
Adding meta-data that captures the various facets of APIs (their functionality, the message types they support, client-side bindings, protocol) can allow for better searching. The results of one such framework, [http://knoesis.wright.edu/research/srl/projects/apihut/ APIHut], are presented in [http://knoesis.wright.edu/library/download/GRNVS-facetedAPISearch-ICWS08.pdf Faceted Search for APIs]. We also present our initial evaluation of precision and recall metrics. SA-REST can significantly improve faceted search. Using the known techniques of GRDDL and XSLT, one can extract RDF representations of APIs, which can then be indexed and searched.
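
As a rough illustration of why facet meta-data helps searching, the following Python sketch indexes API descriptions by (facet, value) pairs and answers a query by intersecting the matching sets. The facet names, the API names and the <code>build_index</code> and <code>search</code> helpers are hypothetical; they are not part of APIHut or SA-REST. In practice the facet values would come from the RDF extracted from annotated pages.

```python
from collections import defaultdict

def build_index(descriptions):
    """Map every (facet, value) pair to the set of APIs that carry it."""
    index = defaultdict(set)
    for api, facets in descriptions.items():
        for facet, values in facets.items():
            for value in values:
                index[(facet, value)].add(api)
    return index

def search(index, wanted):
    """Return the sorted names of APIs that match every requested facet value."""
    matches = None
    for facet, value in wanted.items():
        hits = index.get((facet, value), set())
        matches = set(hits) if matches is None else matches & hits
    return sorted(matches or [])

# Hypothetical facet data, invented for this sketch.
descriptions = {
    "example-maps-api": {"domain": ["maps"], "format": ["json", "xml"]},
    "example-photo-api": {"domain": ["photos"], "format": ["json"]},
    "example-geocoder": {"domain": ["maps"], "format": ["xml"]},
}
index = build_index(descriptions)
print(search(index, {"domain": "maps", "format": "xml"}))
```

Unlike free-form user tags, each facet here is a separate dimension, so a query can combine constraints (domain and message format) without relying on keyword overlap.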
===Data Mediation and Mediatability===
The importance of enabling easier approaches to data mediation is well understood. In the context of mashups this is even more important, largely because developers are often faced with the burden of handling data on the client side. SA-REST will address this issue in two ways:
*'''Adopting XML annotation from SAWSDL''': This will allow us to add lifting and lowering transformations to data elements as part of the API description. Information about SAWSDL lifting and lowering can be found in the [http://www.w3.org/TR/sawsdl/#annotateXSD SAWSDL spec on schema annotations]. There is a small catch to address here: in the WSDL world, XML was the de facto data exchange format, whereas in the RESTful environment developers can use many formats such as JSON, GData and RSS. It will be interesting to investigate this as part of the SA-REST effort.
*'''Mediatability''': [http://knoesis.wright.edu/students/gomadam/publications/GRRVS--ICSC08.pdf Mediatability] is a measure of the estimated human effort for performing data mediation manually. Additional annotations can significantly help in computing mediatability. Even when automatic mediation is not possible, knowing how hard or easy the mediation between two services is can help developers choose services for their mashups.
===Smart Mashups===
Smart mashups are those that allow the end user more flexibility to change certain services in a mashup. For example, in the popular [http://housingmaps.com Housing Maps] mashup, if the quality of Yahoo! Maps in a certain area is better than that of Google Maps, the user should have the flexibility to switch. To realize this, we are pursuing a meta-level approach to mashup creation. In this approach, the developer creates the mashup application at a meta level and services are added to it at run time. In this context, there needs to be a way for the developer to specify the requirements for a service and for the system to check whether the user's preference meets those requirements. Annotations can make this task less difficult.
===Automatic Annotation===
Semi-automatic text annotation is a significant research area, primarily due to the large volume of text data that becomes available every day. It is not viable to annotate such volumes of data purely by human effort, and one needs to employ text processing techniques to provide automatic markup. One major challenge in text processing is disambiguation: selecting the correct semantics of a word that may be used across domains to represent different concepts.
 
The ''domain-rel'' property acts as a guide to describe the domain(s) a certain text snippet describes and hence provides a means for the text processing / automatic annotation engines to perform effective disambiguation.
== Design principles and methods ==
===XHTML Design principles===
Due to the specific nature of this type of annotation, several design principles are usually followed when designing XHTML-based microformats. These patterns are well documented in the [http://microformats.org/wiki/hcalendar hcalendar microformat specification]. Here we outline the most important design principles that were followed in the design of this microformat.
# Reuse the schema (names, objects, properties, values, types, hierarchies, constraints) as much as possible from pre-existing, established, well-supported standards by reference. This introduces minimum or no disruption to the regular machinery that interacts with this markup.
# Use a generic structural element (e.g. <nowiki><span></nowiki> or <nowiki><div></nowiki> ), or the appropriate contextual element (e.g. an <nowiki><li></nowiki> inside a <nowiki><ul></nowiki> or <nowiki><ol></nowiki>).  
# If the format of the data according to the original schema is too long and/or not human-friendly, place the literal data into the 'title' attribute, leaving the human-readable text inside the element. This is used extensively in the hcalendar format, where human-readable text is wrapped in <nowiki><abbr></nowiki> elements to provide the machine-friendly annotation. The specific design decision in hcalendar is detailed by its author [http://tantek.com/log/2005/01.html#d26t0100 here].
===Annotation styles===
Now we provide two styles of applying the microformat. These styles can be used interchangeably depending on readability and convenience considerations.
====Class and Title Style====
Guided by the first design principle, the class and title attributes are used to provide a name-value relationship for the text content. For example marking up the word map with a specific class reference would appear in XHTML as follows
<pre><nowiki><span class="sem-class" title="http://apihut.com/schema/apihut-taxonomy#Map">Maps</span></nowiki></pre>
==== Class Only Style====
This is a convenient way of achieving the same objective of attaching a name-value pair to text content. The class attribute contains the name and the value separated by a space: the first token is considered to be the name and the rest, the value. Class only style has the additional benefit of not producing an undesired ''tooltip effect'' on the text, but it gives up the neat separation and hence sacrifices readability.
<pre><nowiki><span class="sem-class http://apihut.com/schema/apihut-taxonomy#Map">Maps</span></nowiki></pre>
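
The two styles carry the same information, which a consumer can normalize before further processing. The Python sketch below is one possible reading of the rules above, not a reference parser: it assumes the property name is always the first token of the class attribute, and the names <code>read_annotation</code>, <code>AnnotationCollector</code> and <code>collect</code> are made up for this example.

```python
from html.parser import HTMLParser

# Property names taken from this draft; the parsing rules are an assumption.
SA_REST_PROPERTIES = {"domain-rel", "sem-rel", "sem-class"}

def read_annotation(attrs):
    """Return (property, [values]) for one element, or None if unannotated."""
    tokens = (attrs.get("class") or "").split()
    if not tokens or tokens[0] not in SA_REST_PROPERTIES:
        return None
    name, inline_values = tokens[0], tokens[1:]
    if inline_values:
        # Class only style: everything after the name is the value.
        return name, inline_values
    # Class and title style: the value lives in the title attribute.
    return name, (attrs.get("title") or "").split()

class AnnotationCollector(HTMLParser):
    """Collect (tag, property, values) for every annotated start tag."""
    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        annotation = read_annotation(dict(attrs))
        if annotation:
            self.found.append((tag,) + annotation)

def collect(markup):
    collector = AnnotationCollector()
    collector.feed(markup)
    collector.close()
    return collector.found

title_style = '<span class="sem-class" title="http://apihut.com/schema/apihut-taxonomy#Map">Maps</span>'
class_style = '<span class="sem-class http://apihut.com/schema/apihut-taxonomy#Map">Maps</span>'
print(collect(title_style) == collect(class_style))
```

A limitation of this sketch is that an element whose class attribute lists an ordinary CSS class before the SA-REST property name would be skipped.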
=== Property Types ===
The microformat properties can be categorized into two major types.
====Block markup====
Markup that pertains to a block element such as div or body. Such markup applies to a larger portion of text that may or may not contain other markup.
<pre><nowiki> <body class="domain-rel" title="http://apihut.com/taxonomies/domainClassification.rdf#maps"> ... </body></nowiki></pre>
In this example, the domain-rel property is added to the body and hence covers the complete text content encapsulated by the <nowiki><body></nowiki> element. This particular property indicates that API descriptions inside the body belong to the maps domain as described in the domain model.
====Element markup====
Markup on a single element, such as a span, that wraps a word, a phrase or a single resource. Element markup should not contain other markup.
=== Multivalued Properties ===
Often it is necessary to associate multiple values with a single property. For example one might need to indicate that a certain text content as a whole is relevant to both mathematics and biology domains. In such situations SA-REST allows enumerations to be included as the value for a property. Enumerations are a white space separated list of references. The following example illustrates the use of enumerations as a property value.
<pre><nowiki> <body class="domain-rel" title="http://apihut.com/taxonomies/domainClassification.rdf#mathematics
http://apihut.com/taxonomies/domainClassification.rdf#biology"> ... </body> </nowiki></pre>
When using the class only style, every reference ''after'' the name is considered part of the value. The previous markup, written in class only style, would appear as follows.
<pre><nowiki> <body class="domain-rel http://apihut.com/taxonomies/domainClassification.rdf#mathematics
http://apihut.com/taxonomies/domainClassification.rdf#biology"> ... </body> </nowiki></pre>
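
Going the other way, an authoring tool could generate either form from a property name and a list of references. This is a minimal Python sketch under that assumption; <code>render_open_tag</code> and its <code>style</code> parameter are hypothetical names, and the function only builds the opening tag.

```python
from html import escape

def render_open_tag(tag, prop, references, style="class-and-title"):
    """Build an opening tag carrying one SA-REST property with one or more values."""
    # An enumeration is a white space separated list of references.
    enumeration = " ".join(references)
    if style == "class-and-title":
        return '<%s class="%s" title="%s">' % (tag, escape(prop), escape(enumeration))
    if style == "class-only":
        return '<%s class="%s">' % (tag, escape(prop + " " + enumeration))
    raise ValueError("unknown style: %r" % style)

domains = [
    "http://apihut.com/taxonomies/domainClassification.rdf#mathematics",
    "http://apihut.com/taxonomies/domainClassification.rdf#biology",
]
print(render_open_tag("body", "domain-rel", domains))
print(render_open_tag("body", "domain-rel", domains, style="class-only"))
```

Because the separator is white space, a reference that itself contains a space would be read back as two values; the sketch assumes references are URIs without spaces.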
==Basic SA-REST properties==
SA-REST has three basic properties, discussed below. These properties provide mechanisms to add richer semantic information to any Web resource. However, one can extend SA-REST to capture resource-specific semantics. Examples include SA-REST extensions for Web APIs and SA-REST extensions for social networking profiles. An author wishing to create a new microformat, however, is strongly urged to consider reuse for a resource type before attempting an extension.
===domain-rel===
The domain-rel property describes the domain of an entire resource. If a resource (such as a blog post) has content spanning multiple domains, it is preferable to add multiple domain-rel elements, each surrounding one section of the resource. If such a separation cannot be made, the title attribute should be an enumeration of the domains.

'''Simple domain-rel example'''
<pre><nowiki><span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks"> The growing trend of "liking" has recently caught a lot of attention of both network
users as well as developers.</span></nowiki></pre>
The example below illustrates a multi-domain scenario where the domain contexts can be separated in the content. This annotation is desirable for resources (such as integrated feeds) that draw content from multiple sources.

'''Multi-domain domain-rel example 1'''
<pre><nowiki><span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks"> The growing trend of "liking" has recently caught a lot of attention of both network
users as well as developers...</span>

<span class="domain-rel" title="http://apihut.com/schemas/economy#banking"> I also came across this interesting discussion on bailout that talked about nationalization of banks</span>
</nowiki></pre>
A very frequent scenario is one where the content of a resource spans multiple domains and cannot be separated by context.

'''Multi-domain domain-rel example 2'''
<pre><nowiki><span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks http://apihut.com/schemas/economy#recession"> One often wonders about the future of
advertisement driven Web applications in the current economic scenario. For example, social networking applications such as...</span></nowiki></pre>
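
A text processing engine that uses domain-rel for disambiguation needs to know which domains apply to each piece of text. The Python sketch below shows one way to compute that, assuming the innermost enclosing domain-rel element determines the domains of a text snippet (the draft does not specify nesting rules). The class name <code>DomainScopes</code> and the helper <code>domains_by_snippet</code> are hypothetical.

```python
from html.parser import HTMLParser

class DomainScopes(HTMLParser):
    """Pair each text snippet with the domains of its nearest domain-rel ancestor."""
    def __init__(self):
        super().__init__()
        self.stack = []      # (tag, domains or None) for every open element
        self.snippets = []   # (text, domains) pairs in document order

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        tokens = (attrs.get("class") or "").split()
        domains = None
        if tokens and tokens[0] == "domain-rel":
            # Class only style first, otherwise fall back to the title attribute.
            domains = tokens[1:] or (attrs.get("title") or "").split()
        self.stack.append((tag, domains))

    def handle_endtag(self, tag):
        # Close the nearest matching open element and anything left open inside it.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = " ".join(data.split())
        if not text:
            return
        for _tag, domains in reversed(self.stack):
            if domains:
                self.snippets.append((text, domains))
                return
        self.snippets.append((text, []))

def domains_by_snippet(markup):
    parser = DomainScopes()
    parser.feed(markup)
    parser.close()
    return parser.snippets

feed = (
    '<div><span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks">'
    'Liking is popular.</span> '
    '<span class="domain-rel" title="http://apihut.com/schemas/economy#banking">'
    'Banks were bailed out.</span></div>'
)
for text, domains in domains_by_snippet(feed):
    print(text, domains)
```

With an enumeration in the title attribute, as in example 2, the same snippet is simply reported with both domains.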
===sem-rel===
The sem-rel property captures the semantics of a link. It evolves from the popular rel tag. One application of sem-rel is to describe a data model that is captured in an XSD. The primary purpose of sem-rel is to allow developers to add "top level annotation" to third-party schemas. The sem-rel property also allows enumeration within the title attribute. It is used in conjunction with the <nowiki><a></nowiki> element.

<pre><nowiki> <a href="http://foo.xsd" class="sem-rel" title="http://taxonomy.org/computerscience#firstname"> This is the input schema</a> </nowiki></pre>

<pre><nowiki> <a href="http://www.teachmegooglemaps.com" class="sem-rel" title="http://apihut.com/taxonomies/domainClassification.rdf#maps"> Learn Google Maps</a></nowiki></pre>
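
To show how a client might consume sem-rel, the Python sketch below lists the annotated links in a well-formed XHTML fragment together with their references. It assumes the class and title conventions described under Annotation styles and uses only the standard library; <code>semantic_links</code> is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

XHTML = "{http://www.w3.org/1999/xhtml}"

def semantic_links(xhtml):
    """Return (href, link text, references) for every sem-rel anchor."""
    root = ET.fromstring(xhtml)
    links = []
    for element in root.iter():
        # Accept anchors with or without the XHTML namespace.
        if element.tag not in ("a", XHTML + "a"):
            continue
        tokens = element.get("class", "").split()
        if not tokens or tokens[0] != "sem-rel":
            continue
        references = tokens[1:] or element.get("title", "").split()
        label = " ".join("".join(element.itertext()).split())
        links.append((element.get("href"), label, references))
    return links

fragment = (
    '<p><a href="http://foo.xsd" class="sem-rel" '
    'title="http://taxonomy.org/computerscience#firstname"> This is the input schema</a></p>'
)
print(semantic_links(fragment))
```

Since <code>xml.etree.ElementTree</code> is an XML parser, this only works on well-formed XHTML; tag-soup HTML would need a more tolerant parser.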
===sem-class===
sem-class is an element markup property. It can be used to mark up a single entity within a resource. Like domain-rel, the sem-class value can be an enumeration. For example, in a blog, sem-class can be used to mark up single words, while in a page (such as YouTube), sem-class can mark up a video object. Both scenarios are exemplified below.

<pre><nowiki> One striking observation in evolution of <span class="sem-class" title="http://tap.stanford.edu/#computer"> Computers </span> is the relationship between speed and size.</nowiki></pre>
The example below illustrates the markup of a Flash media object. The markup describes the actual video that is embedded therein. In this case, the video is from [http://www.hulu.com/watch/49927/american-dad-i-cant-stan-you American Dad episode from hulu.com (may not be active all the time)]

'''sem-class on non-textual content'''
<pre><nowiki> <span class="sem-class" title="http://entertainment.org/schemas/tv#american_dad"><div id="player-container" style="text-align: center;">
<embed id="player" height="368" width="790" flashvars="stage_width=790&stage_height=368&content_id=m1ppkqeh&bitrate=700000&user_id=-1" bgcolor="#000000"
allowfullscreen="true" allowscriptaccess="sameDomain" quality="high" name="player" style="z-index: 10;" src="/player.swf" type="application/x-shockwave-flash"/>
</div></span></nowiki></pre>
==Processing SA-REST==
The most straightforward way to process the documents is to use [http://www.w3.org/TR/xslt.html XSLT] along with [http://www.w3.org/TR/grddl/ GRDDL]. XSLT is a well-supported and flexible way to transform XML documents from one form to another, the target form typically being XML or another text format. The GRDDL specification describes how an XSLT transformation can be used to convert annotated XHTML/HTML documents to RDF. The following snippet shows how a document specifies its transformation stylesheets according to GRDDL.

<pre><nowiki> <html xmlns="http://www.w3.org/1999/xhtml"
      xmlns:grddl='http://www.w3.org/2003/g/data-view#'
      grddl:transformation="glean_title.xsl
                            http://www.w3.org/2001/sw/grddl-wg/td/getAuthor.xsl"
  >
  </nowiki></pre>
The subsequent processing can be done using the RDF representation.
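
To give a feel for what such an RDF representation might contain, the Python sketch below serializes extracted annotations as N-Triples, using the annotated document as the subject. This is only an illustration and not the output of any actual SA-REST stylesheet: the predicate namespace <code>http://example.org/sa-rest#</code> is a placeholder, and <code>to_ntriples</code> is a hypothetical helper that assumes all references are valid IRIs.

```python
def to_ntriples(document_uri, annotations, vocabulary="http://example.org/sa-rest#"):
    """Emit one N-Triples statement per (property, reference) pair.

    The vocabulary namespace is a placeholder chosen for this sketch.
    """
    lines = []
    for prop, references in annotations:
        for reference in references:
            lines.append("<%s> <%s%s> <%s> ." % (document_uri, vocabulary, prop, reference))
    return "\n".join(lines)

annotations = [
    ("domain-rel", ["http://apihut.com/taxonomies/domainClassification.rdf#maps"]),
    ("sem-class", ["http://apihut.com/schema/apihut-taxonomy#Map"]),
]
print(to_ntriples("http://example.org/api-description.html", annotations))
```

Triples of this shape can be loaded into any RDF store and queried, which is the basis for the faceted search scenario described earlier.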
== Related Documents ==
* [http://knoesis.wright.edu/library/resource.php?id=00162 SA-REST publication in IEEE Internet Computing Journal]
* [http://knoesis.wright.edu/research/srl/standards/sa-rest/ SA-REST initial discussion draft]

Latest revision as of 11:12, 20 April 2011

Note : This is a draft version of SA-REST. The latest (and active) version is the W3C Member Submission



SA-REST is a simple and open microformat for enhancing Web resources with additional semantic information. In addition to HTML and XHTML, the SA-REST approach can also be used to enrich Atom, RSS, and arbitrary XML. SA-REST is one of several open microformat standards.

People

Collaborators in the SA-REST initiative are listed by their affiliations.

  • kno.e.sis center, Wright State University, Dayton, OH
    • Amit P. Sheth.
    • Karthik Gomadam.
    • Ajith Ranabahu

Introduction

Current mashup tools and technologies

Although mashups fully embrace the idea of customization on the Web, read-write is another story. The complexity of application development in JavaScript makes it hard for average developers to create new mashups or customize existing ones. To solve this problem, several companies are developing mashup creation tools that require little or no programming knowledge. These tools, exemplified by Yahoo! Pipes, IBM's QEDWiki and Google's Mashup Editor, let the user select a number of RESTful Web services or other Web resources and chain them together, piping one service's output into the next service's input while filtering content and making slight format changes.

Limitations

One of the main drawbacks of the current state of the art is the lack of support for interoperability, especially data interoperability. Most existing tools limit their support to services internal to the vendor that created them (for example, Google Mashup Editor provides a complete abstraction of the Google Maps service) or to services with standard output types such as RSS or Atom (Yahoo! Pipes), so the interoperability problem is not highlighted.

Another drawback of the current approach to mashups is the complexity of development itself. Frameworks like Pipes that do not require any programming have limited capabilities. Google Mashup Editor can do much more than Yahoo! Pipes but has a steeper learning curve. However, creating more complex mashups still requires proficiency in languages such as JavaScript. We identify two aspects of this complexity:

  • Visual: Complexity arising out of the need to create intuitive visual elements and handle various events relating to them
  • Data: Complexity arising due to heterogeneity in data schemas and formats.

The primary objective of SA-REST is to address the data complexity issue.

SA-REST in a nutshell

SA-REST is a microformat for adding meta-data to REST API descriptions (among other resources) in HTML and XHTML. Developers can directly embed meta-data from various models, such as an ontology, a taxonomy or a tag cloud, into their API descriptions. The embedded meta-data can be used to improve search (for example, faceted search for APIs) and data mediation (in conjunction with XML annotation), and to ease the integration of services into mashups.

Background

Rather than talking only about SWS (not relevant), we should instead talk about other microformats and the synergy therein with SA-REST.

Researchers in the area of Semantic Web Services have proposed various specifications, the most prominent of which are:

  • OWL-S
  • WSDL-S
  • WSMO

In 2005, the W3C initiated a charter to create a standard for adding semantics to WSDL descriptions. The WSDL-S specification (submitted by Services Research Lab at kno.e.sis from LSDIS Lab in GA along with IBM) was taken as the primary input for the charter. This led to the standardization of SAWSDL (Semantic Annotations for WSDL and XML Schema). SAWSDL has had a significant impact on the evolution of SA-REST. The adoption of a microformat-based approach is a key difference between the SAWSDL and SA-REST frameworks. Nevertheless, the principles of schema annotation, lifting and lowering can be used directly from SAWSDL for XML data objects in the RESTful environment.

Usage Scenarios

Faceted Search

The number of available APIs is growing fast. In April 2008, we found that about 700 APIs had been added to ProgrammableWeb; by September, that number was over 900. Currently, general purpose search engines like Google are largely used to find these APIs. However, they index and rank API documents like any other document. As a result, a search for APIs (even with a specific query like "Maps API") returns API resources scattered all over the result set. Web API directories like ProgrammableWeb do present a more domain-specific solution. However, they largely rely on user tags for classification and searching.

Adding meta-data that captures the various facets of APIs (their functionality, the message types they support, client-side bindings, protocol) can allow for better searching. The results of one such framework, APIHut, are presented in Faceted Search for APIs. We also present our initial evaluation of precision and recall metrics. SA-REST can significantly improve faceted search. Using the known techniques of GRDDL and XSLT, one can extract RDF representations of APIs, which can then be indexed and searched.
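To make the idea of searching over facets concrete, the following is a minimal sketch in Python of an inverted index over extracted annotations. It is not the APIHut implementation. The function names and the input shape (a mapping from a resource identifier to a list of property and value pairs) are assumptions made for illustration.

```python
from collections import defaultdict


def build_facet_index(annotations):
    """Map each (property, value) facet to the set of resources carrying it.

    `annotations` is assumed to look like
    {resource_id: [(property_name, value_uri), ...]}.
    """
    index = defaultdict(set)
    for resource, pairs in annotations.items():
        for prop, value in pairs:
            index[(prop, value)].add(resource)
    return index


def faceted_search(index, *facets):
    """Return the resources that carry every requested facet."""
    if not facets:
        return set()
    # Look up each facet without creating new keys, then intersect.
    matches = [index.get(facet, set()) for facet in facets]
    return set.intersection(*matches)
```

Each additional facet narrows the result set, which is the behaviour one would expect from a faceted API search interface.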

Data Mediation and Mediatability

The importance of enabling easier approaches to data mediation is well understood. In the context of mashups this is even more important, largely because developers are often faced with the burden of handling data on the client side. SA-REST will address this issue in two ways:

  • Adopting XML annotation from SAWSDL: This will allow us to add the lifting and lowering transformations to data elements as a part of the API description. Information about SAWSDL lifting and lowering can be found in the SAWSDL spec on schema annotations. There is a small catch that we have to address here. In the WSDL world, XML was the de facto data exchange format. In the RESTful environment, however, developers can use many formats such as JSON, GData and RSS. It will be interesting to investigate this as a part of the SA-REST effort.
  • Mediatability: Mediatability is a measure of the estimated human effort for performing data mediation manually. Having additional annotations can significantly help us in computing the mediatability. Even when automatic mediation is not possible, knowing how hard or easy the mediation between two services is can help developers choose services for their mashups.

Smart Mashups

Smart mashups are those that give the end user more flexibility to change certain services in a mashup. For example, in the popular Housing Maps mashup, if the quality of Yahoo! Maps in a certain area is better than that of Google, the user should have the flexibility to switch. To realize this, we are pursuing a meta-level approach to mashup creation. In this approach, the developer creates the mashup application at a meta level and services are added to it at run time. There then needs to be a way for the developer to specify the requirements for a service, and for the system to check whether the user's preference meets those requirements. Annotations make this task less difficult.

Automatic Annotation

Semi-automatic text annotation is a significant research area, primarily due to the large volume of text data that becomes available every day. It is not viable to annotate such volumes of data purely by human effort, and one needs to employ text processing techniques to provide automatic markup. One major challenge in text processing is disambiguation: selecting the correct semantics of a word that may be used across domains to represent different concepts.

The domain-rel property indicates the domain(s) that a text snippet pertains to, and hence provides a means for text processing and automatic annotation engines to perform effective disambiguation.
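As a rough illustration of how a domain annotation can guide disambiguation, the Python sketch below picks, from a set of candidate senses for a word, only those whose domain matches the enclosing domain-rel values. The sense inventory, its concept URIs under example.org and the function name are all hypothetical. A real annotation engine would draw its senses from an ontology or taxonomy.

```python
BIOLOGY = "http://apihut.com/taxonomies/domainClassification.rdf#biology"
MATHEMATICS = "http://apihut.com/taxonomies/domainClassification.rdf#mathematics"

# Hypothetical sense inventory: word -> {domain URI: concept URI}.
SENSE_INVENTORY = {
    "matrix": {
        BIOLOGY: "http://example.org/concepts#ExtracellularMatrix",
        MATHEMATICS: "http://example.org/concepts#Matrix",
    },
}


def disambiguate(word, domains, sense_inventory=SENSE_INVENTORY):
    """Return the candidate concepts for `word` whose domain is in `domains`.

    `domains` is the set of domain-rel values that apply to the text
    in which the word occurs.
    """
    senses = sense_inventory.get(word.lower(), {})
    return [concept for domain, concept in senses.items() if domain in domains]
```

With a domain-rel of mathematics in scope, the word "matrix" resolves to the single mathematical concept. Without any domain information, this sketch returns no candidate and leaves the decision to other techniques.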

Design principles and methods

XHTML Design principles

Due to the specific nature of this type of annotation, several design principles are usually followed when designing XHTML-based microformats. These patterns are well documented in the hCalendar microformat specification. Here we outline the most important design principles that were followed in designing this microformat.

  1. Reuse the schema (names, objects, properties, values, types, hierarchies, constraints) as much as possible from pre-existing, established, well-supported standards by reference. This introduces minimal or no disruption to the regular machinery that interacts with this markup.
  2. Use a generic structural element (e.g. <span> or <div> ), or the appropriate contextual element (e.g. an <li> inside a <ul> or <ol>).
  3. If the format of the data according to the original schema is too long and/or not human-friendly, place the literal data into the 'title' attribute, leaving the human-readable text inside the element. This is used extensively in the hCalendar format, where text is wrapped in <abbr> elements to provide the machine-friendly annotation. The author of hCalendar details this design decision here.

Annotation styles

We now describe two styles of applying the microformat. These styles can be used interchangeably depending on readability and convenience considerations.

Class and Title Style

Guided by the first design principle, the class and title attributes are used to provide a name-value relationship for the text content. For example, marking up the word "Maps" with a specific class reference would appear in XHTML as follows:

<span class="sem-class" title="http://apihut.com/schema/apihut-taxonomy#Map">Maps</span>

Class Only Style

This is a convenient way of achieving the same objective of attaching a name-value pair to text content. The class attribute contains the name and the value separated by a space; the first token is taken to be the name and the remainder the value. The class only style has the additional benefit of avoiding an undesired tooltip effect on the text, but it gives up the neat separation of name and value and hence sacrifices readability.

<span class="sem-class http://apihut.com/schema/apihut-taxonomy#Map">Maps</span>
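The two styles can be read by the same small routine. The following Python sketch uses the standard library HTML parser to collect name-value pairs from either style. The function and class names are illustrative only, and the sketch assumes that the SA-REST property name is always the first token of the class attribute, as described above.

```python
from html.parser import HTMLParser

# Property names defined in this draft of SA-REST.
SA_REST_PROPERTIES = {"domain-rel", "sem-rel", "sem-class"}


def read_annotation(attrs):
    """Return (name, value) for an annotated element, or None.

    `attrs` is a dict of the element's attributes.
    """
    tokens = (attrs.get("class") or "").split()
    if not tokens or tokens[0] not in SA_REST_PROPERTIES:
        return None
    name, rest = tokens[0], tokens[1:]
    if rest:
        # Class only style: everything after the name is the value.
        return name, " ".join(rest)
    # Class and title style: the value is held in the title attribute.
    title = attrs.get("title")
    return (name, title.strip()) if title else None


class AnnotationCollector(HTMLParser):
    """Collects (tag, name, value) for every annotated start tag."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        annotation = read_annotation(dict(attrs))
        if annotation:
            self.found.append((tag, *annotation))


def collect(markup):
    """Parse `markup` and return the annotations found, in document order."""
    parser = AnnotationCollector()
    parser.feed(markup)
    parser.close()
    return parser.found
```

Applied to the two span examples above, collect yields the same sem-class pair for both, which is the sense in which the styles are interchangeable.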

Property Types

The microformat properties can be categorized into two major types.

Block markup

Markup that pertains to a block element such as div or body. Such markup applies to a larger portion of text that may or may not contain other markup.

 <body class="domain-rel" title="http://apihut.com/taxonomies/domainClassification.rdf#maps"> ... </body>

In this example, the domain-rel property is added to the body and hence covers the complete text content encapsulated by the <body> element. This particular property indicates that API descriptions inside the body belong to the maps domain as described in the domain model.

Element markup

Markup on a single element, such as a span, that wraps a word, a phrase or a single resource. Element markup should not contain other markup.
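The nesting rule for element markup can be checked mechanically. The Python sketch below does so under two assumptions: that "other markup" means other SA-REST annotations, and that sem-class is the only element-level property (the only one this draft names as such). The function names and the example.org URIs used with it are hypothetical, and the input must be well-formed XHTML.

```python
import xml.etree.ElementTree as ET

PROPERTIES = {"domain-rel", "sem-rel", "sem-class"}
# Assumption: sem-class is the only property treated as element markup.
ELEMENT_LEVEL = {"sem-class"}


def property_name(element):
    """Return the SA-REST property on `element`, or None."""
    tokens = element.get("class", "").split()
    return tokens[0] if tokens and tokens[0] in PROPERTIES else None


def nesting_violations(xhtml):
    """Return (outer property, inner property) pairs that break the rule."""
    root = ET.fromstring(xhtml)
    violations = []
    for outer in root.iter():
        outer_name = property_name(outer)
        if outer_name not in ELEMENT_LEVEL:
            continue
        # Any annotated descendant of an element-level annotation is flagged.
        for inner in outer.iter():
            if inner is not outer and property_name(inner):
                violations.append((outer_name, property_name(inner)))
    return violations
```

A block annotation that contains element annotations passes this check, while an element annotation that wraps another annotation is reported.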

Multivalued Properties

Often it is necessary to associate multiple values with a single property. For example, one might need to indicate that a certain text content as a whole is relevant to both the mathematics and biology domains. In such situations SA-REST allows an enumeration to be used as the value of a property. An enumeration is a whitespace-separated list of references. The following example illustrates the use of an enumeration as a property value.

 <body class="domain-rel" title="http://apihut.com/taxonomies/domainClassification.rdf#mathematics 
http://apihut.com/taxonomies/domainClassification.rdf#biology"> ... </body>  

When using the class only style, everything after the name is considered to be the value. The previous markup, when written in the class only style, would appear as follows.

 <body class="domain-rel http://apihut.com/taxonomies/domainClassification.rdf#mathematics 
http://apihut.com/taxonomies/domainClassification.rdf#biology"> ... </body>  
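Because an enumeration is separated by whitespace, a consumer can obtain the individual references with a plain split, whichever style was used. The following Python sketch shows one way to do this. The function name and signature are illustrative and not part of SA-REST.

```python
def property_values(class_attr, title_attr=None):
    """Return (name, [values]) for one annotated element, or None.

    Handles both styles: values listed after the name in the class
    attribute take precedence, otherwise the title attribute is used.
    """
    tokens = (class_attr or "").split()
    if not tokens:
        return None
    name, inline_values = tokens[0], tokens[1:]
    if inline_values:
        # Class only style.
        return name, inline_values
    # Class and title style. split() with no argument treats spaces,
    # tabs and line breaks alike, so a wrapped attribute value is fine.
    return name, (title_attr or "").split()
```

Both body examples above produce the same result: the name domain-rel with the mathematics and biology references as two separate values.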

Basic SA-REST properties

SA-REST has three basic properties, discussed below. These properties provide mechanisms for adding richer semantic information to any Web resource. One can also extend SA-REST to capture resource-specific semantics. Examples of these include SA-REST extensions for Web APIs and SA-REST extensions for social networking profiles. An author wishing to create a new microformat, however, is strongly urged to consider reuse for a resource type before attempting an extension.

domain-rel

The domain-rel property allows the description of domain information for an entire resource. If a given resource (such as a blog post) has content spanning multiple domains, it is desirable to add multiple domain-rel elements, each surrounding a section of the resource. If such a separation cannot be made, the title attribute should be an enumeration of the domains.

Simple domain-rel example

<span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks"> The growing trend of "liking" has recently caught a lot of attention of both network 
users as well as developers.</span>

The example below illustrates a multi-domain scenario where the domain contexts can be separated in the content. This annotation is desirable when creating a resource (such as an integrated feed) that draws content from multiple sources.

Multi-domain domain-rel example 1

<span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks"> The growing trend of "liking" has recently caught a lot of attention of both network 
users as well as developers...</span>

<span class="domain-rel" title="http://apihut.com/schemas/economy#banking"> I also came across this interesting discussion on bailout that talked about nationalization of banks</span>

A very frequent scenario is one where a resource's content spans multiple domains and the content is not contextualized.

Multi-domain domain-rel example 2

<span class="domain-rel" title="http://apihut.com/schemas/socialnetworking#socialnetworks http://apihut.com/schemas/economy#recession"> One often wonders the future of 
advertisement driven Web applications in the current economic scenario. For example, social networking applications such as...</span>

sem-rel

The sem-rel property captures the semantics of a link. It evolves from the popular rel tag. One application of sem-rel is to describe a data model that is captured in an XSD. The primary purpose of the sem-rel tag is to allow developers to add "top level annotation" to third-party schemas. The sem-rel property also allows enumeration within the title attribute. It is used in conjunction with the a element.

 <a href="http://foo.xsd" class="sem-rel" title="http://taxonomy.org/computerscience#firstname"> This is the input schema</a> 
 <a href="www.teachmegooglemaps.com" class="sem-rel" title="http://apihut.com/taxonomies/domainClassification.rdf#maps"> Learn Google Maps</a>

sem-class

sem-class is an element markup property. It can be used to mark up a single entity within a resource. Like domain-rel, the sem-class value can be an enumeration. For example, in a blog, sem-class can be used to mark up single words, while in a page (such as YouTube), sem-class can mark up a video object. Both scenarios are exemplified below.

 One striking observation in evolution of <span class="sem-class" title="http://tap.stanford.edu/#computer"> Computers </span> is the relationship between speed and size.

The example below illustrates the markup of a Flash media object. The markup describes the actual video that is embedded therein. In this case, the video is an American Dad episode from hulu.com (it may not be active all the time).

sem-class on non-textual content

 <span class="sem-class" title="http://entertainment.org/schemas/tv#american_dad"><div id="player-container" style="text-align: center;">
<embed id="player" height="368" width="790" flashvars="stage_width=790&stage_height=368&content_id=m1ppkqeh&bitrate=700000&user_id=-1" bgcolor="#000000" 
allowfullscreen="true" allowscriptaccess="sameDomain" quality="high" name="player" style="z-index: 10;" src="/player.swf" type="application/x-shockwave-flash"/>
</div></span>

Processing SA-REST

The most straightforward way to process the documents is to use XSLT along with GRDDL. XSLT is a well-supported and flexible way to transform XML documents from one form to another, the target form typically being XML or some other text format. The GRDDL specification describes how an XSLT transformation can be used to convert annotated XHTML/HTML documents to RDF. The following snippet shows how a transformation stylesheet is specified according to GRDDL.

 <html xmlns="http://www.w3.org/1999/xhtml"
      xmlns:grddl='http://www.w3.org/2003/g/data-view#'
      grddl:transformation="glean_title.xsl
			http://www.w3.org/2001/sw/grddl-wg/td/getAuthor.xsl"
 >
  

The subsequent processing can be done using the RDF representation.
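For readers who want to experiment without an XSLT toolchain, the following Python sketch produces a comparable RDF view directly, as N-Triples. It is a substitute for the GRDDL and XSLT route described above, not an implementation of it. The predicate namespace http://example.org/sa-rest# is hypothetical, as is the choice of subject: the document URI in general, and the link target for sem-rel on an element that has an href.

```python
from html.parser import HTMLParser

PROPERTIES = {"domain-rel", "sem-rel", "sem-class"}
# Hypothetical predicate namespace, used only for this illustration.
VOCAB = "http://example.org/sa-rest#"


class TripleExtractor(HTMLParser):
    """Turns SA-REST annotations into (subject, predicate, object) triples."""

    def __init__(self, document_uri):
        super().__init__()
        self.document_uri = document_uri
        self.triples = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        tokens = (attrs.get("class") or "").split()
        if not tokens or tokens[0] not in PROPERTIES:
            return
        name = tokens[0]
        # Class only style first, then class and title style.
        values = tokens[1:] or (attrs.get("title") or "").split()
        # Assumption: sem-rel describes the link target, everything
        # else describes the document being processed.
        subject = self.document_uri
        if name == "sem-rel" and attrs.get("href"):
            subject = attrs["href"]
        for value in values:
            self.triples.append((subject, VOCAB + name, value))


def to_ntriples(markup, document_uri):
    """Return the annotations in `markup` as an N-Triples string."""
    extractor = TripleExtractor(document_uri)
    extractor.feed(markup)
    extractor.close()
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in extractor.triples)
```

The resulting triples can be loaded into any RDF store for the indexing and search scenarios discussed earlier. An enumeration yields one triple per value.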

Related Documents