xfolk: Difference between revisions
No edit summary |
(→hCard) |
||
Line 3: | Line 3: | ||
The xFolk page is under construction. The official page is still [http://thecommunityengine.com/home/archives/2005/05/xfolk_entry_04.html here]. | The xFolk page is under construction. The official page is still [http://thecommunityengine.com/home/archives/2005/05/xfolk_entry_04.html here]. | ||
= | = xFolk = | ||
xFolk is a simple and open format for publishing collections of social bookmarks like those produced by de.licio.us, de.lirio.us, and blogmarks. Using a standard like xFolk makes it easier to develop and share ajax scripts for manipulating bookmarks with the added benefit that the bookmarks will be indexable by technorati and other tagging services. xFolk may be embedded in (X)HTML, Atom, RSS, and arbitrary XML. It is one of several open [[microformats|microformat]] standards. | |||
__TOC__ | __TOC__ | ||
Line 11: | Line 12: | ||
=== Editor === | === Editor === | ||
[http:// | [http://thecommunityengine.com/ Bud Gibson], [http://thecommunityengine.com The Community Engine] | ||
=== Authors === | === Authors === | ||
* [http:// | * [http://thecommunityengine.com/ Bud Gibson], [http://thecommunityengine.com The Community Engine] | ||
=== Copyright === | === Copyright === | ||
Line 24: | Line 24: | ||
=== Inspiration and Acknowledgments === | === Inspiration and Acknowledgments === | ||
Thanks to: | Thanks to: [http://www.cnbc.cmu.edu/~plaut/ David Plaut] who introduced me to the idea that items may partially belong to many categories at once and statistical ways to sort that out over a decade ago. More recently, thanks to [ Tantek Çelik], [ Kevin Marks], [ Steve Mallet], [ Brian DelVecchio], and [ François Hardienne] who have contributed to the development of xFolk with thoughtful critiques and implementations. | ||
== Introduction == | == Introduction == | ||
Current social bookmarking services are centralized systems that allow user to contribute, tag, and share bookmarks. Over the past year, the number of these services has exploded, with entrants like: [ del.icio.us], [ furl], [ de.lirio.us], and [ blogmarks]. Currently, technorati and gattaga allow you to view bookmarks in a subset of these systems by tag. | |||
A major issue for sharing bookmarks across systems is the lack of an interoperable standard for representing social bookmark data. An open standard for sharing social bookmark data would, for instance, make it possible to write ajax scripts that work across social bookmark services, thereby generally improving user experience. It would also make it easier to pull data from across social bookmark services and even from decentralized social bookmark repositories like individual scuttle installations. Users would no longer be forced to rely on aggregators like technorati to come to agreement with social bookmark systems like del.icio.us to index their tags or search engines like gattaga to come up with adapters for each service's specific social bookmark format. | |||
xFolk is a microformat that builds on reltag to provide an open standard for publishing social bookmark data. Using xFolk | |||
== Semantic XHTML Design Principles == | == Semantic XHTML Design Principles == |
Revision as of 13:58, 6 July 2005
Under Construction
The xFolk page is under construction. The official page is still here.
xFolk
xFolk is a simple and open format for publishing collections of social bookmarks like those produced by de.licio.us, de.lirio.us, and blogmarks. Using a standard like xFolk makes it easier to develop and share ajax scripts for manipulating bookmarks with the added benefit that the bookmarks will be indexable by technorati and other tagging services. xFolk may be embedded in (X)HTML, Atom, RSS, and arbitrary XML. It is one of several open microformat standards.
Draft Specification
Editor
Bud Gibson, The Community Engine
Authors
Copyright
This specification is (C) 2004-2025 by the authors. However, the authors intend to submit (or already have submitted, see details in the spec) this specification to a standards body with a liberal copyright/licensing policy such as the GMPG, IETF, and/or W3C. Anyone wishing to contribute should read their copyright principles, policies and licenses (e.g. the GMPG Principles) and agree to them, including licensing of all contributions under all required licenses (e.g. CC-by 1.0 and later), before contributing.
Patents
This specification is subject to a royalty free patent policy, e.g. per the W3C Patent Policy, and IETF RFC3667 & RFC3668.
Inspiration and Acknowledgments
Thanks to: David Plaut who introduced me to the idea that items may partially belong to many categories at once and statistical ways to sort that out over a decade ago. More recently, thanks to [ Tantek Çelik], [ Kevin Marks], [ Steve Mallet], [ Brian DelVecchio], and [ François Hardienne] who have contributed to the development of xFolk with thoughtful critiques and implementations.
Introduction
Current social bookmarking services are centralized systems that allow user to contribute, tag, and share bookmarks. Over the past year, the number of these services has exploded, with entrants like: [ del.icio.us], [ furl], [ de.lirio.us], and [ blogmarks]. Currently, technorati and gattaga allow you to view bookmarks in a subset of these systems by tag.
A major issue for sharing bookmarks across systems is the lack of an interoperable standard for representing social bookmark data. An open standard for sharing social bookmark data would, for instance, make it possible to write ajax scripts that work across social bookmark services, thereby generally improving user experience. It would also make it easier to pull data from across social bookmark services and even from decentralized social bookmark repositories like individual scuttle installations. Users would no longer be forced to rely on aggregators like technorati to come to agreement with social bookmark systems like del.icio.us to index their tags or search engines like gattaga to come up with adapters for each service's specific social bookmark format.
xFolk is a microformat that builds on reltag to provide an open standard for publishing social bookmark data. Using xFolk
Semantic XHTML Design Principles
XHTML is built on XML, and thus XHTML-based formats can be used not only for convenient display presentation, but also for general-purpose data exchange. In many ways, XHTML-based formats exemplify the best of both HTML and XML worlds. However, when building XHTML-based formats, it helps to have a guiding set of principles.
- Reuse the schema (names, objects, properties, values, types, hierarchies, constraints) as much as possible from pre-existing, established, well-supported standards by reference. Avoid restating constraints expressed in the source standard. Informative mentions are ok.
- For types with multiple components, use nested elements with class names equivalent to the names of the components.
- Plural components are made singular, and thus multiple nested elements are used to represent multiple text values that are comma-delimited.
- Use the most accurately precise semantic XHTML building block for each object etc.
- Otherwise use a generic structural element (e.g.
<span>
or<div>
), or the appropriate contextual element (e.g. an<li>
inside a<ul>
or<ol>
). - Use class names based on names from the original schema, unless the semantic XHTML building block precisely represents that part of the original schema. If names in the source schema are case-insensitive, then use an all lowercase equivalent. Components names implicit in prose (rather than explicit in the defined schema) should also use lowercase equivalents for ease of use. Spaces in component names become dash '-' characters.
- Finally, if the format of the data according to the original schema is too long and/or not human-friendly, use
<abbr>
instead of a generic structural element, and place the literal data into the 'title' attribute (where abbr expansions go), and the more brief and human-readable equivalent into the element itself. Further informative explanation of this use of<abbr>
: Human vs. ISO8601 dates problem solved
Format
In General
The vCard standard (RFC2426) forms the basis of hCard.
The basic format of hCard is to use vCard object/property names in lower-case for class names, and to map the nesting of vCard objects directly into nested XHTML elements.
More Semantic Equivalents
However, for some properties there is a more semantic equivalent, and therefore they get special treatment, e.g.:
URL
in vCard becomes<a class="url" href="...">...</a>
inside the element withclass="vcard"
in hCard.- Similarly,
EMAIL
in vCard becomes<a class="email" href="mailto:...">...</a>
PHOTO
in vCard becomes<img class="photo" src="..." alt="Photo of ..." />
Singular vs. Plural Properties
For properties which are singular (e.g. "N" and "FN"), the first descendant element with that class should take effect, any others being ignored.
For properties which can be plural (e.g. "TEL"), each class instance should create a instance of that property. Plural properties with subtypes (e.g. TEL with WORK, HOME, CELL) can be optimized to share a common element for the property itself, with each instance of subtype being an appropriately classed descendant of the property element.
Human vs. Machine readable
If an element is used for a property, then the '
title
' attribute of the element is the value of the property, instead of the contents of the element, which instead provide a human presentable version of the value.
Similarly, if an <img>
element is used for one or more properties, it must be treated as follows:
- For the "PHOTO" property and any other property that takes a URL as its value, the
src="..."
attribute provides the property value. - For other properties, the
<img>
element's 'alt
' attribute is the value of the property.
Property Exceptions
vCard has several properties which either do not make sense on, or are already implied within the context of a web page. This section explains what to (not) do with them.
- NAME, PROFILE, SOURCE, PRODID properties as defined in Sections 2.1.2, 2.1.3, 2.1.4, 3.6.3 of RFC 2426. Content publishers MUST NOT use these properties in their hCards, and as such, hCard consumers/parsers MUST IGNORE these properties if they are found within an hCard. Instead. hCard to vCard converters SHOULD use the title of the page where the hCard is found (e.g. the
<title>
element in (X)HTML documents) to construct the NAME property, MAY output a PROFILE value of "VCARD
" per RFC 2426, SHOULD use the URL of the page where the hCard is found to construct the SOURCE property (e.g. perhaps as a parameter to URL that converts hCards to vCards), for an output vCard stream (e.g. a .vcf file).
Implied "N" Optimization
Although vCard requires that the "N" property be present, the authors of the vCard specification (RFC 2426) themselves do not include "N" properties in their vCards near the end of the spec (p.38). This apparent contradiction can be resolved by simply allowing the "FN" property to imply "N" property values in typical cases provided in the spec. We do so explicitly in hCard.
If the value of the "FN" property is exactly two words (separated by whitespace.), and there is no explicit "N" property, then the "N" property is inferred from the "FN" property. For "FN"s with either one word or three or more, then the author MUST explicitly markup the "N".
- The content of "FN" is broken into two "words" separated by whitespace.
- The first word of the "FN" is interpreted as the "Given Name" for the "N" property.
- The second/last word of the "FN" is interpreted as the "Family Name" for the "N" property.
- Exception: If the first word ends in a "," comma OR if the second word is a single character (optionally followed by a period "."), then the first word (minus the comma at the end if any) is interpreted as the "Family Name" and the second word is interpreted as the "Given Name".
This allows simplification in the typical case of people stating their firstname space lastname.
Implied "organization-name" Optimization
The "ORG" property has two subproperties, organization-name and organization-unit. Very often authors only publish the organization-name. Thus if an "ORG" property has no "organization-name" inside it, then its entire contents MUST be treated as the "organization-name".
XMDP Profile
See hcard-profile for the XMDP profile of hCard.
Example
Here is a sample vCard:
BEGIN:VCARD
VERSION:3.0
N:Çelik;Tantek
FN:Tantek Çelik
URL:http://tantek.com
ORG:Technorati
END:VCARD
and an equivalent in hCard with various elements optimized appropriately. See hCard Example 1 for the derivation.
<div class="vcard">
<a class="url fn" href="http://tantek.com/">
Tantek Çelik
</a>
<div class="org">Technorati</div>
</div>
This hCard might be displayed as::
Tantek Çelik
Technorati
Note: The version information is unnecessary in hCard markup directly since the version will be defined by the profile of hCard that is used/referred to in the 'profile' attribute of the <head> element.
Examples in the wild
This section is informative.
The following sites have published hCards, and thus are a great place to start for anyone looking for examples "in the wild" to try parsing, indexing, organizing etc. If you have an hCard on your own page, feel free to add it to the top of this list. Once the list grows too big, we'll make a separate wiki page.
- Cinema Treasures uses hCard to markup venue information for 10,000+ movie theaters.
- Andy Hume uses the hcard format to mark-up the names and URLs of commentors on his blog.
- Steven Ametjan has published his hCard on his about page.
- EVDB, the Events and Venues Database, publishes all venue detail pages with embedded hCards.
- Paul Schreiber's Sunnyvale House Concerts site publishes hCards for upcoming artists, as well as an hCard for the page itself. In addition the Past Shows page contains hCards for all past artists.
- Rob Mientjes has published his hCard on his about page.
- Don Crowley has published his hCard as well as a nifty hCard button:
- Technorati's About page lists their Media Contact, and their Press page also lists their Press Contact, both as inline hCards.
- Tantek's Thoughts includes an inline author hCard at the bottom of the page.
Implementations
This section is informative.
The following implementations have been developed which either generate or parse hCards. If you have an hCard implementation, feel free to add it to the top of this list. Once the list grows too big, we'll make a separate wiki page.
- The hCard creator is a very simple, yet illustrative, open source user interface / form / script which creates an hCard in real-time as you type in a set of contact information.
- rfc2629.xslt now attempts to generate hCard information (RFC2629 is an XML format for authoring RFCs and Internet Drafts, see example document)
- iChat buddy list to hCards - Open source AppleScript to automatically convert one's buddy list in the MacOSX iChat AIM client into a valid XHTML 1.0 Strict list of hCards.
- X2V is a bookmarklet that parses hCard and produces a .vcf (vCard) stream. Note: needs to be updated as the spec is refined
References
Normative References
Informative References
- CSS1
- hCard term introduced and defined on the Web, 20040930
- FOO Camp 2004 Simple Semantic Formats presentation, 20040910
- Contributed from http://developers.technorati.com/wiki/hCard.
- XHTML 1.1
Similar Work
Work in progress
This specification is a work in progress. As additional aspects are discussed, understood, and written, they will be added. There is a separate document where we are keeping our brainstorms and other explorations relating to hCard:
Discussions
Q&A
- If you have any questions about hCard, check the hCard FAQ, and if you don't find answers, add your questions!
Issues
- Please add any issues with the specification to the separate hCard issues document.