<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: LinkedIn terrified of OpenCalais?</title>
	<atom:link href="http://hightechcville.wordpress.com/2009/02/25/linkedin-terrified-of-opencalais/feed/" rel="self" type="application/rss+xml" />
	<link>http://hightechcville.wordpress.com/2009/02/25/linkedin-terrified-of-opencalais/</link>
	<description>Just another WordPress.com weblog</description>
	<lastBuildDate>Wed, 25 Feb 2009 21:53:43 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Thomas Tague</title>
		<link>http://hightechcville.wordpress.com/2009/02/25/linkedin-terrified-of-opencalais/#comment-29</link>
		<dc:creator>Thomas Tague</dc:creator>
		<pubDate>Wed, 25 Feb 2009 21:53:43 +0000</pubDate>
		<guid isPermaLink="false">http://hightechcville.wordpress.com/?p=47#comment-29</guid>
		<description>Eric:

You might want to experiment a bit with the GenericRelations extraction parameter - but be prepared for a flood of very general metadata. This exposes general relationships betwwen a known entity type (for example person) and ... whatever. 

Regards,</description>
		<content:encoded><![CDATA[<p>Eric:</p>
<p>You might want to experiment a bit with the GenericRelations extraction parameter &#8211; but be prepared for a flood of very general metadata. This exposes general relationships betwwen a known entity type (for example person) and &#8230; whatever. </p>
<p>Regards,</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eric</title>
		<link>http://hightechcville.wordpress.com/2009/02/25/linkedin-terrified-of-opencalais/#comment-27</link>
		<dc:creator>Eric</dc:creator>
		<pubDate>Wed, 25 Feb 2009 20:43:07 +0000</pubDate>
		<guid isPermaLink="false">http://hightechcville.wordpress.com/?p=47#comment-27</guid>
		<description>Tom,

I agree with opening the gate up a bit, since they will always have a better handle on the structure of the data since they built the structure then a tool like Calais will...   

I do owe another post about LinkedIn.  I noticed that that they did recently add hCard formatting to their profile pages, which makes reusing that content much simpler, and is a nice sign of their playing will with open standards.  

I think the big value in Calais is the handling unstructured data.  I can write a simple parser for structured data, well, assuming we aren&#039;t looking at 1000&#039;s of sites.  And Dapper is an interesting approach at the same issue.  

Calais has really enahanced the data in HTC, and the next version which will be live later this week will reflect that.  Now if only Calais could pull out event information like &quot;Bob Smith speaking Monday 2/25/09 on Semantic Web&quot;!</description>
		<content:encoded><![CDATA[<p>Tom,</p>
<p>I agree with opening the gate up a bit, since they will always have a better handle on the structure of the data since they built the structure then a tool like Calais will&#8230;   </p>
<p>I do owe another post about LinkedIn.  I noticed that that they did recently add hCard formatting to their profile pages, which makes reusing that content much simpler, and is a nice sign of their playing will with open standards.  </p>
<p>I think the big value in Calais is the handling unstructured data.  I can write a simple parser for structured data, well, assuming we aren&#8217;t looking at 1000&#8217;s of sites.  And Dapper is an interesting approach at the same issue.  </p>
<p>Calais has really enahanced the data in HTC, and the next version which will be live later this week will reflect that.  Now if only Calais could pull out event information like &#8220;Bob Smith speaking Monday 2/25/09 on Semantic Web&#8221;!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas Tague</title>
		<link>http://hightechcville.wordpress.com/2009/02/25/linkedin-terrified-of-opencalais/#comment-26</link>
		<dc:creator>Thomas Tague</dc:creator>
		<pubDate>Wed, 25 Feb 2009 20:30:32 +0000</pubDate>
		<guid isPermaLink="false">http://hightechcville.wordpress.com/?p=47#comment-26</guid>
		<description>Eric:

Brief follow up.

I processed the page with Gnosis (http://bit.ly/8ICuy) our Firefox plugin. And it did... not so great. Calais is really designed to deal with unstructured textual prose - and a formatted page like the LinkedIn profile doesn&#039;t give us a lot to work with. That being said - teaching the system to understand the structural elements (ala Dapper) of the top 1,000 sites or so would not be that big a deal.</description>
		<content:encoded><![CDATA[<p>Eric:</p>
<p>Brief follow up.</p>
<p>I processed the page with Gnosis (<a href="http://bit.ly/8ICuy" rel="nofollow">http://bit.ly/8ICuy</a>) our Firefox plugin. And it did&#8230; not so great. Calais is really designed to deal with unstructured textual prose &#8211; and a formatted page like the LinkedIn profile doesn&#8217;t give us a lot to work with. That being said &#8211; teaching the system to understand the structural elements (ala Dapper) of the top 1,000 sites or so would not be that big a deal.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas Tague</title>
		<link>http://hightechcville.wordpress.com/2009/02/25/linkedin-terrified-of-opencalais/#comment-25</link>
		<dc:creator>Thomas Tague</dc:creator>
		<pubDate>Wed, 25 Feb 2009 20:22:00 +0000</pubDate>
		<guid isPermaLink="false">http://hightechcville.wordpress.com/?p=47#comment-25</guid>
		<description>Eric:

Tom Tague from Calais / Semanticproxy.com here. 

It might be intentional blocking - or it might be that the page has some non-standard HTML that somehow deeply confuses semanticproxy.com. I noticed it handles the summary page fine - but not the full profile. Sometimes things confuse us - we&#039;ll take a look.

All technology aside - you do raise an interesting point. It will be very interesting to see how various walled gardens deal with the onset of tools like Semanticproxy that free the information inside them for wider consumption and sharing. 

We&#039;re treading on the safe side of the equation and following robots rules and all that. But - we&#039;re the good guys. There are plenty of people ready to harvest this type of information using any tools at hand.

Maybe the gardens should open the gate just a bit?</description>
		<content:encoded><![CDATA[<p>Eric:</p>
<p>Tom Tague from Calais / Semanticproxy.com here. </p>
<p>It might be intentional blocking &#8211; or it might be that the page has some non-standard HTML that somehow deeply confuses semanticproxy.com. I noticed it handles the summary page fine &#8211; but not the full profile. Sometimes things confuse us &#8211; we&#8217;ll take a look.</p>
<p>All technology aside &#8211; you do raise an interesting point. It will be very interesting to see how various walled gardens deal with the onset of tools like Semanticproxy that free the information inside them for wider consumption and sharing. </p>
<p>We&#8217;re treading on the safe side of the equation and following robots rules and all that. But &#8211; we&#8217;re the good guys. There are plenty of people ready to harvest this type of information using any tools at hand.</p>
<p>Maybe the gardens should open the gate just a bit?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
