Semantic legalities
I’ve just read a very interesting article on Read/WriteWeb about this new semantic web thing and some kwaai sites out there that “screen scrape” blogs/sites and formulate the content semantically. The question arose: “Is this legal?”
First, lets look at Dapper. What they do is they allow you to scrape content from sites by defining what is what through a user interface. You then “learn” Dapper what goes where and before you know it, it can convert your web pages to a web service. Hmmm… yes, I know, KICKS ASS!
Yahoo! Pipes is another one that does something similar, but with RSS feeds.
The problem is what if the content is copywrited? What if I go and scrape Tertia’s blog and present it in book format on my own site? Now that sounds unethical and dodgy to say the least! (BTW, congratulations Tertia, for being nominated as a Blooker Award finalist!). I can do that though. Does this mean that the semantic web poses a threat that Tertia will lose her rights to her content? Does this make Copyright statements at the bottom of sites “null-in-void”? It shouldn’t! Does this mean just because content is more accessible it is public domain?
I strongly believe that sites should not be “scraped”. If you want to share your data, you should have tools that will allow you to produce your data in the appropriate microformat. Even then, you should have some control over what is done with your content.
Lucky for us, the guys involved with all these microformat stuff already thought about these issues! They produced another microformat that will enable you to license your handywork called rel-license. In other words, even if you microformat your data, you are still able to control what is done with that data and what restrictions apply to that data.
I’d love to get Paul’s take on this, since he is the legal expert.
technorati tags:semanticweb, web3.0, microformats, legal, rel-license
Blogged with Flock
3 Comments to "Semantic legalities"
Spit it out!
Semantic Web Stii
Recent Posts
- Astalavista Wordpress!
- Lifestreaming and Twitter is making us lazy
- Days with my father
- Friday morning fail by a stripper
- Got Springleap!
- Afrigator vs Regator
- Don’t pirate music/movies! You might be forced to use Windows if you do…
- Pike > Python?
- Using Twhirl for FriendFeed
- Being anti-social SUCKS!
My Posse
- Jayx’s bloggy
- Gogo’s blog
- Go2 South Africa
- Stumble Upon
- Dave Duarte
- Wikipedia
- zlythern
- Max Kaizen
- Tresblue
- Mike Stopforth
- RafiQ
- Muti.co.za
- Employmint
- Danette’s Bloggy!
- Thinking Machine
- White African
- kiefpiet.co.za
- Skuff’s World
- Goozeberry
- Crossloop blog
- Crossloop
- Aquila Online
- Charl van Niekerk
- Derek Allard
- Code Igniter
- Carls
- Justin Hartman
- blik.co.za
- Stefano Sessa
- Uno de Waal
- Amplitude!
- bLaugh
- Tyler Reed
- Chris Rawlinson
- Stormhoek!
- 3am
- Mike Solomon
- Mobile Q and A
- Eric Edelstein
- Marc Forrest
- Imel Rautenbach
- Absolutewillie
- Vincent Maher
- Colin Daniels
- Groogle!
- Chilibean
- Paul Jacobson
- Ayelet
- Python Guru Neil
- Rails Guru Nic
- Beverley Merriman
- Miguel
- Nic Harrywhatshisname
- Chris iMod
- Geekrebel!
- Steven McD
- Belinda sweetheart!
- Henre Rossouw
- JPGeek
- Foxinni
- Adii
- Charl Norman
- Bandwidthblog
- Jason Bagley
- Simon Botes
- Auric Silverwing
- Mark Forrester
- Saul Kropman
- Fred Roed
- Sass Schultz
- Gregor Rohrig
- Catherine Lückhoff
- Toastmasters
- SAA
Filed in
- Afrigator (26)
- ajax (9)
- API (2)
- Apple stuff (10)
- Blogging (25)
- browsers (5)
- Business (28)
- Code Igniter (8)
- firefox (8)
- flock (14)
- Funnies (73)
- GeekDinner! (18)
- General and sometimes Rants (49)
- Go2SA (2)
- ideas 2.0 (14)
- javascript (12)
- Kick-ass Tools (30)
- Linux (5)
- Marketing (25)
- moo.ajax (4)
- mootools (6)
- Open Source (10)
- Programming (33)
- C# (1)
- PHP (13)
- Python (9)
- Ruby (on Rails) (9)
- RSS (5)
- Semantic Web (32)
- Social Web (57)
- Software Development (15)
- South Africa (33)
- Tagging (6)
- Techie stuff (22)
- Tshirts (3)
- Tutorials (42)
- Blogging (17)
- Flocking (6)
- muti.co.za (13)
- Web 2.0 (73)
- web development (20)
Past Stuff
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
- November 2006
- October 2006
- September 2006
















Hey Stii
No matter what the software does, I don’t believe anyone would lose their rights. If a site ripped content from Tertia’s site (touch wood it never does) and repackages it or something then it would probably be a breach of copyright. The same thing would apply to something like Creative Commons licenses and any other license where the software does something the author did not authorise.
I was thinking along the lines of a content “nofollow” tag just before I read about the rel-release microformat. Where there are engines that will collect content, there should be tools to manage that in accordance with content creators’ wishes.
rel-release or rel-license? If rel-release, send the link pleez!
Hi Stii
I don’t have a link for that thing, it was just something I was thinking about as a solution.