<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-5160001102716841899</id><updated>2011-07-08T12:52:23.740+03:00</updated><category term='Team'/><category term='Identity'/><category term='Social Web'/><category term='Business Model'/><category term='Ruby on Rails'/><category term='Feedback'/><category term='Web 2.0'/><category term='Recommender Systems'/><category term='Intellectual Property'/><title type='text'>Universal Recommendations</title><subtitle type='html'>A Discovery and Monetization Revolution on the Internet</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><generator version='7.00' 
uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>18</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-5349629908931671070</id><published>2009-11-26T01:20:00.005+02:00</published><updated>2009-12-03T00:40:16.920+02:00</updated><title type='text'>What is the Affinax Project?</title><content type='html'>&lt;p&gt;
&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_bu_aLR84YIg/Sw3BH4xu-9I/AAAAAAAAAEU/dTBtYoifeKk/s1600/viewlogo_affinax.png"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 120px; height: 100px;" src="http://1.bp.blogspot.com/_bu_aLR84YIg/Sw3BH4xu-9I/AAAAAAAAAEU/dTBtYoifeKk/s400/viewlogo_affinax.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5408191068786064338" /&gt;&lt;/a&gt;
The Affinax Project is attempting to solve one of the most complex and interesting puzzles on the internet: how to predict a person's future favorites in any domain of life (social, career, products, services, media, etc.). 
&lt;/p&gt;&lt;p&gt;
We have developed a completely novel technology, a cross-pollination of Bioinformatics and the Semantic Web. If we are right, it will make such predictions from a cold start, rapidly, without tracking a user's behavior, without data mining, without a lengthy registration process, and in any domain. It should do so without many of the drawbacks of existing recommender technologies.
&lt;/p&gt;&lt;p&gt;
Due to the nature of the technology, it is not possible to simulate or test it using Netflix data. We must build a live proof of concept with real users and real objects to match them with. This will require a few Facebook applications and some efficient algorithms.
&lt;/p&gt;&lt;p&gt;
We hope to evaluate the technology to academic standards in order to demonstrate that it works. Join us and help us solve this puzzle - any and all relevant skills are welcome.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-5349629908931671070?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/5349629908931671070/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=5349629908931671070' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/5349629908931671070'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/5349629908931671070'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2009/11/what-is-affinax-project.html' title='What is the Affinax Project?'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_bu_aLR84YIg/Sw3BH4xu-9I/AAAAAAAAAEU/dTBtYoifeKk/s72-c/viewlogo_affinax.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-4456585587353730100</id><published>2008-06-18T01:39:00.014+03:00</published><updated>2008-06-27T22:41:22.497+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Social Web'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><category scheme='http://www.blogger.com/atom/ns#' term='Business Model'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><title 
type='text'>Changing the World: The Business Model</title><content type='html'>&lt;p&gt;
What is the business model for the next internet revolution? In this article, I review web monetization issues, especially those of Web 2.0. I propose a monetization solution in which any site with users, commercial items, and even visitors can significantly increase its revenue and reduce marketing and advertising expenses. Our affinity targeting system monetizes itself in the process.
&lt;/p&gt;&lt;p&gt;
Traditionally, business models for web applications, communities, blogs, etc. are an afterthought. Apps and networking sites dream of reaching critical mass and then selling to Google, Microsoft, Yahoo, etc. Thus the &lt;a href="http://www.mappingtheweb.com/2007/05/28/everyones-web-20-revenue-model/"&gt;revenue model is actually an exit strategy&lt;/a&gt;. This dream has been fueled by the observation that the &lt;a href="http://esnips.blogs.com/about_esnips_/2007/04/web_20_ecosyste.html"&gt;purchase price for such sites is related to reach&lt;/a&gt; ("eyeballs", i.e. the size of the audience). This is reminiscent of &lt;a href="http://en.wikipedia.org/wiki/Metcalfe%27s%20law"&gt;Metcalfe's Law&lt;/a&gt;. A more thorough analysis of the &lt;a href="http://www.techcrunch.com/2008/06/23/modeling-the-real-market-value-of-social-networks/"&gt;market value of social networks&lt;/a&gt; was recently posted on TechCrunch by Michael Arrington.
&lt;/p&gt;&lt;p&gt;
&lt;a href="http://www.pascalrossini.com/wordpress_eng/?p=370"&gt;A very few&lt;/a&gt; fortunate web startup founders do not need to consider a business model beyond their big exit, even in the current economic climate. The new owners, however, will be forced to monetize their sexy new purchase. For the vast majority of web startup founders, the business model is important and is often considered and tested from the very start.
&lt;/p&gt;&lt;p&gt;
The default monetization method is advertising, &lt;a href="http://tomokeefe.com/2008/06/16/advertising-preferred-business-model-for-58-of-startups/"&gt;preferred by 58% of web startups&lt;/a&gt; (this figure includes affiliate marketing) according to &lt;a href="http://www.bizak.com/"&gt;Bizak&lt;/a&gt;. Of the sites that rely strictly on advertising, 54% use Google's AdSense. I imagine this number is higher for Web 2.0 social sites. Nonetheless, &lt;a href="http://tomokeefe.com/2008/06/17/product-sales-top-earnings-per-visitor-adsense-the-lowest-earnings/"&gt;AdSense earnings per visitor (EPV) are the lowest&lt;/a&gt; among the various monetization methods. As an example, Tom OKeefe writes about &lt;a href="http://tomokeefe.com/2008/06/11/mahalos-poor-google-adsense-earnings/"&gt;Mahalo's poor Google AdSense earnings&lt;/a&gt;, and Allen Stern predicts that &lt;a href="http://www.centernetworks.com/mahalo-adds-affiliate-links-revenue"&gt;affiliate revenue could surpass Google AdSense revenue&lt;/a&gt; for &lt;a href="http://www.mahalo.com/"&gt;Mahalo&lt;/a&gt; in the long term. Decrying AdSense as &lt;a href="http://tomokeefe.com/2007/12/14/adsense-has-become-worthless/"&gt;"worthless"&lt;/a&gt;, Tom OKeefe asks "What's Next?".
&lt;/p&gt;&lt;p&gt;
Many of the most popular sites are struggling to monetize. YouTube, for example, &lt;a href="http://tomokeefe.com/2008/06/04/monetizing-youtube/"&gt;is struggling to justify its $1.65 billion purchase price&lt;/a&gt;. Facebook also faces a rough road ahead, with "only" &lt;a href="http://online.wsj.com/article/SB120465155439210627.html"&gt;$150 million in ad sales in 2007&lt;/a&gt; and projections of &lt;a href="http://valleywag.com/390466/facebook-ad-revenue-projections-revised-down-to-265-million"&gt;$265 million in 2008&lt;/a&gt;. &lt;a href="http://www.readwriteweb.com/archives/the_ultimate_twitter_revenue_model.php"&gt;Aidan Henry proposes solutions&lt;/a&gt; to the "perennial debate surrounding &lt;a href="http://www.twitter.com/"&gt;Twitter&lt;/a&gt;'s revenue model", and the CEO of Mahalo, Jason Calacanis, chimes in with his own &lt;a href="http://www.calacanis.com/2008/01/02/the-three-business-models-that-make-twitter-a-billion-dollar-bus/"&gt;Twitter business model suggestions&lt;/a&gt;.
&lt;/p&gt;&lt;p&gt;
This struggle may no longer be necessary. Our novel Affinity Targeting technology allows a user to be targeted to the entities they are most likely to appreciate, in any domain of life. On-line communities and sites with users can increase their earnings by adding both their site and users into our system. Those users are then targeted to entities of interest (products, services, media, jobs, sites, other users, etc.). When targeting leads to a commercial transaction, it generates affiliate revenue, part of which is shared with the site that brought the purchasing user into the system. According to Bizak.com, affiliate earnings per visitor are 16 times greater than AdSense earnings. An affiliate model with highly specific user targeting should increase such earnings significantly.
&lt;/p&gt;&lt;p&gt;
The benefits don't end at monetizing eyeballs; sites and sellers can precisely target users to themselves and their items, thereby increasing sales and reducing costs. Communities, groups and fan clubs all seek to attract enthusiastic members. In our system, users will be targeted to the communities they are most likely to appreciate, leading to increased membership and more customers. Sellers and providers also benefit from the precise targeting of users to the products and services they are most likely to purchase. This will increase sales and reduce dependence on marketing, advertising, SEO, etc. All that sellers and providers are required to do is profile their products, services, jobs, etc. for the system (in the particular form the system needs) and agree to our affiliate model. There are no other costs to them.
&lt;/p&gt;&lt;p&gt;
&lt;a href="http://www.inspiralogy.com/model1f_ttl.png"&gt;&lt;img src="http://www.inspiralogy.com/model1f_ttlsm.png" style="width: 400px; height: 235px; display: block; margin-left: auto; margin-right: auto; border: none;"/&gt;&lt;/a&gt;
&lt;/p&gt;&lt;p&gt;
&lt;blockquote&gt;&lt;small&gt;The figure above (click to enlarge) depicts a solution to several critical needs: internet sites and sellers must increase their revenue, reduce expenses, and attract the most ideal new users or members.  In our solution, sites and sellers add their existing users (no private information is required) and/or items into the system. Users are then targeted (via the targeting engine) to three different kinds of entities (circles): other users (if they are so inclined), groups (sellers, sites, communities, etc.), and items (products, services, media, jobs, etc.). When a user is targeted to a commercial item and makes a purchase, the seller provides an affiliate fee to the system, part of which is shared with the group that brought the user to the system. Also, if a group added an affiliate item into the system that they are not directly selling (for example, an Amazon.com book), part of any affiliate fee earned from that item is shared with that group. Follow the green arrows to see the flow of money. Note that sites and sellers may contribute users and/or items, and users and/or items may be entered independently of a site or seller.&lt;/small&gt;&lt;/blockquote&gt;
&lt;/p&gt;&lt;p&gt;
Our plan is to grow the system organically by bootstrapping it on Facebook and OpenSocial. We will do this in a way where critical mass never becomes a significant issue. At a certain point the affinity matrix of objects will be large enough to attract sites, communities, sellers and providers. At that point, we will offer our own API, customizable web interface, or client software, such that a site and its users can interact with the system as the site sees fit. In the beginning we will use existing affiliate and payment processors, but eventually this will likely be done with our own systems. Our affinity engine and business model represent the ideal win-win solution for sites, sellers and users: better targeting, discovery, user satisfaction, monetization, reduced expenditures, etc. Ultimately, we see this targeting system attracting a significant fraction of on-line sites, communities and commercial entities.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-4456585587353730100?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/4456585587353730100/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=4456585587353730100' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/4456585587353730100'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/4456585587353730100'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/06/changing-world-business-model.html' title='Changing the World: The Business Model'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-4422516427235554589</id><published>2008-04-25T21:31:00.004+03:00</published><updated>2008-05-03T05:27:16.113+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Social Web'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title type='text'>The Affinity Graph</title><content type='html'>&lt;p&gt;
Is the Affinity Graph the anticipated Internet Singularity?
&lt;/p&gt;&lt;p&gt;
Tim Berners-Lee, the father of the World Wide Web, has been talking about this concept of the future &lt;a href="http://dig.csail.mit.edu/breadcrumbs/node/215"&gt;"Internet of things."&lt;/a&gt; By "things" he means the people and other objects on the internet, and he argues that those things and the connections between them are the key aspects of the web. This, he argues, is the primary evolution of the walled gardens of "Web 2.0" into something far more important. He calls this evolution the Giant Global Graph, while others call it Web 3.0 or the &lt;a href="http://www.computerworld.com.au/index.php/id;1524791164"&gt;Semantic Web&lt;/a&gt;.
&lt;/p&gt;&lt;p&gt;
The use of the term "Graph" has been met with a bit of consternation by those who argue that we already have the term "network" to describe these connections. &lt;a href="http://scobleizer.com/2007/09/22/dave-winer-says-i-sound-like-a-monkey/"&gt;Robert Scoble describes&lt;/a&gt; the difference with reference to social relationships: your social network is who you know, while your Social Graph describes who you are associated with based on common objects of interest (passions, concerns, politics, religion, work, school, etc.). He says: "&lt;i&gt;The Social Graph is NOT my social network. My Social Network is my friends list. But the Social Graph shows a LOT more than that&lt;/i&gt;." A Graph, then, is not simply the connections themselves, but also the types, contexts and strengths of those connections.
&lt;/p&gt;&lt;p&gt;
While the Graph will ultimately know what is currently song #3 on your iPod, some metadata about the song, as well as all the other people who have the same song as #3 on their iPods, one must wonder "what's the point"? How does this help me discover that I should be a dolphin trainer, or find new people who share my way of thinking? Once the monstrous amount of data on the Graph is accessible to robots, many will be applying data mining and filtering algorithms, and massive amounts of CPU time, to try to generate usable information about the people and other objects on the web.
&lt;/p&gt;&lt;p&gt;
Tim Berners-Lee envisioned the ability to create "intelligent agents", sort of like advanced email filters, to perform many of the more tedious tasks more easily and quickly. I talked about a similar kind of agent in the post "&lt;a href="http://unirec.blogspot.com/2008/03/your-identity-proxy.html"&gt;Your Identity Proxy&lt;/a&gt;". Real progress will be achieved when future technology is able to offer users a much more personalized and enjoyable experience, and of course better targeting of those users with commercial objects. In practical terms, this will require the storage of as much data as possible about users and their objects, so that futuristic computer programs will be able to make sense of the identities of those users and the meanings of those objects, and also to make predictions about the basic affinities between the objects and users. Some even predict that given enough information, "&lt;a href="http://youtube.com/watch?v=NLlGopyXT_g"&gt;the machine&lt;/a&gt;" will begin to transcend the metadata and attain a kind of sentience (or sapience).
&lt;/p&gt;&lt;p&gt;
This is similar to the ideas of Gary Flake, who hypothesized that continued advancements in networked information and other technologies will create a "virtuous cycle" leading to what he terms the &lt;a href="http://portal.acm.org/citation.cfm?id=1183617"&gt;"Internet Singularity"&lt;/a&gt;. As with the Global Graph, we are far from advanced enough technologically to see these concepts realized in the near future.
&lt;/p&gt;&lt;p&gt;
Let me propose that the Internet Singularity and the Global Graph are overlapping concepts, both largely achievable today through the Affinity Graph, a major element of this project. As of late 2007, we have had the technology to begin to store the affinity relationships and strengths between users and all other objects on the internet and mobile devices. This is a much simpler abstraction, where we store the most important kind of meaning (affinity) for the typical user. In other words, the most important benefits of the Graph or Singularity (searching, personalization, and discovery) can be generated, stored and queried in a much more feasible way than is predicted for the Graph, Semantic Web, or Singularity.
&lt;/p&gt;&lt;p&gt;
With the Affinity Graph, the similarity in meaning of objects, including people, will be known. Universal categorization, classification, hierarchies and affinity matching will all be made fairly trivial. Users will have immediate access to their future favorites in every domain of life; likewise objects (and those that care about them) will know which users are likely to most appreciate those objects (marketers? advertisers? evangelists?). This is the point at which the Utopian dreams of internet visionaries are realized. The Affinity Graph does not make irrelevant other forms of abstractions or metadata upon which computer scientists are free to set loose their strong AI. There are many other kinds of meaning, and those will be explored by computers in time.
&lt;/p&gt;&lt;p&gt;
The Singularity is here, as is the Global Graph, in ways that are most important to users.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-4422516427235554589?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/4422516427235554589/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=4422516427235554589' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/4422516427235554589'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/4422516427235554589'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/04/affinity-graph.html' title='The Affinity Graph'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-179135989865354797</id><published>2008-03-19T12:03:00.002+02:00</published><updated>2008-05-03T05:26:43.146+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title type='text'>Big World, Short Life</title><content type='html'>&lt;p&gt;
The world is big and life is short. We've solved this problem.
&lt;/p&gt;&lt;p&gt;
To restate the problem: in our short lives, we are unlikely to ever find the people and things that we would most enjoy and appreciate. This is unfortunate.
&lt;/p&gt;&lt;p&gt;
Have you been feeling the pain? Not finding your soulmate? No best buddies? Have the suspicion that the most incredible music is out there, somewhere? Feel like you never found your ideal vocation? Actually there is little chance you could have found the optimal things in life. As I mentioned in a &lt;a href="http://unirec.blogspot.com/2008/03/your-identity-proxy.html"&gt;previous post&lt;/a&gt;, it would take us thousands of years to meet every other human, listen to every song, read every book, evaluate every vocation, etc. 
&lt;/p&gt;&lt;p&gt;
Many of us have grown to accept our mortality, and the &lt;a href="http://www.walrusmagazine.com/articles/2006.04-essay-on-time/"&gt;tyranny of time&lt;/a&gt;. We've had to accept the limitations in the time we have to explore options and find those optimal things. This acceptance has silenced our normally inquisitive and innovative inclinations to find solutions to problems; it seems an insurmountable problem, and, frankly, dwelling on mortality is not entirely pleasant. Those who haven't accepted mortality will deny the existence of the problem and thus the need for a solution.
&lt;/p&gt;&lt;p&gt;
I didn't set out to solve the 'short life' problem. Actually, that's not entirely true - I'm a huge health and nutrition nut: I plan to be &lt;a href="http://healthyto120.com/"&gt;healthy to at least age 120&lt;/a&gt;. But in this post, and in this project, I'm not talking about extending human lifespan. It is the 'big world' problem that we are addressing, and the problem may not be so big after all.  The innovation came first, and then it occurred to me that the thousands of years it would take to find your favorites could be compressed significantly.
&lt;/p&gt;&lt;p&gt;
I'll use the analogy from a &lt;a href="http://unirec.blogspot.com/2008/03/your-identity-proxy.html"&gt;previous post&lt;/a&gt;. Many of us receive hundreds of emails every day. Without an email filter, it would take us hours to sort through and pick out the emails we prefer. We don't have enough time to perform this task, nor would we want to. The email filter, if it works well, presents to you only those emails that you are most likely to prefer. Reviewing your emails becomes a much quicker and simpler task. &lt;a href="http://pespmc1.vub.ac.be/CHINNEG.html"&gt;Information overload&lt;/a&gt; is reduced.
&lt;/p&gt;&lt;p&gt;
In a similar way, &lt;a href="http://unirec.blogspot.com/2008/02/forget-everything-you-know.html"&gt;our discovery engine&lt;/a&gt; sorts through thousands of years' worth of people, media, opportunities, ideas, causes, products, etc., and presents to you only those things you are most likely to prefer.
&lt;/p&gt;&lt;p&gt;
So you need no longer fear your own mortality. :-)
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-179135989865354797?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/179135989865354797/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=179135989865354797' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/179135989865354797'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/179135989865354797'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/03/big-world-short-life.html' title='Big World, Short Life'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-3891089938559432073</id><published>2008-03-12T18:35:00.001+02:00</published><updated>2008-03-12T18:57:08.782+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title type='text'>Your Identity Proxy</title><content type='html'>&lt;p&gt;There seems to be a bit of confusion about the distinction between the terms "identity" and "identification" in popular discussion. The terms are often used interchangeably, and are used differently in different contexts. 
I thought I would write a bit about these and other related concepts, including a new concept that we introduce.
&lt;/p&gt;&lt;p&gt;
The terms are infused with the complexity of multiple disciplines (philosophy, psychology, sociology, neurology, religion, etc.), each with its own usage and take on the meanings. To add to the complexity, identity is now an important concept with different meanings for government, commerce, and the internet.
&lt;/p&gt;&lt;p&gt;
Who are you? Are you different from your neighbor? From your identical twin? Is there something about you that distinguishes you from everybody else? The subjective versions of this are the "self-image" (a person's own model of his identity) and the identity perception of someone by others. Is it "the self" or the ego of psychology? Is this the "soul" of certain faiths? Is it the mind? The brain? What about the body? Is identity a product of nature, nurture, or both together? Many questions.
&lt;/p&gt;&lt;p&gt;
Most of us cannot be relied upon to accurately describe our identities, though sometimes best friends can get pretty close (we get closer, see below). This is the reason metadata contributed by users about themselves or their works is not considered accurate. A personal tag cloud is just an ego trip. It is highly subjective. Web page meta keywords are no longer relied upon by search engines or advertisers because they are so inaccurate. This is the source of the delay in the promised "Semantic Web" revolution.
&lt;/p&gt;&lt;p&gt;
I like to think of Identity as that mental thingy that distinguishes you from every other person. It is the objective, non-corporeal entity that is the sum of all the biology and environmental influences that constitutes what it is to be you, at this moment in your life. Despite the similarities, you have a different identity from your identical twin because your minds and bodies have had different experiences. You also have a different identity than yourself of one year ago because you've had new experiences... and of course your brain has suffered some oxidative degeneration (&lt;a href="http://www.healthyto120.com/health/tips/bad-fats.html"&gt;"vegetable oil", anybody?&lt;/a&gt;). 
&lt;/p&gt;&lt;p&gt;
But in the real world, and for the purposes of government, commerce, and most things that make the world work, it is the corpus that counts. You are you because you are contained within the body of you. Identity equals body: the body that is recognized as you by facial recognition, and authenticated by fingerprinting, retinal or corneal scanning, etc. Science fiction has enjoyed this mind-body identity confusion with numerous examples in movies and television (&lt;i&gt;"This body is not mine, and I have to be clever to convince my friends of my true identity"&lt;/i&gt;).
&lt;/p&gt;&lt;p&gt;
Now, &lt;a href="http://www.downes.ca/cgi-bin/page.cgi?post=12"&gt;identification is the assertion&lt;/a&gt; that you are actually you (&lt;i&gt;"I may look like a fly, but it's really me!"&lt;/i&gt;). Having the face of Nancy is an assertion that you are Nancy, i.e. your friends and family will identify you as being the identity they call "Nancy". Identical twins and masks can confuse the identification in opposite ways. 
&lt;/p&gt;&lt;p&gt;
One can authenticate their identification with some available mechanism that provides some level of authentication. Visual similarity to your picture ID card (is ID "identity" or "identification"?) is a common form, voice recognition on the phone is another common one; &lt;i&gt;"Hello, it's Nancy"&lt;/i&gt; works only if you sound like Nancy. We can authenticate the body fairly well, but the mind is more difficult (&lt;i&gt;"Nancy doesn't seem like herself today. Maybe she's been taken over by an alien."&lt;/i&gt;). 
&lt;/p&gt;&lt;p&gt;
On the internet, there are various uses for the terms identity, identification, authentication and anonymity. Your Facebook profile is a reflection of your identity, or an exhibition of your identity, most probably with identifying elements like your name and photos. In some cases you may have multiple online "identities" representing different facets of your actual identity. Those facets are sometimes identified by usernames and avatars indicative of the identity or sub-identity or idealized identity they represent. 
&lt;/p&gt;&lt;p&gt;
For a new user, identity may initially not be important: an anonymous user is self-contained, requiring no identification or authentication. But as other users get to know that user, they will expect that it is consistently backed by the same identity. As it develops a reputation, the identity behind that user identification will want to maintain exclusive ownership of that identification, via some kind of authentication that ensures such exclusivity.
&lt;/p&gt;&lt;p&gt;
There are many systems for authentication, each attempting to ensure that the user instance is an active reflection of the same identity. Online banking is an example. There are two things to authenticate here: first, that the owner of the username is the identity called "Nancy" with these identifying personal details; second, that the username instance (i.e. the just-logged-in identity) is also the "Nancy" identity (access management). The first is corpus related: Nancy walks into her bank and gets her login details based on corpus identity. The second is mind identity: does Nancy remember her username and password, or where she scribbled them?
&lt;/p&gt;&lt;p&gt;
Our project introduces another concept to the scene: the identity proxy. In our case, it is an objective proxy of your identity that makes choices on your behalf, likely the same choices you would make, even when you are not logged in. In a sense, it is like an email filter that follows your instructions and helps you deal with information overload by automating that small bit of your identity that prefers certain emails over others. Ours is much more powerful in reducing information overload because your identity proxy automates the filtration of all available information and options, in every domain of life. Your identity proxy is an accurate and objective reflection of your identity and it understands and automates your decision making processes. There is no greater weapon against the tyranny of choice and information overload. 
&lt;/p&gt;&lt;p&gt;
Without an email filter, it would take us hours per day to delete the spam and read the relevant emails. We would quickly lose patience and only find a fraction of real emails. Likewise, it would take us thousands of years to meet every other human, listen to every song, read every book, evaluate every vocation, etc., in order to find the ones we like. It's a big world, and, sadly, life is short. The identity proxy does not live our lives for us - it makes our lives richer by allowing us to find those things that we wouldn't have found unless we lived for thousands of years.
&lt;/p&gt;&lt;p&gt;
Also, at its core, the identity proxy requires no corpus identification, i.e. no personal or demographic details are necessary in the registration process. Nobody can use the registration information to track you down (track down the corpus). Privacy is intact.
&lt;/p&gt;&lt;p&gt;
Your identity proxy is singular. Having more than one identity proxy is a waste of time because every time you register accurately the system should see you as being identical (or close) to your previous proxy. Registering inaccurately serves no purpose because the proxy will make choices that do not reflect your identity, and the choices will not be as fulfilling for you.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-3891089938559432073?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/3891089938559432073/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=3891089938559432073' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3891089938559432073'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3891089938559432073'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/03/your-identity-proxy.html' title='Your Identity Proxy'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-1787155701139994809</id><published>2008-02-18T01:58:00.002+02:00</published><updated>2008-03-13T15:50:07.916+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Team'/><category scheme='http://www.blogger.com/atom/ns#' term='Feedback'/><title type='text'>Counsel of the Wise</title><content type='html'>&lt;p&gt;
One of the consistent recommendations to innovators is to seek the counsel of some very smart people who understand the field of the innovation. One must be open to objective critical feedback in order to determine if the innovation should be developed. Initially, I started with family and friends, but recently I sought the feedback and input of academics and businesspeople with whom I was not formerly acquainted. So far the response has been very positive. In fact, four of those smart people have volunteered to become members of the project's board of advisers. 
&lt;/p&gt;&lt;p&gt;
In addition to all of the great feedback from those meetings and follow-up discussions, the board has another important function - as references. Anyone who would so easily dismiss my claims of a novel disruptive internet paradigm, or the "preference engine" technology, must first reconcile the existence of this volunteer advisory board. My hope is that the added legitimacy of the board will further my contacts in the field, and especially among expert developers and angel investors. Here is the current board that can be contacted for references:
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;b&gt;Avi Noy&lt;/b&gt;, Ph.D. &lt;small&gt;[&lt;a href="http://research.haifa.ac.il/~avinoy/"&gt;Research Page&lt;/a&gt;, &lt;a href="http://www.linkedin.com/pub/0/ba/740"&gt;LinkedIn&lt;/a&gt;]&lt;/small&gt;&lt;br /&gt;
Adjunct Lecturer, Graduate School of Business, University of Haifa&lt;br /&gt;
PhD, Information Systems, The University of Haifa&lt;br /&gt;
MBA, Technion Institute, Haifa&lt;br /&gt;
LLB, The University of Haifa&lt;br /&gt;
BSc, Computer Engineering, Cum Laude, Technion Institute&lt;br /&gt;
&lt;/li&gt;&lt;li&gt;
&lt;b&gt;Doron Erblich&lt;/b&gt; &lt;small&gt;[&lt;a href="http://www.linkedin.com/in/doron"&gt;LinkedIn&lt;/a&gt;]&lt;/small&gt;&lt;br /&gt;
CEO, &lt;a href="http://www.noyasoftware.com/"&gt;Noya Software&lt;/a&gt;, Haifa&lt;br /&gt;
Center of Computing and Information Systems (&lt;a href="http://en.wikipedia.org/wiki/MAMRAM"&gt;Mamram&lt;/a&gt;), Israel&lt;br /&gt;
BA, Computer Science, Open University, Israel&lt;br /&gt;
&lt;/li&gt;&lt;li&gt;
&lt;b&gt;Yosi Dagan&lt;/b&gt;, MBA &lt;small&gt;[&lt;a href="http://www.linkedin.com/in/yosidagan"&gt;LinkedIn&lt;/a&gt;]&lt;/small&gt;&lt;br /&gt;
MBA, Heriot-Watt University, Edinburgh, Scotland&lt;br /&gt;
BSc, Electrical Engineering, Ben Gurion University&lt;br /&gt;
&lt;/li&gt;&lt;li&gt;
&lt;b&gt;Yuval Dan-Gur&lt;/b&gt;, Ph.D. &lt;small&gt;[&lt;a href="http://www.rada.com/rada/p_about/management_popup.asp?iid=773"&gt;Work Page&lt;/a&gt;, &lt;a href="http://www.linkedin.com/pub/0/8a8/bb9"&gt;LinkedIn&lt;/a&gt;]&lt;/small&gt;&lt;br /&gt;
VP Engineering, &lt;a href="http://www.rada.com/"&gt;RADA Electronic Industries&lt;/a&gt;&lt;br /&gt;
Winner of the Israel National Security Prize&lt;br /&gt;
PhD, Information Systems, The University of Haifa&lt;br /&gt;
MSc, Industrial and Management Engineering, Technion Institute&lt;br /&gt;
BSc, Electrical Engineering, Technion Institute&lt;br /&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-1787155701139994809?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/1787155701139994809/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=1787155701139994809' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/1787155701139994809'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/1787155701139994809'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/02/counsel-of-wise.html' title='Counsel of the Wise'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-531111191525041071</id><published>2008-02-12T21:33:00.001+02:00</published><updated>2008-02-13T19:10:20.024+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title type='text'>The Serendipity Revolution</title><content type='html'>&lt;p&gt;Traditionally, the success of recommender systems is evaluated by predicting accuracy of recommendations off-line using existing datasets. 
For example, see the &lt;a href="http://netflixprize.com/"&gt;million dollar Netflix prize&lt;/a&gt; for a meager 10% improvement of their collaborative filtering algorithm. Netflix provided access to 100 million of its customers’ movie ratings to train new algorithms and test them. In other words, the algorithm is &lt;a href="http://www-users.cs.umn.edu/~mcnee/mcnee-chi06-acc.pdf"&gt;judged more accurate&lt;/a&gt; the more it recommends movies the user has already seen. Recommendations based upon this traditional accuracy metric are not the most useful to users.
&lt;/p&gt;&lt;p&gt;
Researchers know that the success of recommendations is better measured by recording user satisfaction - the positive emotional response at having discovered something new that one likes. But that is more difficult to measure, as it requires a community of users and a useful mechanism to compel (or at least strongly encourage) the reporting of satisfaction, its strength and perhaps its type. Satisfaction with recommendations seems to increase in the following order of recommendation types:
&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;i&gt;Low quality, low accuracy recommendation.&lt;/i&gt; Users obviously don't appreciate having their time wasted in evaluating something that the system should have known the user would not be likely to appreciate. These are "trust-busters"; the user will lose trust in the system.&lt;/li&gt;
&lt;li&gt;&lt;i&gt;An accurate, but known recommendation.&lt;/i&gt; An item the user is already aware of. The user likes the item, but it is not novel. Trust is maintained because at least the system recommended something that the user already likes. Too many of these recommendations imply an excess number of false-negatives or "missed opportunities".&lt;/li&gt;
&lt;li&gt;&lt;i&gt;A novel, but obvious recommendation.&lt;/i&gt; A novel recommendation is something new and appreciated, but something the user would have discovered on his/her own. For example, a new song from a favorite musician, or a new movie from a favorite director. The user will have a positive, though muted, reaction. Many users will suspect that there were "missed opportunities", given the huge number of unfamiliar items in any domain.&lt;/li&gt;
&lt;li&gt;&lt;i&gt;A serendipitous recommendation.&lt;/i&gt; A serendipitous recommendation is something new, non-obvious and appreciated that the user would likely not have discovered on his/her own. For example, an unfamiliar song from an unfamiliar musician, or an unfamiliar movie from an unfamiliar director. The user will likely have a very positive reaction, though it has been argued that, in some users, such recommendations may be seen as obscure and not immediately appreciated.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;
The serendipitous recommendation is obviously the ideal for most users. The problem is that collaborative filters tend to focus on what is commonly known and popular - items that the user has heard about or items that the user would have experienced eventually because of their "&lt;a href="http://glinden.blogspot.com/2007/10/recommender-systems-and-diversity.html"&gt;blockbuster nature&lt;/a&gt;". Many of the most interesting items for the user may be buried in the "long tail", so some collaborative filtering systems have tweaked their algorithms to maximize this type of recommendation by reducing the more popular recommendations. Even so, recommendation diversity tends to be reduced in collaborative filtering systems, leading to a large number of false-negatives or "missed opportunities".
&lt;/p&gt;&lt;p&gt;
Recommendations based on a user's core identity will not focus on the popular, or items from artists or directors the user likes, or that the user's friends like. Instead, the user will be recommended items from the entire item landscape that by definition the user is most likely to appreciate based on that core identity (their "preference engine"). Thus the recommendation diversity (coverage of item space) within a domain (such as music) is as large as the diversity of items within that domain, leading to a large number of serendipitous recommendations - possibly the vast majority. Keep in mind that the number of domains in our community is also unlimited, and the same core identity can be used to recommend anything and everything in life.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-531111191525041071?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/531111191525041071/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=531111191525041071' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/531111191525041071'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/531111191525041071'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/02/serendipity-revolution.html' title='The Serendipity Revolution'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-3998066952409894045</id><published>2008-02-04T23:06:00.009+02:00</published><updated>2008-03-31T08:11:06.888+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Ruby on Rails'/><category scheme='http://www.blogger.com/atom/ns#' term='Feedback'/><category scheme='http://www.blogger.com/atom/ns#' term='Social Web'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Intellectual Property'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title 
type='text'>Forget Everything You Know</title><content type='html'>&lt;p&gt;
I'm getting a relatively large amount of email feedback, now that I'm putting a little effort into PR - and much of the feedback reflects existing biases. Recommender and social technologies have largely plateaued, with new innovations being relatively minor and more fine-tuning in nature. The possibility that something is being developed that is completely new, disruptive, and significantly better than existing services is not something many in the field are willing to consider. Admittedly, the secrecy of this project has not helped to convince people of that possibility, but this emotional rejection often takes the form of certain mistaken assumptions.   
&lt;/p&gt;&lt;p&gt;
So I'll make another attempt to preempt the preconceptions and misconceptions with a discussion of some of the mistaken assumptions, both technical and paradigm-related, from the feedback. We can call this the Frequently Made Assumptions (FMA) list:
&lt;/p&gt;
&lt;dl&gt;
&lt;dt&gt;There are so many social networks, nobody wants another one.&lt;/dt&gt;
&lt;dd&gt;Our system is not a social net, but one effect is that it creates a completely new community paradigm. Immediately after registration, we can quantify a person’s relationship to every other person, item, idea, endeavor, etc. in the system. Those relationships emerge and are presented to the new user as options and suggestions that then lead to further exploration, enjoyment, socializing, purchases and perhaps community involvement. So you don’t NEED to have (or get) your friends or contacts on the system to derive value. &lt;/dd&gt;
&lt;dt&gt;There are so many recommender sites, and the results are lukewarm.&lt;/dt&gt;
&lt;dd&gt;Instead of looking at your past choices and making certain assumptions, we are looking at the CAUSE of your preferences. We call this your "preference engine", and it yields results of far higher accuracy and quality than existing recommender systems. Many of the recommendations will be serendipitous and cover the "long tail" as well. We don't care what your friends prefer, or what your past choices were. We know what you will like - from the inside.&lt;/dd&gt;
&lt;dt&gt;"Web 2.0" stuff like this has no business model beyond advertising. I can't see any concrete business or how you want to generate revenue from your invention.&lt;/dt&gt;
&lt;dd&gt;Imagine that every user has immediate access to their future favorites of: people, music, movies, books, recreation, groups, products, services, ads, travel destinations, vocations, jobs, teams, politics, religion, ideas, websites, articles, news items, games, etc. Highly successful affiliations of all types, direct sales and downloads, and highly targeted advertising would be the obvious business model components. Note that even advertisements are objects in our system, and can be targeted with person-level granularity.&lt;/dd&gt;
&lt;dt&gt;No idea is new, no matter how good it is, at least 50 other people have thought of it.&lt;/dt&gt;
&lt;dd&gt;Show me.&lt;/dd&gt;
&lt;dt&gt;The future of the internet is the Semantic Web, haven't you heard? &lt;/dt&gt;
&lt;dd&gt;Yeah, I've been hearing that for a long long time. The semantic web is too primitive and highly problematic. Certain aspects will be adopted by some, but there will be no utopia there.&lt;/dd&gt;
&lt;dt&gt;Are you trying to promote just an idea?&lt;/dt&gt;
&lt;dd&gt;We have the paradigm, a test site, and three patent applications. The test site is the first part of the application, which I am developing in Ruby on Rails. It is operating currently and in the process of being tested by select individuals. I am trying to form a team to develop the second part, which will require some sophisticated social software development and efficient algorithms programming. &lt;/dd&gt;
&lt;dt&gt;LinkedIn (or some other site) is doing the exact same thing!&lt;/dt&gt;
&lt;dd&gt;No, LinkedIn is not doing anything even remotely similar. If it’s not clear what our system does, read on.&lt;/dd&gt;
&lt;dt&gt;Why would visitors want to come to your site or spread the word?&lt;/dt&gt;
&lt;dd&gt;Again, imagine that every user has immediate access to their future favorites of: people, music, movies, books, recreation, groups, products, services, ads, travel destinations, vocations, jobs, teams, politics, religion, ideas, websites, articles, news items, games, etc. The system itself acts as a good friend that knows you best, and understands all. A personal and trusted relationship will develop with each user. Users need not reveal any personal information, so privacy is intact. Add these to the benefits of existing recommender systems and social and professional networks, and I think you will agree that popularity will not be an issue.&lt;/dd&gt;
&lt;dt&gt;Nobody likes unlimited connectivity, you are creating a flood of people/info that will overwhelm users.&lt;/dt&gt;
&lt;dd&gt;This is about reducing information overload and the tyranny of choice – it’s not about creating unlimited connectivity. It’s about giving the user access to only the things he/she is most likely to appreciate: people and every other thing in life. This is the holy grail of internet futurists and marketers, etc. There will be no unwelcome flood – only welcome and surprising options.&lt;/dd&gt;
&lt;dt&gt;People only want to interact with their primary network. They don't want to be bothered by strangers.&lt;/dt&gt;
&lt;dd&gt;First, we are not trying to be another Social Network. Social Networks separate people as nodes or degrees of separation. Separation is not good if it keeps away ideal relationships. But removing those separations, while preventing a flood, requires new kinds of filtration. A user’s potential ultimate friends and romantic partners, ideal business partners, etc. most likely lie far outside of one’s immediate network. Our novel discovery techniques provide the filter, thus there is no flood – only the most ideal are presented as options for the user.&lt;/dd&gt;
&lt;dt&gt;Are you just creating another matchmaking site?&lt;/dt&gt;
&lt;dd&gt;Objects in our system also include people: we expect matchmaking to be a significant part of the interest in our system. But remember that people can also be matched with every other kind of object, so the system is much, much more. In addition, user-user matching can be used to discover friendships, business partnerships, roommates, travel buddies, etc.&lt;/dd&gt;
&lt;dt&gt;Are you just creating a technology to sell to the Amazon.coms of this world so they can make better business?&lt;/dt&gt;
&lt;dd&gt;The discovery engine is not separable from the community. The community must be created (possibly by aggregation or integration of existing communities), but it’s not a social network, nor just a recommendation engine. It’s a holistic and completely new paradigm, and it is destined to change the world.&lt;/dd&gt;
&lt;dt&gt;Are you just creating yet another standalone dotcom, or will this be a feature of other services? Folks are fatigued with so many separate services.&lt;/dt&gt;
&lt;dd&gt;The discovery engine is not separable from the community; however, the community need not be a standalone data island - the format is ideal for integration of multiple services. But even as a standalone entity, the novel paradigm will make it quite popular. Others will want to integrate and take advantage of our universal recommendation system.&lt;/dd&gt;
&lt;dt&gt;You must be constructing a commonality map. This is not new.&lt;/dt&gt;
&lt;dd&gt;There is no “commonality map”, no use of shared things to relate people. This is not a people-centric or shared object-centric social model, as everything within our system is an object. It is beyond the "object-centered" or "people-only" debate. You’ll have to put aside your knowledge of existing methods and community/recommender concepts in order to understand the new disruptive paradigm.&lt;/dd&gt;
&lt;dt&gt;You must be using data mining edge activity to produce (more) accurate node profiles. This is not new.&lt;/dt&gt;
&lt;dd&gt;Our system does not involve data mining. Data mining generates very poor quality information compared to the kind of information we get. As I said, ours is something completely new.
&lt;/dd&gt;
&lt;dt&gt;You must be using existing algorithms and filtering methods. Collaborative filtering? Artificial intelligence? Content Based? Horting? &lt;/dt&gt;
&lt;dd&gt;Ours is a completely new paradigm, not horting or any of the other existing algorithms or systems. It is not a collaborative filter of any sort. It does not rely on personal or demographic data and it is not content based, requiring feature analysis. It is something completely novel.&lt;/dd&gt;
&lt;dt&gt;Registration involves Psychological testing and interpretation.&lt;/dt&gt;
&lt;dd&gt;Psychologists will be very interested in what we are doing, but we are not using Psychological testing and interpretation. There are no Rorschach inkblots or squiggles and shapes, no questionnaires, no puzzles, etc.&lt;/dd&gt;
&lt;dt&gt;Registration involves some sort of metaphysical or astrological process.&lt;/dt&gt;
&lt;dd&gt;No pseudoscience, metaphysics or mysticism, I promise. It is a completely logical and quantitative process. Hard to imagine, I know, but I assure you it is quite legitimate. In the future, PhD theses will be based on it.&lt;/dd&gt;
&lt;dt&gt;Registration involves personal and demographic data used to enable audience segmentation.&lt;/dt&gt;
&lt;dd&gt;No personal or demographic information needs to be entered at any time. Audience segmentation is a severe limitation to behavioral targeting and recommendations. Our system provides object-level granularity.&lt;/dd&gt;
&lt;dt&gt;You fill in a sparse matrix to infer missing relationships and preferences.&lt;/dt&gt;
&lt;dd&gt;The matrix is dense from the very start. No missing relationships are inferred and filled in.&lt;/dd&gt;
&lt;dt&gt;Core identity is impossible to determine, and certainly not in a quick registration. &lt;/dt&gt;
&lt;dd&gt;Don't judge what is impossible if you don't understand it. I would be happy to arrange a meeting with you to demonstrate how it works. If you are not willing to listen, then you deserve the box that imprisons you.&lt;/dd&gt;
&lt;dt&gt;How can you derive someone's essential identity from a name and password?&lt;/dt&gt;
&lt;dd&gt;Now THAT would be magic! Of course the answer is that we don't. There is a quick one-page registration and we get their essential identity from this - more than you can possibly imagine. But this is not the essential innovation, only a very cool result. &lt;/dd&gt;
&lt;dt&gt;Why universal? Wouldn't it be easier to stick with a single vertical domain?&lt;/dt&gt;
&lt;dd&gt;Our system prefers diversity, the more object types the better. It can operate on only one domain, but this would be a huge waste. Don’t forget, everything in life can be discovered, separately or simultaneously. The difference in complexity, between one domain and universality, is not great.&lt;/dd&gt;
&lt;dt&gt;How are you going to compile a database of all the "objects" in the world? This is no trivial task!&lt;/dt&gt;
&lt;dd&gt;Our staff will not have to compile anything (as with Pandora), nor are we relying on users to enter feature data. Whatever you're thinking - that ain't it. We are using a completely novel approach.&lt;/dd&gt;
&lt;dt&gt;No matter how you do it, reaching critical mass is gonna be a bitch.&lt;/dt&gt;
&lt;dd&gt;Ours is a slave to the network effect like all the others. However, as long as the system is sufficiently populated with objects of any variety (people OR things), the system will begin to work. In other words, with the right conditions the first user can see benefit. The obvious advantages over social nets, recommender applications, and the multitude of vertical applications should spread the word quickly. Most agree that popularity would not be an issue.&lt;/dd&gt;
&lt;dt&gt;You don't reveal the "magic" on your blog, so the project must be hype and at least appears not to be serious.&lt;/dt&gt;
&lt;dd&gt;First let me apologize for the lack of technical info in this blog. Certainly this will be off-putting to some, but I assure you that it is very serious. This is not an academic project - it is business, and the lack of technical information is simply a necessity for protecting confidential information. In the world of internet start-ups, innovations are rarely disclosed publicly before the first public beta site. Patents offer some protection, but there is no replacement for a healthy maintenance of secrecy at this early stage. I am happy to disclose the technical solutions to appropriate individuals under confidentiality assurances. Contact me at the email address in my profile to arrange a meeting in Israel or the States.
&lt;/dd&gt;
&lt;dt&gt;Why be so stealthy - this is not the Manhattan Project!&lt;/dt&gt;
&lt;dd&gt;If this really IS the next internet revolution as we suspect, then we shouldn't treat it as just another cool web app. We’re planning to change the world.&lt;/dd&gt;
&lt;dt&gt;I can't see the purpose of this blog.&lt;/dt&gt;
&lt;dd&gt;Consider the articles in this blog to be teasers for the purpose of attracting interest from those with vision and curiosity. We are looking for investors and hackers (highly competent programmers and developers). Do you know any?&lt;/dd&gt;
&lt;dt&gt;If the system performs as you say, then it will clearly be disruptive. But these claims are a bit extraordinary... why should I take time to arrange a meeting to learn more?&lt;/dt&gt;
&lt;dd&gt;Everyone's gotta gamble with their time - weighing the potential upside, probabilities, and time spent. I know the frustration of wasting my time hearing a stupid idea. But natural curiosity keeps me listening, just in case. Contact me at the email address in my profile to arrange a meeting in Israel or the States and let me change your world. In fact, invite a group of the smartest web-focused people (including angel investors and hackers) you know.&lt;/dd&gt;
&lt;/dl&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-3998066952409894045?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/3998066952409894045/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=3998066952409894045' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3998066952409894045'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3998066952409894045'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/02/forget-everything-you-know.html' title='Forget Everything You Know'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-3403905421556958155</id><published>2008-01-30T11:53:00.000+02:00</published><updated>2008-02-05T19:35:29.994+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Feedback'/><title type='text'>Skepticism vs. Cynicism</title><content type='html'>&lt;p&gt;
I just received a response email from a professor at a large American university:
&lt;/p&gt;
&lt;blockquote&gt;"I'm sorry, but your presentation of your idea makes it sound like the
equivalent of a perpetual motion machine, as you promise a secret
ingredient that will figure out the core of who an individual is without
essentially no data about them.
&lt;br /&gt;&lt;br /&gt;
I wish you good luck, but I'm afraid I'm not willing to invest any
further time in exploring your ideas. "&lt;/blockquote&gt;
&lt;p&gt;
Now, I appreciate feedback and certainly don't want to discourage it - but I thought this was a good opportunity to review some of the objections to this project and the limited technical information disclosed. What can we draw from the above email?
&lt;/p&gt;&lt;ol&gt;
&lt;li&gt;A perpetual motion machine would be revolutionary if it were true, and so would this project. &lt;/li&gt;
&lt;li&gt;A perpetual motion machine cannot exist because it violates the laws of physics, and so would this project violate our expectations about what is possible.&lt;/li&gt;
&lt;li&gt;Anyone proposing such a revolution/violation is not worthy of further consideration, and the ideas are not worthy of further exploration; even a meeting would be a waste of time.&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;
I am a skeptic myself, and so I certainly appreciate skepticism, but our project does not violate any laws of physics. It sounds fantastic, but I assure you the algorithm is sound. The professor claims we have "&lt;i&gt;essentially no data about them&lt;/i&gt; [users]" - though obviously we know a huge amount about a registering user. The difference is that we get information that can be used to quantify relationships, and the information can be gathered during a quick one-page registration. Teaching the machine about the "&lt;i&gt;core of who an individual is&lt;/i&gt;", i.e. a person's 'preference engine', has never been done - and if this professor can't do it, certainly nobody can. Meetings with three local professors in the last two weeks produced the expected jaw-dropping effect, and because of them I may now have a linear algebra option for the core algorithm, though I'm still not convinced. 
&lt;/p&gt;&lt;p&gt;
Anyway, this emailing professor has chosen not only not to meet, but seems annoyed at having been exposed to such ridiculous suggestions. I also understand a certain amount of frustration that the technical solutions, enabling full evaluation, are not disclosed (though I doubt the emailing professor would have read that far). Readers must also understand that technical solutions cannot be made public, despite patent-pending status. There are too many horror stories about disclosed innovations being stolen by large, well-lawyered corporations. I am willing to meet, traveling considerable distance, in order to satisfy that skepticism. Vision, curiosity and open-mindedness should be sufficient for a meeting, and perhaps this is a useful filter: why would I want to associate with those without these traits?
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-3403905421556958155?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/3403905421556958155/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=3403905421556958155' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3403905421556958155'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3403905421556958155'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/01/skepticism-vs-curiosity-and-vision.html' title='Skepticism vs. Cynicism'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-6239206517454084825</id><published>2008-01-14T12:13:00.001+02:00</published><updated>2008-01-28T17:20:59.382+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Social Web'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title type='text'>One Degree of Separation</title><content type='html'>&lt;p&gt;
Social networks rely on your primary network - your existing friends and contacts - to introduce you to THEIR friends and contacts. Each person in the network is called a 'node', and each node has one or more connections to other nodes.
&lt;img src="http://identiverse.com/images/socntwk.jpg" style="width: 300px; display: block; margin-left: auto; margin-right: auto; border: none;"/&gt;
Each of those connections is sometimes called a degree of separation; a friend of a friend (FOAF) would then be two degrees of separation. The famous phrase "six degrees of separation" was based on work by psychologist Stanley Milgram, who determined that any two Americans, connected in the nation-wide extended network, are separated by an average of five intermediaries, i.e. six steps or degrees.
&lt;/p&gt;&lt;p&gt;
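A rough way to make the degrees of separation described above concrete is to count the hops on the shortest friendship chain between two people. The following is only an illustrative Python sketch: the tiny friends graph, its names, and the degrees_of_separation helper are all made up for this example and do not describe how any of the services mentioned here work.
&lt;/p&gt;
&lt;pre&gt;
```python
from collections import deque

# Hypothetical friendship graph: each person maps to their direct contacts.
friends = {
    "ann": ["bob", "cat"],
    "bob": ["ann", "dan"],
    "cat": ["ann"],
    "dan": ["bob", "eve"],
    "eve": ["dan"],
    "zed": [],
}


def degrees_of_separation(graph, start, goal):
    """Breadth-first search; returns the hop count, or None if unconnected."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        person, hops = queue.popleft()
        for contact in graph.get(person, []):
            if contact == goal:
                return hops + 1
            if contact not in seen:
                seen.add(contact)
                queue.append((contact, hops + 1))
    return None


print(degrees_of_separation(friends, "ann", "bob"))  # a direct friend
print(degrees_of_separation(friends, "ann", "dan"))  # a friend of a friend
print(degrees_of_separation(friends, "ann", "zed"))  # no chain at all
```
&lt;/pre&gt;
&lt;p&gt;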
Despite their connectedness, two people separated by so long a chain are extremely unlikely to ever meet. In fact, we usually only ever meet the friends of our friends: an extremely small fraction of the larger network. Web services like &lt;a href="http://www.linkedin.com"&gt;LinkedIn&lt;/a&gt;, the business contact network, track your chain to three degrees of separation - though I wonder how often the third degrees ever connect. [Friendster tracked the chain even further, and this pursuit has been &lt;a href="http://www.inc.com/magazine/20070601/features-how-to-kill-a-great-idea.html"&gt;credited with Friendster's downfall&lt;/a&gt;, as tracking long chains is very difficult computationally and has much larger hardware requirements.]
&lt;/p&gt;&lt;p&gt;
Online Social Networks are &lt;a href="http://seattletimes.nwsource.com/html/living/2003292646_onlinebacklash09.html"&gt;not really social&lt;/a&gt;, and the network - as degrees of separation - serves mostly to separate. So, if one really wants to &lt;a href="http://unirec.blogspot.com/2008/01/social-standardization-and-death-of.html"&gt;'kill' social nets&lt;/a&gt;, one needs to get rid of the 'net' (the multiple degrees of separation that separate people) in order to bring people together. Jyri Engeström argues that social networks should not be based on individual connections between people that can be counted and accumulated; rather, people must be &lt;a href="http://www.zengestrom.com/blog/2005/04/why_some_social.html"&gt;connected by shared objects&lt;/a&gt;. We agree and take this to the next level by making everything in the virtual community an object, where each object is connected to every other object.
&lt;/p&gt;&lt;p&gt;
&lt;b&gt;The New Paradigm&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;
As proposed in the last post, what is lacking in the &lt;a href="http://unirec.blogspot.com/2008/01/social-standardization-and-death-of.html"&gt;current data islands and the proposed schema solutions&lt;/a&gt; is a way of harnessing the true power of the collective to actually reduce information overload and increase discovery. This will require a revolution in content and relationship discovery that can only arise with a completely new kind of information filtration and recommender technology.
&lt;/p&gt;
&lt;blockquote&gt;"&lt;i&gt;The social web will be powered by recommender systems&lt;/i&gt;".&lt;br /&gt;
&lt;small&gt;&lt;a href="http://blog.recommenders06.com/wp-content/uploads/2006/09/riedl.pdf"&gt;Open Issues in Recommender Systems&lt;/a&gt;&lt;br /&gt;
John Riedl, Bilbao Recommenders School, 2006&lt;/small&gt;
&lt;/blockquote&gt;
&lt;p&gt;
The true power of the collective will be realized with the proper integration of social media, new universal discovery techniques, and associated detailed portable identity and personalization info. The result is a Social Web based on &lt;i&gt;one degree of separation&lt;/i&gt;: all people and things are related to each other directly, with each such relationship differing only in type and strength. The following graphic is a representation of such a "one degree" circle of people relationships, but keep in mind that each person is also similarly related to all items, ideas, endeavors, etc. in the system as well.

&lt;img src="http://identiverse.com/images/onedegree.jpg" style="width: 300px; display: block; margin-left: auto; margin-right: auto; border: none;"/&gt;

&lt;/p&gt;&lt;p&gt;
Critical to this new paradigm are the new universal discovery techniques that I've hinted at previously. &lt;a href="http://unirec.blogspot.com/2008/01/current-recommender-types-2.html"&gt;Current recommender systems&lt;/a&gt;, including collaborative filters, are too primitive and limited to accomplish the task. Instead, we have applied certain bioinformatics concepts to solve the puzzle of simulating the &lt;a href="http://unirec.blogspot.com/2008/01/cause-vs-effect-of-human-preference.html"&gt;human preference engine&lt;/a&gt; without requiring "strong AI". This starts with a quick determination of a person's "core identity", the internal mechanism responsible for generating appreciation, sifting through the chaos, and making choices.
&lt;/p&gt;&lt;p&gt;
Determining that "core identity" is a critical breakthrough as it allows us to quantify the relationship (strength and type) between all people, and between all people and all other things in the system. It also can yield portable data that can be used to quantify such relationships between users and items from multiple data islands, and can even be used in mobile devices and in real-world activity. This discovery system involves no collaborative filtering, psychological testing or interpretation, statistical or stochastic methods, etc. 
&lt;/p&gt;

&lt;blockquote&gt;"&lt;i&gt;But there is no go-to discovery engine - yet. Building a personalized discovery mechanism will mean tapping into all the manners of expression, categorization, and opinions that exist on the Web today. It's no easy feat, but if a company can pull it off and make the formula portable so it works on your mobile phone - well, such a tool could change not just marketing, but all of commerce&lt;/i&gt;."&lt;br /&gt;
&lt;small&gt;&lt;a href="http://money.cnn.com/magazines/fortune/fortune_archive/2006/11/27/8394347/"&gt;The race to create a 'smart' Google&lt;/a&gt;&lt;br /&gt;
by Jeffrey M. O'Brien, writing for Fortune Magazine&lt;/small&gt;
&lt;/blockquote&gt;

&lt;p&gt;
In addition to the current benefits of the social web, the integration of these universal discovery techniques will allow:
&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A brief one-page registration with no need for private information. Qualifies as 'Cold-Start' for people and also items, ideas, endeavors, etc.&lt;/li&gt;
&lt;li&gt;Immediate access to promising relationships of all types, i.e. &lt;i&gt;universal recommendations&lt;/i&gt;. These relationships are the predicted interest and affinity between a person and all other people, music, movies, books, recreation, groups, products, services, ads, travel destinations, vocations, jobs, teams, politics, religion, ideas, websites, articles, news items, games, etc.&lt;/li&gt;
&lt;li&gt;Portable data that can be compared and relationships quantified. This portable data can be used between social and data islands, for mobile devices and in real-world activity.&lt;/li&gt;
&lt;li&gt;No language or cultural barriers: no folksonomy or semantic constraints.&lt;/li&gt;
&lt;li&gt;No need for existing relationships. Emphasis is on relationship discovery, though existing friends and contacts are revealing.&lt;/li&gt;
&lt;li&gt;No need to observe history of actions and choices. A one-page registration is enough to provide significantly more information, and better information, than collaborative filters can accumulate.&lt;/li&gt;
&lt;li&gt;The new system will act as a &lt;a href="http://unirec.blogspot.com/2008/01/cause-vs-effect-of-human-preference.html"&gt;good friend&lt;/a&gt; who knows you well and delivers trusted recommendations of all types, both solicited and unsolicited.&lt;/li&gt;
&lt;li&gt;Reduced privacy concerns as personal or demographic data is unnecessary.&lt;/li&gt;
&lt;li&gt;Automatic person-level granularity. Each relationship has a strength and type.&lt;/li&gt;
&lt;li&gt;Universal recommendations allow for highly successful affiliations of all types, direct sales and downloads, and highly targeted advertising as the diverse &lt;a href="http://blogs.mediapost.com/spin/?p=1020"&gt;business model&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Ratio of discovery to effort is high. No need for constant messages, spam, requests, friend searches, etc. &lt;/li&gt;
&lt;li&gt;Discovery is filtration, so '&lt;a href="http://www.winextra.com/2007/11/28/information-overload-has-become-an-avalanche/"&gt;information overload&lt;/a&gt;' and the '&lt;a href="http://www.sciammind.com/print_version.cfm?articleID=00056941-1933-1196-906983414B7F0000"&gt;tyranny of choice&lt;/a&gt;' are greatly reduced.&lt;/li&gt;
&lt;li&gt;Enables highly personalized search engine functionality, news aggregation, and many other forms of person-level information filtration.&lt;/li&gt;
&lt;li&gt;Constant excitement of discovery, so no "what's next?" reaction. No limit to novelty and interest, little boredom. No feeling of wasted time.&lt;/li&gt;
&lt;li&gt;Highly useful and usable: the keys to success of any product or service.&lt;/li&gt;
&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-6239206517454084825?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/6239206517454084825/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=6239206517454084825' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/6239206517454084825'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/6239206517454084825'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/01/one-degree-of-separation.html' title='One Degree of Separation'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-4392185234223581805</id><published>2008-01-08T13:33:00.000+02:00</published><updated>2008-01-13T20:51:38.938+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Social Web'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><title type='text'>Social Standardization and the Death of Social Networks</title><content type='html'>&lt;blockquote&gt;"&lt;i&gt;...we’re reaching an inflection point where some fundamental conceptions of the web (and social networks) need to change&lt;/i&gt;".&lt;br /&gt;
&lt;small&gt;from &lt;a href="http://factoryjoe.com/blog/2007/09/20/stop-building-social-networks/"&gt;Stop building social networks&lt;/a&gt;, by Chris Messina&lt;/small&gt;&lt;/blockquote&gt;
&lt;p&gt;
It seems that everybody is predicting the end of something due to something else, typically calling the latter a 'killer app'. Are VOIP and email replacing the phone and fax? Is social media replacing Google search, &lt;a href="http://bokardo.com/archives/social-networks-are-killing-email/"&gt;email&lt;/a&gt;, communication in general? Is IM replacing email? Well, who would have predicted that the trusty typewriter would disappear in the span of a few years? It seems many are making another prediction: Social Nets are on their way out, at least in their current configuration. In this post, I'll talk about the problems and proposed solutions.
&lt;/p&gt;&lt;p&gt;
Social Nets are hugely popular and are obviously doing something right. They were clearly a revolution in online communication and information sharing. Let's first list why people enjoy them. They allow you to:
&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;express yourself and try to look cool&lt;/li&gt;
&lt;li&gt;people-watch / voyeurism / "gawk at strangers"&lt;/li&gt;
&lt;li&gt;'collect friends' and compete to see who has more&lt;/li&gt;
&lt;li&gt;waste time doing semi-fun alone stuff with apps, etc.&lt;/li&gt;
&lt;li&gt;keep in touch with existing friends (the primary network)&lt;/li&gt;
&lt;li&gt;make new friends, dates and business contacts (the largely unfulfilled promise of the 'network')&lt;/li&gt;
&lt;li&gt;manage your personal data&lt;/li&gt;
&lt;li&gt;exchange knowledge and information&lt;/li&gt;
&lt;li&gt;re-connect with old friends and colleagues&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;
As for the negatives, here are some of the points mentioned on the blogosphere:
&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;'Friend collecting' is not 'social'. No real communication takes place, and no real friends are made. Checkmarking someone as a friend is not being social. Not much relationship building going on.&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.winextra.com/2007/11/28/information-overload-has-become-an-avalanche/"&gt;Information Overload&lt;/a&gt; is not reduced, quite the opposite: too many people, messages, spam, etc. There is a limit to our ability to absorb information: our internal filters cannot handle it. 
&lt;blockquote&gt;"&lt;i&gt;There isn’t enough time in the day for any person to find value in what a 1,000 people have to say - our internal filters just won’t allow it. At some point all that information; whether it be valuable or just fluff, becomes nothing more than white noise&lt;/i&gt;".&lt;br /&gt;
&lt;small&gt;from &lt;a href="http://www.winextra.com/2007/09/21/enough-with-the-social-crap-i-think-im-gonna-puke/"&gt;Enough with the social crap I think I’m gonna puke&lt;/a&gt;, by Steven Hodson&lt;/small&gt;&lt;/blockquote&gt;
&lt;/li&gt;
&lt;li&gt;"Massive waste of time" / "It takes too much time" / '&lt;a href="http://radar.oreilly.com/archives/2007/12/decentralizing.html"&gt;Social Net Fatigue&lt;/a&gt;'&lt;/li&gt;
&lt;li&gt;Privacy concerns / 'abuse of trust'. Services track user activity on and off the service, and post some of those activities to the "friends". Combining information from multiple sources may reveal private information.&lt;/li&gt;
&lt;li&gt;Social nets are 'Walled Gardens'. They are not portable - information is trapped within the bounds of each service. New users must re-enter profile information, must search and re-add network contacts, and must reset notification and privacy preferences for each new social net joined.&lt;/li&gt;
&lt;li&gt;Social nets are by definition 'network-centric'. Most users are exposed only to friends of friends (i.e. two degrees of separation). This presents an obstacle to discovering true friends and contacts, most of the potential being outside of your network.&lt;/li&gt;
&lt;li&gt;No Business Model beyond popularity and possibly advertising. Also, because new users on social networks often misrepresent themselves and enter false personal information, demographic data for advertisers is unreliable.&lt;/li&gt;
&lt;li&gt;The "superficial emptiness"&lt;/li&gt;
&lt;li&gt;The "what's next?" phenomenon (after exhausting the novelty of the site) / Lack of Innovation&lt;/li&gt;
&lt;li&gt;Not granular enough - no ability to group friends and contacts in categories, or indicate how close or trustworthy those relationships are.&lt;/li&gt;
&lt;li&gt;Tired of having to add friends or accept friend requests in all of these networks.&lt;/li&gt;
&lt;li&gt;Use a given service only because that's where your friends are.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;
&lt;b&gt;Proposed Solutions:&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;
Many feel that Identity/Info concepts like OpenID, OpenSocial, FOAF, the 'Semantic Web', and Microformats have great potential to solve a few of the above problems.
&lt;/p&gt;
&lt;blockquote&gt;"&lt;i&gt;a distributed, user-centric identity scheme would destroy almost every "walled garden" social software application on the web&lt;/i&gt;".&lt;br /&gt;
&lt;small&gt;from &lt;a href="http://bexhuff.com/identity-management-will-destroy-social-software/"&gt;Identity Management Will Destroy Social Software&lt;/a&gt;, by Brian 'Bex' Huff&lt;/small&gt;&lt;/blockquote&gt;
&lt;p&gt;
The idea is that each internet user would have a single universal and portable profile that would be used and understood by all services, thereby eliminating the need to enter and configure the same information and connections on every new service. Ideally, this would have the effect of removing the walls between services, creating a single large community or 'cloud' where "relationships transcend networks/documents".
&lt;/p&gt;&lt;p&gt;
The social and data islands that dot the internet can clearly be helped by some kind of standardized profile that can be uploaded to (and modified by) each service. The burden of registration and establishing relationships would be greatly reduced. Such a profile can grow to include all the data that a person might share, including photos and information, music, movie, web site favorites, etc. As long as all services agreed on standardization, this should work pretty well. As an example, browser standardization is largely successful - though differences do exist and can be frustrating for developers and surfers alike.
&lt;/p&gt;&lt;p&gt;
&lt;b&gt;The Next Revolution:&lt;/b&gt;
&lt;/p&gt;&lt;p&gt;
Schemas, however, will not solve most of the issues mentioned above, and some are made worse (like privacy concerns). Some even argue that standardization and identity aggregation would &lt;a href="http://benmetcalfe.com/blog/index.php/2007/02/12/do-we-really-want-to-aggregate-identiy/"&gt;not be entirely appreciated&lt;/a&gt;. As much as schemas depend on FOAF information, most of the problems with social networks will remain. If one really wants to 'kill' social nets, one needs to get rid of the 'net' part, i.e. the degrees of separation. &lt;b&gt;What is lacking in the current data islands and the proposed schema solutions is a way of harnessing the true power of the collective to actually reduce information overload and increase discovery&lt;/b&gt;. The next revolution in content and relationship discovery can only arise with a completely new kind of information filtration and recommender technology.
&lt;/p&gt;
&lt;blockquote&gt;"&lt;i&gt;The social web will be powered by recommender systems&lt;/i&gt;".&lt;br /&gt;
&lt;small&gt;&lt;a href="http://blog.recommenders06.com/wp-content/uploads/2006/09/riedl.pdf"&gt;Open Issues in Recommender Systems&lt;/a&gt;&lt;br /&gt;
John Riedl, Bilbao Recommenders School, 2006&lt;/small&gt;
&lt;/blockquote&gt;
&lt;p&gt;
The true power of the collective can only be realized with the proper integration of social media, new universal discovery techniques, and associated detailed portable identity and personalization info. The result is a Social Web based on &lt;i&gt;one degree of separation&lt;/i&gt;: all people and things are related to each other directly, with each such relationship differing only in type and strength. More on this new paradigm shortly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-4392185234223581805?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/4392185234223581805/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=4392185234223581805' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/4392185234223581805'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/4392185234223581805'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/01/social-standardization-and-death-of.html' title='Social Standardization and the Death of Social Networks'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-7332495219487179866</id><published>2008-01-05T22:13:00.000+02:00</published><updated>2008-01-18T16:10:29.468+02:00</updated><category 
scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><category scheme='http://www.blogger.com/atom/ns#' term='Identity'/><title type='text'>Cause vs. Effect of Human Preference</title><content type='html'>&lt;blockquote&gt;
"&lt;span style="font-style:italic;"&gt;One crucial unsolved problem for recommender systems is how best to learn about a new user&lt;/span&gt;".&lt;br /&gt;
&lt;small&gt;&lt;a href="http://www.iuiconf.org/02pdf/2002-001-0019.pdf"&gt;Getting to Know You: Learning New User Preferences in Recommender Systems&lt;/a&gt;&lt;br /&gt;
Rashid, et al, 2002&lt;/small&gt;
&lt;br /&gt;&lt;br /&gt;
"&lt;span style="font-style:italic;"&gt;Success comes from understanding both data and people&lt;/span&gt;"&lt;br /&gt;
&lt;small&gt;&lt;a href="http://blog.recommenders06.com/wp-content/uploads/2006/09/riedl.pdf"&gt;Open Issues in Recommender Systems&lt;/a&gt;&lt;br /&gt;
John Riedl, Bilbao Recommenders School, 2006&lt;/small&gt;
&lt;br /&gt;&lt;br /&gt;
"&lt;span style="font-style:italic;"&gt;The problem with recommendation systems is...  it measures and acts upon the effect, not the cause&lt;/span&gt;".&lt;br /&gt;
&lt;small&gt;&lt;a href="http://www.uie.com/brainsparks/2006/12/13/uietips-article-watch-and-learn-recommendation-systems-are-redefining-the-web/#comment-39712"&gt;Response to “&lt;span style="font-style:italic;"&gt;UIEtips Article: Watch and Learn: Recommendation Systems are Redefining the Web&lt;/span&gt;”&lt;/a&gt;&lt;br /&gt;
Adam Smith, 2006&lt;/small&gt;
&lt;/blockquote&gt;
&lt;p&gt;
So far, the internet has been all about effect. What other people say they like, you might also like; what you liked in the past suggests what you may like in the future. Google does it with PageRank; Amazon.com and Netflix do it with their recommender systems. They act based on your, or others', past preferences (the effect) rather than the cause of your past preferences. As you interact with the web, applications can record your actions and choices in order to create a filter with which to formulate suggestions that you might appreciate in the future.
&lt;/p&gt;&lt;p&gt;
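To make the "effect" approach described above concrete, here is a minimal Python sketch of recommending from recorded history alone. Everything in it is an assumption made for illustration: the likes data, the user names, and the jaccard and recommend helpers are hypothetical, and the scoring is a simple overlap-weighted vote, not the method used by Google, Amazon.com, or Netflix.
&lt;/p&gt;
&lt;pre&gt;
```python
# Hypothetical recorded history: the observed effect, not its cause.
likes = {
    "you": {"jazz", "blues"},
    "alice": {"jazz", "blues", "soul"},
    "bob": {"metal", "punk"},
    "newcomer": set(),
}


def jaccard(a, b):
    """Overlap between two sets of liked items, from 0.0 to 1.0."""
    union = a.union(b)
    return len(a.intersection(b)) / len(union) if union else 0.0


def recommend(history, user):
    """Score unseen items by how similar their fans are to this user."""
    mine = history[user]
    scores = {}
    for other, theirs in history.items():
        if other == user:
            continue
        weight = jaccard(mine, theirs)
        for item in theirs - mine:
            scores[item] = scores.get(item, 0.0) + weight
    # Highest score first; ties broken alphabetically; zero scores dropped.
    ranked = sorted(scores.items(), key=lambda pair: (-pair[1], pair[0]))
    return [item for item, score in ranked if score]


print(recommend(likes, "you"))
print(recommend(likes, "newcomer"))
```
&lt;/pre&gt;
&lt;p&gt;
In this toy data, the user with a history gets a suggestion drawn from the one similar user, while the newcomer, who has no recorded choices yet, gets nothing at all.
&lt;/p&gt;
&lt;p&gt;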
But this is not the way the natural social process of recommendation seeking works. If you really want a good recommendation you ask someone who knows you well, as an individual. This is the way &lt;a href="http://www.rashmisinha.com/recommenders.html"&gt;good friends&lt;/a&gt; do it. We accept recommendations from good friends because they understand our core identity (hopefully) and have no ulterior motives (hopefully). For example, as a single guy, I will never again go on a blind date unless the intermediary is a good friend who understands my taste and my attitudes, values, personality, etc., as well as those of the prospective date. One could make an assumption based on my past dates and relationships, but it would be an assumption based on insufficient (see below) and indirect data: the effect rather than the cause.
&lt;/p&gt;&lt;p&gt;
What is the cause? Preferences do not appear out of thin air; they are a result of your core identity: some combination of nature and nurture, your genes and your cultural and social influences, the configuration of your brain. This is the direct cause of your preferences: it is your &lt;span style="font-style:italic;"&gt;preference engine&lt;/span&gt;. Unfortunately, it is a black box that we cannot really open. Possibly in the future there will be a scanning device that can capture and replicate your precise neural configuration. With this copy, and sufficient understanding of the human mind, we might be able to accurately predict your choices. In making a choice, the steps are:
&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;Core Identity + Exposure -&gt; Preferences (i.e. Brazilian Supermodels)&lt;/li&gt;&lt;li&gt;Preferences + Availability -&gt; Choices &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;(Damn!)&lt;/li&gt;&lt;/ol&gt;
&lt;p&gt;
Current recommender systems, such as collaborative filters, attempt to simulate a filter at the second stage. What we need is a way to accurately simulate your filter at the first: not quite a copy of your brain - but close.
&lt;/p&gt;&lt;p&gt;
Your preferences are also extremely limited by your limited exposure. Take music as an example. I love music, but I have only heard a tiny fraction of a percent of all music. So how the hell can my current favorites be expected to be entirely descriptive of my true taste or ultimate favorites? I have been exposed to that which is largely popular, better marketed, in English, etc. Music recommender applications suffer from this limitation: they consider only what I have already heard, and so they receive highly skewed data about my true taste. It would be great to have a good friend who is the ultimate "long tail" DJ and can match me to music based on his knowledge of my core identity and detailed knowledge about all music and musical tastes.
&lt;/p&gt;
&lt;blockquote&gt;"&lt;span style="font-style:italic;"&gt;Thus, the task is not so much to see what no one yet has seen, but to think what nobody yet has thought about that which everybody sees&lt;/span&gt;".&lt;br /&gt;
– Arthur Schopenhauer&lt;/blockquote&gt;
&lt;p&gt;
It seems obvious that far better recommendations would result from an intimate knowledge of a person's core identity. But identity is mysterious and unapproachable; better left to the fantasies of pipe-smoking psychologists. In reality, it is the chain around the elephant's leg. We all have the tools to break free from the constraints of assumption, but smart people have not previously applied themselves to the task.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-7332495219487179866?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/7332495219487179866/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=7332495219487179866' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/7332495219487179866'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/7332495219487179866'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/01/cause-vs-effect-of-human-preference.html' title='Cause vs. Effect of Human Preference'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-3247385513479545200</id><published>2008-01-02T17:10:00.000+02:00</published><updated>2008-01-10T12:50:28.882+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><title type='text'>Current Recommender Types</title><content type='html'>&lt;p&gt;
There are a number of types of recommender systems currently available. They vary significantly in their mode of action and ultimate user experience. In terms of results, recommender systems are expected to offer a sufficient number of good-quality recommendations ('New Favorites'). Beyond that, the quality of the results also depends on minimizing false positives ('Trust Busters') and false negatives ('Missed Opportunities'). In other words, users should not be shown inappropriate results and should not be denied appropriate ones. 
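&lt;/p&gt;&lt;p&gt;
To make the three categories concrete, here is a minimal sketch in plain Ruby. The method name and the item names are invented for illustration; it simply compares what a user was shown against what that user would actually have liked.
&lt;/p&gt;

```ruby
# Hypothetical helper: sorts recommendations into the three categories above.
# `shown` is what the recommender displayed, `liked` is what the user would
# actually have enjoyed. Both lists are made-up example data.
def classify_recommendations(shown, liked)
  {
    new_favorites: shown.select { |item| liked.include?(item) }, # true positives
    trust_busters: shown - liked,                                # false positives
    missed_opportunities: liked - shown                          # false negatives
  }
end

shown = %w[jazz_album thriller_novel cooking_show]
liked = %w[jazz_album documentary]
puts classify_recommendations(shown, liked).inspect
```

&lt;p&gt;
A good recommender keeps the first list long and the other two short.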
&lt;/p&gt;&lt;p&gt;
The quality of the user experience is also influenced by the time and effort required to give the recommender system enough information to make minimally reasonable recommendations. Users are sometimes asked to fill out lengthy questionnaires, or applications require that a user's history of choices or ratings be observed and recorded. It takes time and effort before things start working well. These days, users don't like to wait for anything and expect immediate gratification. Having to produce recommendations for a newly registered user with little or no history is known as the 'cold start' problem. Existing applications that try to deliver instant results from a 'cold start' lack anything close to the explicit or implicit information required to make accurate, high-quality recommendations.
&lt;/p&gt;&lt;p&gt;
There are a number of strategies that recommender systems are taking today. These include:
&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;Non-personalized: "Web 1.0" technology offering the highest rated or most popular items to all users. No intrinsic personalization, poor quality results, but immediate.&lt;/li&gt;&lt;li&gt;Demographic: Requires some knowledge about the user in order to group similar users together (e.g. by age, gender, area code, other similar features). Poor quality recommendations, low personalization, though slightly better than the above. May require "private" information, and depending on the length of the questionnaire, registration can take time.&lt;/li&gt;&lt;li&gt;Simple answer or ratings matching: Matches users based on explicit matching of answers, selections, ratings, etc. Makes recommendations with extremely limited scope, many missed opportunities, requires answers or observations.&lt;/li&gt;&lt;li&gt;Heuristics, probabilistic models (Bayesian, Markov), decision tree, neural net, etc.: An application must collect a large number of user-item preferences, or user/item features, before quality recommendations are possible. This approach attempts to identify the logic underlying a user's choices (or, in the case of heuristics, to apply certain assumptions to them).&lt;/li&gt;&lt;li&gt;User-based Collaborative Filtering: similarity of historical choices or actions allows the application to find highly correlated users. The assumption is that users who agreed in the past might tend to agree in the future. Limited immediate results, most items will not be rated/answered (sparsity). Users with non-typical opinions or taste (the '&lt;a href="http://mobblog.cs.ucl.ac.uk/2007/06/15/recommender-systems-and-the-long-tail/"&gt;long tail&lt;/a&gt;') may not get good recommendations.&lt;/li&gt;&lt;li&gt;Item-based collaborative filtering: Finds items that tend to be preferred together. 
Limited immediate results, and users with non-typical opinions or taste may not get good recommendations.&lt;/li&gt;&lt;li&gt;Content-Based: Finds items with features (keywords, author, genre; i.e. the item's "DNA") similar to the known preferences of a user. Items must be properly and thoroughly represented as a set of features - this generally requires a large staff. Generally limited to a single domain as there may be few cross-domain features. Limited immediate results.&lt;/li&gt;&lt;/ol&gt;
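&lt;p&gt;
As a rough illustration of the user-based collaborative filtering strategy in the list above, here is a minimal sketch in plain Ruby. The users, items and ratings are invented, and cosine similarity over co-rated items is only one of several similarity measures that could be used; it is an assumption of this sketch, not a description of any particular product.
&lt;/p&gt;

```ruby
# Hypothetical ratings on a 1-5 scale; users and items are invented.
RATINGS = {
  ann:  { dune: 5, emma: 1, ulysses: 4 },
  bob:  { dune: 4, emma: 2, ulysses: 5, neuromancer: 5 },
  cara: { dune: 1, emma: 5, persuasion: 4 }
}

# Cosine similarity computed only over the items both users have rated.
def similarity(a, b)
  shared = a.keys.select { |item| b.key?(item) }
  return 0.0 if shared.empty?
  dot = shared.sum { |item| a[item] * b[item] }
  norm_a = Math.sqrt(shared.sum { |item| a[item]**2 })
  norm_b = Math.sqrt(shared.sum { |item| b[item]**2 })
  dot / (norm_a * norm_b)
end

# Recommend items that the most similar other user rated highly
# and that the target user has not rated yet.
def recommend_for(user, ratings, min_rating = 4)
  others = ratings.reject { |name, _| name == user }
  neighbour, _ = others.max_by { |_, theirs| similarity(ratings[user], theirs) }
  ratings[neighbour]
    .select { |item, score| score.between?(min_rating, 5) and !ratings[user].key?(item) }
    .keys
end

puts recommend_for(:ann, RATINGS).inspect
```

&lt;p&gt;
Note how little the sketch has to go on: with only two or three shared ratings per pair of users, the similarity scores are fragile, which is the sparsity problem mentioned in the list.
&lt;/p&gt;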
&lt;p&gt;
There are many recommendation engines and recommender applications available on the internet and many more seem to be popping up all the time. Currently they all have severe limitations and offer mediocre to poor quality results when compared to, say, recommendations by a best friend. Examples of current applications include:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://eharmony.com"&gt;eHarmony&lt;/a&gt; requires a very lengthy questionnaire and uses a proprietary empirical heuristic to match people romantically. Its success depends on the quality of the questions and the heuristic, the person's willingness to answer truthfully, and the person's willingness to spend a few hours to register. Mixed results are reported, but there is certainly an advantage over matchmaking sites that allow daters to make their own bad choices.&lt;/li&gt;&lt;li&gt;&lt;a href="http://pandora.com"&gt;Pandora&lt;/a&gt; and &lt;a href="http://last.fm"&gt;Last.fm&lt;/a&gt; both recommend music, though they do so in &lt;a href="http://www.stevekrause.org/steve_krause_blog/2006/01/pandora_and_las.html"&gt;different ways&lt;/a&gt;. Pandora's large staff must determine the separable features ("DNA") of each song, and the application observes a user's choices in order to extract the features common to that user's preferences. Last.fm seems to work by grouping users of similar taste. Both suffer from &lt;a href="http://knowledge.wharton.upenn.edu/article.cfm?articleid=1818"&gt;reduced choice diversity&lt;/a&gt; for slightly different reasons. Both are mildly satisfactory, but also suffer from excessive false negatives and false positives, and require recording your existing preferences. Two roommates using the same account will likely see poor results.&lt;/li&gt;&lt;li&gt;&lt;a href="http://amazon.com"&gt;Amazon.com&lt;/a&gt;'s recommendations work by observing a user's choices and activity and grouping items (books, CDs, DVDs, etc.) that tend to be chosen or viewed by the same users. After viewing or choosing items, you are presented with: "users who liked X (the currently viewed item) also liked Y (a correlated item)". 
Users who buy for multiple people, such as children or friends, will likely see poor results, and this is a fairly typical pattern.&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.esnips.com/socialdna"&gt;Social DNA&lt;/a&gt; sounds like it works similarly to Pandora, but the granularity is significantly greater, and unlike eHarmony, there seems to be no heuristic - matching is all or nothing (i.e. explicit ratings and questions). This is expected to lead to extremely high false negatives and relatively few true positives. And since, considering the complexity of human beings, matches will likely occur on only a tiny fraction of the possible DNA (highly limited explicit information yields a sparse matrix), most of the matches that do occur are likely to be false positives.&lt;/li&gt;&lt;/ul&gt;
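&lt;p&gt;
The "users who liked X also liked Y" pattern described for Amazon.com above can be approximated by simple co-occurrence counting. The sketch below is only an illustration of that idea in plain Ruby with invented users and items; it is not a description of how Amazon actually computes its recommendations.
&lt;/p&gt;

```ruby
# Hypothetical purchase histories; users and items are invented.
BASKETS = {
  u1: %i[ruby_book rails_book coffee],
  u2: %i[ruby_book rails_book],
  u3: %i[ruby_book toy_train],
  u4: %i[toy_train picture_book]
}

# Count how often every other item shares a basket with the given item,
# then return the most frequent companions (ties broken alphabetically).
def also_liked(item, baskets, limit = 3)
  counts = Hash.new(0)
  baskets.each_value do |items|
    next unless items.include?(item)
    (items - [item]).each { |other| counts[other] += 1 }
  end
  counts.sort_by { |other, count| [-count, other.to_s] }
        .first(limit)
        .map { |other, _| other }
end

puts also_liked(:ruby_book, BASKETS).inspect
```

&lt;p&gt;
The toy train that turns up next to the programming books is the multiple-recipient problem in miniature: one user bought a gift, and the item correlation cannot tell the difference.
&lt;/p&gt;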
&lt;p&gt;
In order to get relatively high quality and accurate recommendations, a large number of explicit ratings/choices (and/or possibly implicit activity) must be recorded. This is extremely hard to do: users are unlikely to maintain interest while the machine learns, and this will be increasingly true in the future. &lt;a href="http://techdirt.com/articles/20061120/120841.shtml"&gt;Currently, users must be content with mediocre results&lt;/a&gt;, but a trade-off will develop between accuracy/quality and user patience.
&lt;/p&gt;&lt;p&gt;
Another frequent limitation is that users can act maliciously or inappropriately to skew results. Due to the limitations of current applications, users may feel the need to modify or exaggerate their choices in order to get better results. On the other end, users who want to promote certain items to others may give or encourage false ratings, views or descriptions (called 'Shilling') through manual or automated efforts or attacks. Also, privacy becomes an issue as users may explicitly or implicitly reveal private information about themselves. Details include demographics, personal details, taste, ratings, opinions, etc. Systems administrators (and possibly hackers) will have free access to this data. 
&lt;/p&gt;&lt;p&gt;
Accurate, high quality, robust and broad scope recommendations have been the holy grail for internet futurists for quite some time, though we are still a long way from that goal. The problem is largely technical: recommendations are a really tough problem. Mathematics/statistics, clever algorithms and artificial intelligence are stretching the results to the maximum, given the poor quality data available from users during registration or interaction with the application. The solution is to get high quality data about the user's identity or individuality and match based on that, rather than matching based on a user's history. The problem is that teaching the machine about the core identity of a person is science fiction. Or is it?
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-3247385513479545200?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/3247385513479545200/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=3247385513479545200' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3247385513479545200'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3247385513479545200'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2008/01/current-recommender-types-2.html' title='Current Recommender Types'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-3785157294590784769</id><published>2007-12-23T14:32:00.017+02:00</published><updated>2009-12-28T11:38:18.149+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Ruby on Rails'/><title type='text'>Live fuzzy search using n-grams in Rails</title><content type='html'>&lt;p&gt;
&lt;span style="color: #775533; font-size: 1.2em"&gt;Update [December 17, 2009]:&lt;/span&gt;
&lt;ul style="color: #775533"&gt;
&lt;li&gt;You can now see these live applications:&lt;/li&gt;
&lt;ul&gt;
 &lt;li&gt;&lt;a href="http://www.heebz.com"&gt;Heebz.com&lt;/a&gt;&lt;/li&gt;
 &lt;li&gt;&lt;a href="http://gigcatcher.com"&gt;Gigcatcher.com&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;This post is the basis for a few Rails plugins: &lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://github.com/bmaland/no_fuzz"&gt;no_fuzz&lt;/a&gt; - Simple Rails-plugin that provides offline fuzzy-search for ActiveRecord [see my fork of this as well]&lt;/li&gt;
&lt;li&gt;&lt;a href="http://github.com/mkristian/kristians_rails_plugins/tree/master/act_as_fuzzy_search"&gt;act_as_fuzzy_search&lt;/a&gt; - add a fuzzy search to model for DataMapper&lt;/li&gt;
&lt;li&gt;&lt;a href="http://github.com/iulianu/rails-fuzzy-search"&gt;rails-fuzzy-search&lt;/a&gt; - a more recent fork of the above.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/p&gt;
&lt;p&gt;
We wanted to give users the ability to perform a search. It is fairly easy to set up a small search engine on a site, or to allow users to perform a search from a major search engine such as Google directly from the site itself. Our site needed something a bit more sophisticated and useful.
&lt;/p&gt;&lt;p&gt;
First, we wanted to allow users to start typing in the search box, and have the site suggest results from the database as the user types and display them in a drop-down list under the search box. This is sometimes referred to as auto-complete, auto-suggest or "Live Search". Rails accomplishes this typically through its &lt;a href="http://api.rubyonrails.org/classes/ActionView/Helpers/JavaScriptMacrosHelper.html"&gt;JavaScript macros helpers&lt;/a&gt; that wrap the Prototype JavaScript framework and the Scriptaculous JavaScript controls and visual effects library. See also:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://demo.script.aculo.us/ajax/autocompleter"&gt;AJAX autocompletion demo&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://ajax.suaccess.org/savor-the-scriptaculous/create-an-auto-complete-field/"&gt;Create an Auto-Complete Field with script.aculo.us&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;
We needed some extra functionality: we needed a single search feature to search multiple attributes of a model and, upon selection, populate a number of form fields with a number of attributes and associations. It actually gets even more complex than this, and perhaps I'll write more about it in a future article.
&lt;/p&gt;&lt;p&gt;
Typically, a search looks for exact matches of what is being typed. However, many important results are missed when an exact match is required: people often misspell, and related words that share the same word root will often not be found. So we wanted to allow users to find things that did not match perfectly with what was being typed. Furthermore, we wanted the results to be sorted such that the better the match, the higher a result is located on the drop down list.
&lt;/p&gt;&lt;p&gt;
One option for Rails developers is to use &lt;a href="http://ferret.davebalmain.com/trac/wiki"&gt;Ferret&lt;/a&gt;, a high-performance text search engine library based on Apache Lucene. Ferret is a reasonable option, and we may eventually go this route, but we wanted something simpler that did not require installing a library - we only needed the fuzzy search capability. Ferret's code is largely in C, rather than Ruby, and its fuzzy search is based on the &lt;a href="http://en.wikipedia.org/wiki/Levenshtein_distance"&gt;Levenshtein edit distance&lt;/a&gt; formula, which calculates the "minimum number of operations needed to transform one string into the other, where an operation is an insertion, deletion, or substitution of a single character".
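&lt;/p&gt;&lt;p&gt;
For readers who have not met it, the edit distance quoted above can be sketched in a few lines of plain Ruby. This is the textbook dynamic-programming version, written here for illustration only; it is not Ferret's implementation, and the method name is our own.
&lt;/p&gt;

```ruby
# Textbook Levenshtein distance, keeping only one previous row of the table.
def levenshtein(a, b)
  # Row 0: turning the empty prefix of a into the first j chars of b takes j insertions.
  previous = (0..b.length).to_a
  a.each_char.with_index(1) do |char_a, i|
    current = [i] # turning the first i chars of a into the empty string takes i deletions
    b.each_char.with_index(1) do |char_b, j|
      cost = char_a == char_b ? 0 : 1
      current.push([
        previous[j] + 1,        # deletion
        current[j - 1] + 1,     # insertion
        previous[j - 1] + cost  # substitution (or match when cost is 0)
      ].min)
    end
    previous = current
  end
  previous.last
end

puts levenshtein('muhammed', 'mohammed')
```

&lt;p&gt;
A naive fuzzy search built on this function alone would have to compare the entry against every stored name in turn.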
&lt;/p&gt;&lt;p&gt;
Instead, we decided it should be rather simple to create an n-gram distance algorithm and leverage database indexing to create a fast n-gram distance fuzzy search, purely in Ruby. What is n-gram distance? It is the measure of the similarity of two strings by comparing the number or significance of matching substrings of length &lt;span style="font-style:italic;"&gt;n&lt;/span&gt;. It is particularly useful when searching a database of strings, for example a directory of names, rather than lengthy text (though that may certainly be possible as well). Our focus will be on a database of strings.
&lt;/p&gt;&lt;p&gt;
As an example, let's take the most popular name in Europe, and two of its spellings: Muhammed and Mohammed (there are also many other spellings). Typical live searches will not be able to help someone who has started to type one rather than the other, as the second letter is different. Live searches need at least the first two or three letters of the input to match. First, let's add a word start character (shown here as *; in the code, we prepend a space) to the beginning of the words because word boundaries constitute useful information. Next, let's break each word into sliding frames of three letters (trigrams). Muhammed becomes: *mu, muh, uha, ham, amm, mme, med. Mohammed becomes: *mo, moh, oha, ham, amm, mme, med. The two names share 4 out of 7 trigrams (ham, amm, mme, med).
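&lt;/p&gt;&lt;p&gt;
The splitting step is easy to try out in plain Ruby. The helper below is a small sketch with a name of our own choosing; it uses * as the word start marker to match the example above.
&lt;/p&gt;

```ruby
# Split a word into overlapping three-letter frames (trigrams).
# The start marker defaults to '*' to match the example in the text.
def trigrams(word, start_marker = '*')
  padded = start_marker + word.downcase
  (0..padded.length - 3).map { |idx| padded[idx, 3] }
end

a = trigrams('Muhammed')
b = trigrams('Mohammed')
shared = a.select { |tg| b.include?(tg) }

puts a.inspect
puts b.inspect
puts shared.inspect
```

&lt;p&gt;
The shared list is the raw material for a similarity score: the more trigrams two strings have in common, the closer they are.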
&lt;/p&gt;&lt;p&gt;
First, migrate a trigrams table with columns: tg (trigram, a string), person_id (the key in the people table, an integer), and score (an integer, though you may not need it). Make an index on the first two columns in that order. Next, generate a trigrams model. Then you must populate the trigrams table with trigrams from the people table. This is easy to do as a separate migration:
&lt;/p&gt;
&lt;textarea cols="50" rows="24" wrap="off"&gt;
class PopulateTrigrams &lt; ActiveRecord::Migration
  def self.up
    # Convert all people into trigram data, using each person's first and last name.
    Person.find(:all).each do |person|
      # Prepend a space to emphasize the beginning of the word; downcase so the
      # stored trigrams match the downcased search entry in the controller.
      word = ' ' + person.lastname.downcase
      (0..word.length-3).each do |idx|
        tg = word[idx,3]
        # Last-name trigrams get a higher score than first-name trigrams.
        Trigram.create(:tg =&gt; tg, :person_id =&gt; person.id, :score =&gt; 2) unless Trigram.find_by_tg_and_person_id(tg,person.id)
      end
      word = ' ' + person.firstname.downcase
      (0..word.length-3).each do |idx|
        tg = word[idx,3]
        Trigram.create(:tg =&gt; tg, :person_id =&gt; person.id, :score =&gt; 1) unless Trigram.find_by_tg_and_person_id(tg,person.id)
      end
      end
    end
  end

  def self.down
    Trigram.delete_all
  end
end
&lt;/textarea&gt;
&lt;p&gt;
Here, we are storing trigrams from each person's first and last names, and in this example, we are scoring matches to the last name at twice the score of the first name. This is just an example; you can score it any way you want. If you don't need to distinguish scores, you can eliminate the score column. Anyway, this migration will populate the trigrams table, and since the trigram and person_id are indexed, any search is easy on the database - and fast.
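&lt;/p&gt;&lt;p&gt;
To see what ends up in the table without running a migration, here is a database-free sketch of the same logic. The Person struct is a hypothetical stand-in for the ActiveRecord model, and lower-casing the names is an assumption made here so that stored trigrams match a lower-cased search entry.
&lt;/p&gt;

```ruby
# Hypothetical stand-in for the ActiveRecord model used in the migration.
Person = Struct.new(:id, :firstname, :lastname)

# Build the rows the trigrams table would hold: one row per (tg, person_id),
# last names scored 2 and first names scored 1.
def trigram_rows(people)
  rows = {}
  people.each do |person|
    [[person.lastname, 2], [person.firstname, 1]].each do |name, score|
      word = ' ' + name.downcase # leading space marks the start of the word
      (0..word.length - 3).each do |idx|
        key = [word[idx, 3], person.id]
        # Keep the first score seen, like the `unless find_by_...` guard.
        rows[key] ||= score
      end
    end
  end
  rows.map { |(tg, person_id), score| { tg: tg, person_id: person_id, score: score } }
end

trigram_rows([Person.new(1, 'Anna', 'Hanna')]).each { |row| puts row.inspect }
```

&lt;p&gt;
Because the last name is processed first, a trigram that appears in both names keeps the higher last-name score.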
&lt;/p&gt;&lt;p&gt;
The autocomplete action method in the controller might look like this:
&lt;/p&gt;
&lt;textarea cols="50" rows="10" wrap="off"&gt;
  def auto_complete_for_person_full_name 
    limit = 10 #how many results do you want?
    entry = ' ' + params[:person][:full_name].downcase # space is word boundary, full_name is a virtual attribute
    trigrams = (0..entry.length-3).collect {|idx| entry[idx,3]}
    trigram_groups = Trigram.sum(:score, :conditions =&gt; [ "tg IN (?)", trigrams], :group =&gt; 'person_id')
    top_ids = trigram_groups.sort_by {|a| -a[1]}.first(limit).map {|a| a[0]} #sorts by score, gets top scorers, returns simple array of ids
    unsorted_people = Person.find(top_ids) #get unsorted objects
    @people = top_ids.map{|id| unsorted_people.detect{|person| person.id == id}} #sort objects according to sorted ids
    render :partial =&gt; 'full_names' #AJAX update of dropdown list
  end
&lt;/textarea&gt;
&lt;p&gt;
Do you see what's going on here? The entry is split into trigrams, then the scores of all matching trigrams in a person's name are summed. People are sorted based on their total score, and the top matches are sent to the partial. Very simple. The partial might look like this:
&lt;/p&gt;
&lt;textarea cols="50" rows="8" wrap="off"&gt;
&lt;ul class="full_names"&gt;
&lt;% for person in @people do -%&gt;
  &lt;li class="person"&gt;
    &lt;div class="id" style="display:none"&gt;&lt;%= person.id %&gt;&lt;/div&gt;
    &lt;div class="full_name"&gt;&lt;%=h person.firstname %&gt; &lt;b&gt;&lt;%=h person.lastname %&gt;&lt;/b&gt;&lt;/div&gt;
  &lt;/li&gt;
&lt;% end -%&gt;
&lt;/ul&gt;
&lt;/textarea&gt;
&lt;p&gt;
The ID is included, though hidden, so user selection of a person automatically includes the id of the person from the database table for further use.
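&lt;/p&gt;&lt;p&gt;
If you want to experiment with the ranking outside Rails, the scoring step in the controller action can be imitated in plain Ruby. This is only a sketch: the names, ids and helper methods are invented, and an in-memory array stands in for the indexed trigrams table.
&lt;/p&gt;

```ruby
# Same splitting rule as the post's code: leading space, sliding window of three.
def trigrams_of(text)
  word = ' ' + text.downcase
  (0..word.length - 3).map { |idx| word[idx, 3] }
end

# Hypothetical people (id, last name) and the rows their last names would produce.
NAMES = [[1, 'Mohammed'], [2, 'Hamilton'], [3, 'Smith']]
ROWS = NAMES.flat_map do |person_id, lastname|
  trigrams_of(lastname).map { |tg| { tg: tg, person_id: person_id, score: 2 } }
end

# Sum the scores of every stored trigram that also occurs in the entry,
# grouped by person, and return the ids of the best matches first.
def rank(entry, rows, limit = 10)
  wanted = trigrams_of(entry)
  totals = Hash.new(0)
  rows.each do |row|
    totals[row[:person_id]] += row[:score] if wanted.include?(row[:tg])
  end
  totals.sort_by { |_, total| -total }.first(limit).map { |person_id, _| person_id }
end

puts rank('Muhammed', ROWS).inspect
```

&lt;p&gt;
People with no matching trigram at all never enter the totals, so they are left out of the list rather than ranked last.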
&lt;/p&gt;&lt;p&gt;
It works beautifully, and it seems very fast so far - though with a huge number of items, database storage may become an issue. Any further ideas, suggestions, requests?
&lt;/p&gt;

&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.workingwithrails.com/recommendation/new/person/18230-steve-ruttenberg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 180px; height: 68px;" src="http://3.bp.blogspot.com/_bu_aLR84YIg/SyMVaBfZLWI/AAAAAAAAAE8/RQofaNQVmJU/s400/wwr.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5414194713850359138" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-3785157294590784769?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/3785157294590784769/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=3785157294590784769' title='10 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3785157294590784769'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/3785157294590784769'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2007/12/live-fuzzy-search-using-n-grams-in.html' title='Live fuzzy search using n-grams in Rails'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_bu_aLR84YIg/SyMVaBfZLWI/AAAAAAAAAE8/RQofaNQVmJU/s72-c/wwr.png' height='72' 
width='72'/><thr:total>10</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-2512190271098569125</id><published>2007-12-22T15:47:00.000+02:00</published><updated>2008-01-10T12:43:39.234+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Ruby on Rails'/><title type='text'>Adventures in Self-Referential Database Associations</title><content type='html'>&lt;p&gt;
Rails makes object associations easy and has a number of different ways to configure associations and even store extra information about each relationship. Most of the time, those relationships are between business objects that wrap rows in two separate database tables. For example, let's say we wanted to be able to record, read and update grades and attendance of students in a particular school class. We might create a student model and a school class model, and the association between them may contain the grades and attendance information. We use many of these kinds of model object associations, and the procedure is fairly straightforward and written about extensively. See the following links to learn more:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://wiki.rubyonrails.org/rails/pages/Associations"&gt;Associations in Ruby on Rails&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://ar.rubyonrails.com/classes/ActiveRecord/Associations/ClassMethods.html"&gt;Module: ActiveRecord::Associations::ClassMethods&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://wiki.rubyonrails.org/rails/pages/Beginner+Howto+on+has_many+:through"&gt;Howto on has_many :through in Ruby on Rails&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;
For our project, we also needed a way to store various qualities about relationships between pairs of objects of the same model type, i.e. between rows of the same database table. We would need this for a number of different purposes. Those creating 'social net' applications know this kind of association well; they must document, for example, who are the direct friends of a particular user, and possibly further information about each such relationship. All those users are stored in the same database table, so a separate "join" table is created to store the keys of both users, plus any other information about the relationship.
&lt;/p&gt;&lt;p&gt;
To accomplish this, we used self-referential has_many :through associations that use a join model and methods to simplify the reading and updating of attributes of the association. To read more about these more specialized associations, here are some sites:
&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;a href="http://blog.hasmanythrough.com/2007/10/30/self-referential-has-many-through"&gt;Self-referential has_many :through associations&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://matt-beedle.com/2007/06/02/how-to-model-an-internal-emailing-system-using-self-referential-has_many-through-associations/"&gt;How to model an internal emailing system&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.aldenta.com/2006/11/10/has_many-through-self-referential-example/"&gt;Another has_many :through Self-referential Example&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;
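&lt;p&gt;
The join-table idea itself can be shown without any Rails at all. The sketch below is plain Ruby, not ActiveRecord: the structs, names and the closeness attribute are all invented for illustration, and each join row simply holds two keys into the same list of users plus one extra piece of information about the relationship.
&lt;/p&gt;

```ruby
# Hypothetical stand-ins for a users table and a self-referential join table.
User = Struct.new(:id, :name)
Friendship = Struct.new(:user_id, :friend_id, :closeness) # both keys point at users

USERS = [User.new(1, 'Dana'), User.new(2, 'Eli'), User.new(3, 'Noa')]
FRIENDSHIPS = [
  Friendship.new(1, 2, 'close'),
  Friendship.new(1, 3, 'acquaintance')
]

# Follow the join rows from one user to the other users they point at,
# returning each friend together with the extra relationship attribute.
def friends_of(user, users, friendships)
  friendships
    .select { |f| f.user_id == user.id }
    .map { |f| [users.find { |u| u.id == f.friend_id }, f.closeness] }
end

friends_of(USERS[0], USERS, FRIENDSHIPS).each do |friend, closeness|
  puts "#{friend.name} (#{closeness})"
end
```

&lt;p&gt;
Note that the rows in this sketch are directional: Dana lists Eli as a friend, but Eli has no row pointing back unless one is added.
&lt;/p&gt;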
&lt;p&gt;
More details on this later.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-2512190271098569125?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/2512190271098569125/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=2512190271098569125' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/2512190271098569125'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/2512190271098569125'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2007/12/adventures-in-self-referential-database.html' title='Adventures in Self-Referential Database Associations'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-8038859963081655122</id><published>2007-12-19T19:38:00.001+02:00</published><updated>2008-01-10T12:40:55.705+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Ruby on Rails'/><title type='text'>Why Ruby on Rails?</title><content type='html'>&lt;p&gt;
&lt;a href="http://en.wikipedia.org/wiki/Ruby_programming_language"&gt;Ruby&lt;/a&gt; is a beautiful, purely object-oriented, interpreted programming language influenced by Perl and Smalltalk, among others. &lt;a href="http://en.wikipedia.org/wiki/Ruby_On_Rails"&gt;Rails&lt;/a&gt; is a web application framework built on top of Ruby that is designed to speed and simplify the development of database-driven web applications. Rails is relatively new and not yet as widely used as other web development platforms, but its growth has been tremendous, largely because of its reputation as the fastest and easiest web development framework. Also, to achieve similar functionality, Rails typically requires far fewer lines of code than the more popular web development languages.
&lt;/p&gt;&lt;p&gt;
When we first started out, we were fairly familiar with Perl and PHP - so it was a difficult decision to attempt to bring the team into the Rails community. We expected that the delay required to learn Rails would be later compensated by faster development. So we decided to learn Ruby and Rails and at least build the Builders' Site prototype with Rails.
&lt;/p&gt;&lt;p&gt;
The Israeli Rails community is rather small, but nonetheless I tried to attract co-founders and developers for a previous related project through the &lt;a href="http://groups.google.com/group/israelrb"&gt;israel.rb group&lt;/a&gt;. There are lots of startups in Israel, and I'm not surprised that demand exceeds supply of capable developers. At the time I was not able to interest any qualified Rails hackers in joining the Israel team - part of the problem may have been that none of them knew what the project was really about, only that some bombastic American had a stealth startup.
&lt;/p&gt;&lt;p&gt;
Despite the lack of Rails experts on the team, the learning curve was not excessive and we were quickly creating some fairly complex functionality. The Rails principle of "&lt;a href="http://en.wikipedia.org/wiki/Convention_over_Configuration"&gt;Convention over Configuration&lt;/a&gt;" helps to speed development of more typical functionality. We encountered some difficulty when our functionality quickly exceeded the convention. Luckily there are &lt;a href="http://railsforum.com/"&gt;some real heroes&lt;/a&gt; on the internet who publish their experiences with similar issues and try to help those trying to learn and develop along the same lines.
&lt;/p&gt;&lt;p&gt;
Critics of Ruby and/or Rails argue that Ruby is slow and web applications developed in Rails do not 'scale'. Most of this criticism appears to be sour grapes. It is true that there are no blockbuster applications written in Rails, only medium sized applications, but reaching "medium size" exceeds the achievements of the vast majority of web startups. Rails defenders argue that scaling Rails applications simply involves purchasing more servers and hardware, and that the greatest limitations are database related - and all web development languages suffer from this. They argue that savings on developer-hours more than make up for extra costs in hardware.
&lt;/p&gt;&lt;p&gt;
The debates continue, as does our development. Perhaps I will post some of the obstacles and specific solutions we found. So far we like Rails, and have no plans to go elsewhere. When we are ready to increase our development or scale the application, perhaps we will consider again all our options. The decision will, in part, depend on the availability of capable developers.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-8038859963081655122?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/8038859963081655122/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=8038859963081655122' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/8038859963081655122'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/8038859963081655122'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2007/12/why-ruby-on-rails.html' title='Why Ruby on Rails?'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-564334219621154562</id><published>2007-12-19T12:22:00.000+02:00</published><updated>2008-01-10T12:38:52.786+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intellectual Property'/><title type='text'>Why do we Patent?</title><content type='html'>&lt;p&gt;
Now that we've discussed our thoughts on the next internet revolution, and therefore the goals for the project, let's skip ahead and tell you that the algorithms have already been worked out and three patent applications filed. We are patent pending on the algorithms, the various uses of those algorithms, and business methods associated with this project. But there are many who question the ethics and effectiveness of patents, especially patents on software and algorithms. The software industry has demonstrated patents aren't needed for innovation to take place. So why have them?
&lt;/p&gt;&lt;p&gt;
The first thing to recognize in any discussion of patents is that they have been and are essential to a profit-making business. Innovation can be a hobby, but it is also integral to high-tech business. Patents reward and therefore encourage innovation because the innovator is more likely to see the full benefit from the innovation and from the hard work and money invested in it. Algorithms, like any innovation, can meet the patenting criteria of "novel", "non-obvious" to those skilled in the art, and "useful". They have been successfully defended in the past. There is no real difference between an algorithm and a design, process or utility. Arguments against patenting in general tend to be ideological in nature, and this is not the forum for debating Marxism.
&lt;/p&gt;&lt;p&gt;
For a startup project, investment is critical. But investment may be hard to come by unless we can convince investors that the market won’t be immediately entered by many competitors doing the same thing, especially by those larger and better equipped than we are. Investors want to know that their money will produce gains, not go down the drain due to new entrants copying or reverse engineering the technology whose development they funded. Patents offer investors that security, and investment in turn allows startup companies to defend their patents.
&lt;/p&gt;&lt;p&gt;
Also, we are spending significant amounts of our own time, effort and money on development. If we were pursuing a hobby, we would eagerly make our innovations publicly available and open source. There is much to be proud of, show off, and share. Open source is a wonderful thing: it is driving much development today, and can even result in revenue if done properly and developed quickly. This is not an indictment of open source, but we decided to try to maintain more control over the development and business of the project. If someone can propose a way to do this without patenting, please let us know.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-564334219621154562?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/564334219621154562/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=564334219621154562' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/564334219621154562'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/564334219621154562'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2007/12/why-do-we-patent.html' title='Why do we Patent?'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5160001102716841899.post-758618981810170343</id><published>2007-12-18T13:40:00.000+02:00</published><updated>2008-01-14T19:12:17.410+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Social Web'/><category scheme='http://www.blogger.com/atom/ns#' term='Web 2.0'/><category scheme='http://www.blogger.com/atom/ns#' term='Recommender Systems'/><title type='text'>The Future of the Internet</title><content type='html'>&lt;p&gt;I found an &lt;a href="http://money.cnn.com/magazines/fortune/fortune_archive/2006/11/27/8394347/"&gt;interesting analysis&lt;/a&gt; of existing applications and ideas about the future of the internet on the CNN website by Jeffrey 
M. O'Brien, writing for Fortune Magazine. He says:&lt;/p&gt;

&lt;blockquote&gt;&lt;i&gt;But there is no go-to discovery engine - yet. Building a personalized discovery mechanism will mean tapping into all the manners of expression, categorization, and opinions that exist on the Web today. It's no easy feat, but if a company can pull it off and make the formula portable so it works on your mobile phone - well, such a tool could change not just marketing, but all of commerce.
&lt;br /&gt;&lt;br /&gt;
"The effect of recommender systems will be one of the most important changes in the next decade," says University of Minnesota computer science professor John Riedl, who built one of the first recommendation engines in the mid-1990s. "The social web is going to be driven by these systems." &lt;/i&gt;&lt;/blockquote&gt;

&lt;p&gt;I see a growing interest in &lt;a href="http://en.wikipedia.org/wiki/Recommender_system"&gt;recommender systems&lt;/a&gt; everywhere I look, and I tend to agree with the above that the next big thing likely involves personalized discovery and recommendations. It seems clear that, in order to improve the current state of the internet, both &lt;a href="http://www.usatoday.com/tech/columnist/andrewkantor/2007-06-14-internet-organization_N.htm"&gt;information overload&lt;/a&gt; and the &lt;a href="http://www.sciammind.com/print_version.cfm?articleID=00056941-1933-1196-906983414B7F0000"&gt;burden of choice&lt;/a&gt; must be reduced, and this will require advancements in recommender technology. One can also assume that the internet will continue to expand the benefits of "&lt;a href="http://www.ebizmba.com/articles/ebizmba25.html"&gt;Web 2.0&lt;/a&gt;" and the collaborative internet. As with most products and services, these benefits are increased ease of use and increased usefulness.
&lt;/p&gt;&lt;p&gt;
Some feel that the "&lt;a href="http://en.wikipedia.org/wiki/Semantic_Web"&gt;Semantic Web&lt;/a&gt;" or similar technologies are the future. Essentially this involves "&lt;a href="http://youtube.com/watch?v=NLlGopyXT_g"&gt;teaching the machine&lt;/a&gt;" about content through the creation of universal, machine-readable formats. Those critical of the limited scope of "&lt;a href="http://en.wikipedia.org/wiki/Folksonomy"&gt;folksonomies&lt;/a&gt;" champion a more standardized approach, thinking that if only people could be compelled to be disciplined in how they organize content, some kind of utopia would emerge. There are as many critics of the Semantic Web as there are of folksonomy, and the fact that people have been talking about schemas and such for years seems to indicate that - short of some radically new approach - &lt;a href="http://www.well.com/%7Edoctorow/metacrap.htm"&gt;there will be no utopia there&lt;/a&gt;. Interestingly, it may be a phenomenon called "&lt;i&gt;&lt;a href="http://radar.oreilly.com/archives/2007/12/decentralizing.html"&gt;social network fatigue&lt;/a&gt;&lt;/i&gt;" that finally forces services to adopt a standardized portable social/data/identity ontology.
&lt;/p&gt;&lt;p&gt;
The new revolution must also bring people together in much better ways than those that employ the "&lt;a href="http://en.wikipedia.org/wiki/Social_network"&gt;degrees of separation&lt;/a&gt;" format. Online social networks are extremely popular because they allow people to keep in virtual contact with their existing friends and contacts, and they can facilitate personal expression and PR. Unfortunately, social nets are &lt;a href="http://seattletimes.nwsource.com/html/living/2003292646_onlinebacklash09.html"&gt;not really social&lt;/a&gt;, and the network - as degrees of separation - serves mostly to separate. Registering for one of these sites is a rather cold experience unless tons of your friends and contacts are already registered. Otherwise you have to spam your friends and nag them to register - something that they may not appreciate. The burden of registration and establishing relationships is compounded as more and more social nets are created, each of them requiring the time-consuming input of the same personal data.
&lt;/p&gt;&lt;p&gt;
The next big thing must facilitate discovery of new people based only on similarities of their core identities, rather than focusing only on a user's primary social network. Two &lt;a href="http://www.telegraph.co.uk/news/main.jhtml?xml=/news/2007/10/27/wtwins127.xml"&gt;identical twins, separated at birth&lt;/a&gt; and raised in countries on opposite sides of the planet, speaking different languages, should be able to find each other. Bringing people together must serve several separate objectives: romance, friendship, business relationships, work team formation, roommates, travel and recreation buddies, etc.
&lt;/p&gt;&lt;p&gt;
In addition to people finding other people of similar core identity, the next big thing will need to do the same for non-human entities, like music, movies, TV, books, news, web pages, articles, games, products, services, vocations, jobs, travel destinations, politics, religion, advertising, or any other activity, enterprise, product, service, endeavor, idea, belief, passion or item.
&lt;/p&gt;&lt;p&gt;
In order for such matching to be &lt;a href="http://web.engr.oregonstate.edu/%7Eherlock/papers/eval_tois.pdf"&gt;accurate and satisfying&lt;/a&gt; for the user, there must be few false positives ("trust busters"), few false negatives ("missed opportunities") and sufficient true positives ("new favorites"). &lt;a href="http://techdirt.com/articles/20061120/120841.shtml"&gt;No current&lt;/a&gt; recommendation or recommender system comes close to this goal. Matching cannot be based on identical answers or choices, as there can never be enough questions to encompass all of human individuality. More importantly, people's existing interests in music, books, movies, etc. are not strictly linked to their core identities, but often depend more on prevailing cultural and social influences.
&lt;/p&gt;
&lt;blockquote&gt;&lt;i&gt;&lt;b&gt;In our short lives, we are unlikely to ever find the people and things that we would most enjoy and appreciate. This is unfortunate.&lt;/b&gt;&lt;/i&gt;&lt;/blockquote&gt;
&lt;p&gt;
The new internet revolution must match people to other people and things based solely on that core identity. It must do so with only a brief registration, after which you are immediately presented with your ultimate best friends and soulmates, ideal potential business partners, as well as new favorites in every area of life. It should do so without language barriers (e.g. folksonomy or schemas), without the need for a large staff (e.g. Pandora), without a tedious registration (e.g. eHarmony), without having to observe your history (e.g. Amazon.com, Netflix, Last.fm), without requiring existing friends or contacts (e.g. Facebook, MySpace, LinkedIn), and it must be completely free for users while still able to convert many of those recommendations into revenue.
&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5160001102716841899-758618981810170343?l=unirec.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://unirec.blogspot.com/feeds/758618981810170343/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=5160001102716841899&amp;postID=758618981810170343' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/758618981810170343'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5160001102716841899/posts/default/758618981810170343'/><link rel='alternate' type='text/html' href='http://unirec.blogspot.com/2007/12/future-of-internet.html' title='The Future of the Internet'/><author><name>Steven Ruttenberg</name><uri>http://www.blogger.com/profile/09875544429370940477</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://3.bp.blogspot.com/_bu_aLR84YIg/TPqEj9pvavI/AAAAAAAAAFQ/8RaT0NO_GCE/S220/3D_crop1e.jpg'/></author><thr:total>0</thr:total></entry></feed>
