One Line Answers

There needs to be a search engine that provides one line answers to questions. I just finished two tasks: helping my kids with their homework and debugging the intricacies from a LAMP program as I ported it from Ubuntu to Centos. If you do a google for questions, you get paid slots, correct page heading, but no relevant info, signups to see answers, etc. Any question marked with a [Solved] should be extracted and voted on and returned as a one line answer. How do you tell the version of your mysql? Where's the repo file for Chromium? Does pecl-xxx run on 64 bit? How do you turn on 64 bit flash in Chrome? Or even simple stuff, where did the Chumash Indians live? What crafts did they do? Right now all Google results take some of the excerpts out, but not exactly what's needed. Maybe we need a 140 character best answer similar to how they used to do the code obfuscation/optimization contests. Who can have the best 140 character answser and then include it inline in the search results.

There needs to be a search engine that provides one line answers to questions. I just finished two tasks: helping my kids with their homework and debugging the intricacies from a LAMP program as I ported it from Ubuntu to Centos. If you do a google for questions, you get paid slots, correct page heading, but no relevant info, signups to see answers, etc. Any question marked with a [Solved] should be extracted and voted on and returned as a one line answer. How do you tell the version of your mysql? Where's the repo file for Chromium? Does pecl-xxx run on 64 bit? How do you turn on 64 bit flash in Chrome? Or even simple stuff, where did the Chumash Indians live? What crafts did they do? Right now all Google results take some of the excerpts out, but not exactly what's needed. Maybe we need a 140 character best answer similar to how they used to do the code obfuscation/optimization contests. Who can have the best 140 character answser and then include it inline in the search results.

Gregory Alan Bolcer
10:52 AM Sep 12 2011

Do you think the best such engine is powered by humans or powered by machines?

Is it curated or is it aggregated?

Will asking the #lazyweb on Twitter suffice, or is lazyweb insufficient?

Adam Rifkin
3:17 PM Sep 12 2011

The lazyweb takes too long, but distilling other people's lazyweb answers might work. I think it's a combination of archeology and presentation that's missing. The distilling piece might be able to be machines.

Gregory Alan Bolcer
3:27 PM Sep 12 2011

One way to approach about this problem predates the internet and web -- basically how do we extract "useful" information from long texts like books and papers that were the output of scholarly work. This is what info-science pioneer Paul Otlet tried, although with manual labor and index cards. http://en.wikipedia.org/wiki/Paul_Otlet#The_Mundaneum

A foundational assumption there is that are discrete facts can be separated out of their context.

Google (and search engines) and their presentation of "relevant" snippets are kind of a continuation of that, extracting from semi-structured text, but using automated techniques.

DuckDuckGo's Zero Click stuff - http://duckduckgo.com/api.html (and Google's ill-fated coop and subscribed links) were something a little different - given more structured info they try to "answer" specific queries with facts. I guess Wolfram Alpha is like that too in some ways.

The broad alternative strategy to this is rather than pre-coordating a way to refine and repurpose existing information sources, create new ones focused on question answering as needed -- I see http://ask.metafilter.com, http://quora.com, and the newly release http://jig.com to be in that category.

Given the decreasing quality of search results, automated crap content farms used to game them for financial reason, and that monopolies tend to not innovate as quickly in the absence of competition, I expect much of the interesting work in improving the experience for these type of fact-queries will be from the latter category.

The big problem I see for those sites is scale and authority. It's hard to scale a community to answer lots of questions, it's hard to incentivize it as communities grow, and often the knowledge and authority to answer questions effectively is inversely correlated to the amount of time you have to mess around on the internet.

One way to approach about this problem predates the internet and web -- basically how do we extract "useful" information from long texts like books and papers that were the output of scholarly work. This is what info-science pioneer Paul Otlet tried, although with manual labor and index cards. <a href="http://en.wikipedia.org/wiki/Paul_Otlet#The_Mundaneum">http://en.wikipedia.org/wiki/Paul_Otlet#The_Mundaneum</a>

A foundational assumption there is that are discrete facts can be separated out of their context.

Google (and search engines) and their presentation of "relevant" snippets are kind of a continuation of that, extracting from semi-structured text, but using automated techniques.

DuckDuckGo's Zero Click stuff - <a href="http://duckduckgo.com/api.html">http://duckduckgo.com/api.html</a> (and Google's ill-fated coop and subscribed links) were something a little different - given more structured info they try to "answer" specific queries with facts. I guess Wolfram Alpha is like that too in some ways.

The broad alternative strategy to this is rather than pre-coordating a way to refine and repurpose existing information sources, create new ones focused on question answering as needed -- I see <a href="">http://ask.metafilter.com,</a> <a href="">http://quora.com,</a> and the newly release <a href="http://jig.com">http://jig.com</a> to be in that category.

Given the decreasing quality of search results, automated crap content farms used to game them for financial reason, and that monopolies tend to not innovate as quickly in the absence of competition, I expect much of the interesting work in improving the experience for these type of fact-queries will be from the latter category.

The big problem I see for those sites is scale and authority. It's hard to scale a community to answer lots of questions, it's hard to incentivize it as communities grow, and often the knowledge and authority to answer questions effectively is inversely correlated to the amount of time you have to mess around on the internet.

Adam Mathes
3:57 PM Sep 12 2011

I'm trying, I'm trying...

Eric Barker
5:28 PM Sep 13 2011

Adam this was a great answer. I'm giving you Props!!

Adam Rifkin
10:20 AM Sep 14 2011

I'm working on something that (among other things) should address exactly this: crafting perfect snippets, by human cooperation, for any query that could reasonably answered by a perfect snippet. Ping me for more details!

Gordon Mohr
2:16 PM Sep 13 2011

Hey Gordon! But of course you *always* do interesting stuff! Will drop you an email offlist.

Gregory Alan Bolcer
4:52 PM Sep 13 2011

Where can we learn more?

Adam Rifkin
5:06 PM Sep 21 2012

I presented the rationale and demo'd an early prototype in a recorded presentation last year, which you can view at en.thunkpedia.org. You can also follow @thunkpedia for tweets-on-the-theme and project-updates.

Such updates have been rare because other things this year put Thunkpedia on the backburner for a while, but I've recently picked it back up and will have something open for real users "real soon".

Gordon Mohr
7:19 PM Sep 21 2012

Sweet. How big is a thunk?

Adam Rifkin
12:16 AM Sep 23 2012

Initial thunk format is...

Always displayed:

a 1-line (~66 char) context: this appears before the main text, in lower-contrast, to provide a label and disambiguation. However, it need not be unique per thunk, so is not quite a traditional 'title'.
an up-to-333-character main body 'text': Just enough for a complete sentence or a few, with simple formatting … but no inline outlinks.

Displayed below if expanded in-place (aka 'the annex'):

about 1 line's worth of 'hooks': keywords/synonyms/tags to assist findability (white-hat SEO)
'links': up to 3 outlinks, with title/anchortext, to resources that support the above 'text'
other metadata to-be-determined

The goal is for result lists of thunks to be easy to scan, read, and reorder in-place. This should minimize the need to navigate away or reformulate queries to get the full story, and better enable instant inline review/feedback.

Initial thunk format is...

Always displayed:

<ul><li>a 1-line (~66 char) context: this appears before the main text, in lower-contrast, to provide a label and disambiguation. However, it need not be unique per thunk, so is not quite a traditional 'title'.</li>

<li>an up-to-333-character main body 'text': Just enough for a complete sentence or a few, with simple formatting … but no inline outlinks.</li>

</ul>

Displayed below if expanded in-place (aka 'the annex'):

<ul><li>about 1 line's worth of 'hooks': keywords/synonyms/tags to assist findability (white-hat SEO)</li>

<li>'links': up to 3 outlinks, with title/anchortext, to resources that support the above 'text'</li>

<li>other metadata to-be-determined</li>

</ul>

The goal is for result lists of thunks to be easy to scan, read, and reorder in-place. This should minimize the need to navigate away or reformulate queries to get the full story, and better enable instant inline review/feedback.

Gordon Mohr
11:13 AM Sep 23 2012

This is awesome and reminds me a lot of Ted Nelson for some reason.

Adam Rifkin
12:03 PM Sep 23 2012

One Line Answers

Gregory Alan Bolcer stashed this in Search

You May Also Like: