VE Team's profileVirtual Earth / Live Map...PhotosBlogListsMore Tools Help

Blog


    September 18

    "80% of Enterprise data has a geo component". Says Who?

    I don't lend a lot of credence to most statistics as it is widely understood that up to 68% of them are bogus to start with, but there is one that certainly has a lot of staying power in mapping and GIS circles. It goes something like “80 percent of data stored in databases has a spatial component.” It’s usually used in context of selling a geocoding solution to add coordinates to these tables or a visualization platform to help you spot trends in your precious 80% that isn’t living up to its potential. I’ve heard it stated with confidence by analysts and seen it used widely in press releases dozens of times over the years. What are the origins of the statement? Is it even plausible? I chuckle every time I read it, not because I don’t believe it to be accurate but because I actually know the origins of it. :-) The most recent example I saw was just the other day buried in the announcement of the EPA’s choice of mapping platforms:

    Because roughly 80 percent of the world’s business data has a location component, the Virtual Earth tool adds significant value to the EPA’s efforts to unlock, view and act upon the location component inherent in its data

    Before I reveal the origins of the stat, let’s think about its validity. I’ve always interpreted ‘data’ in this context to mean ‘records’. In other words if any field or fields of a table is spatial in nature, it would qualify. What’s spatial? I think of that as anything that can be geocoded, addresses being the most common type of data. A post code, Phone number, IP address all can be geocoded as can dozens of other types of data. It’s also important to consider that data in tables keyed to another table containing location should also count; if an order record is tied by customer ID to a customer table with a delivery address, that order has location. So I’ll buy it; the vast majority of data in a database has a spatial component. 80 percent of it in fact if that makes you feel better :-)

    So who came up with it? I’ve heard GIS Hall of famer Roger Tomlinson often credited with it. But there’s a 74% chance that isn’t true. I trace it back to a Product Manager and Product Marketer at MapInfo. To protect the innocent, I won’t reveal their names here but one sounds like a sneeze if you say it fast and the other is often confused with a member of the Pixies. MapMarker was the first mapping product I ever worked on at MapInfo around 94/95. MapMarker could rip through a table of a 100k street addresses or postcodes in an hour on a 486 and assign coordinates, something we take for granted today. To support MapMarker’s value proposition, Sneeze and Pixie came up with the 80% figure and used it in fact sheets and other collateral. From there it just spread and was recycled endlessly until it became accepted as fact. I have no idea what kind of research went into coming up with the 80% figure; it could have simply been a wild guess that sounded reasonable and defensible at the time (probably the origin of many stats ;-) At any rate, I think it’s probably in the ballpark and if nothing else serves to remind us of the value in treating location as a first class citizen in our data stores. If anyone has a source for this stat pre 1994, share it in comments. And if Pixie or Sneeze happen to be reading this and want to reveal their identities and take credit for their stat, jump in.

    UPDATE: a couple of my ex-MapInfo colleagues contacted me to inform that the stat was used at MapInfo before sneeze was at the company. Further, one of them attributes the origins to MapInfo founders Laszlo Bardos and Sean O’Sullivan with Pixie later referencing it in MapMarker's marketing materials.

    Comments (14)

    Please wait...
    Sorry, the comment you entered is too long. Please shorten it.
    You didn't enter anything. Please try again.
    Sorry, we can't add your comment right now. Please try again later.
    To add a comment, you need permission from your parent. Ask for permission
    Your parent has turned off comments.
    Sorry, we can't delete your comment right now. Please try again later.
    You've exceeded the maximum number of comments that can be left in one day. Please try again in 24 hours.
    Your account has had the ability to leave comments disabled because our systems indicate that you may be spamming other users. If you believe that your account has been disabled in error please contact Windows Live support.
    Complete the security check below to finish leaving your comment.
    The characters you type in the security check must match the characters in the picture or audio.
    VE Team has turned off comments on this page.
    July 23
    July 23
    July 23
    July 23
    July 23
    July 22
    July 21
    July 21
    Picture of Anonymous
    (沒有名稱) wrote:
    徵信是個富有挑戰的工作,忙碌、刺激、激情。最后的結局總是有人哭,有人笑。在徵信社工作的幾年時間里,我感悟到很多生活的無奈與殘酷。閑暇時我去中古车市场,感覺工作太緊張,放松一下而已,不是純粹的為了淘二手车。私家偵探也就是徵信工作人員似乎是個很酷的詞,但是如果誰需要私家偵探來幫忙的話,他(她)的婚姻也許已經亮起了紅燈。每天陪著星際客戶喝著咖啡,似乎是一件很爽的事,可是她們的眼淚比咖啡要苦的多。常常聽他們說嘈雜的燈光音響麻痹著他們的神經,讓他們過著醉生夢死的生活。其實,誰不渴望幸福美滿的愛情。沒有團體制服不了那些玩弄生活、玩弄感情的人,于是,徵信社應運而生。這不知道是人們的福音還是不幸,希望天下的每一個人都歡樂著、幸福著。坐月子是女人一生的大事,女人可以为了襁褓中的婴儿放弃一直引用的红酒而吃月子餐,这时候的女人是幸福而伟大的。有人歡喜有人優,有人狂笑有人大哭,我不想看到女人太多的眼淚,辭職后,我做了一位安靜、平和的室內設計師。虽然上下班没有了车接车送,但我租车怡然自得。偶爾和ecosway的朋友小聚,很愜意!
    July 9
    Picture of Anonymous
    yagin wrote:
    販売管理システム
    顧客管理システム
    seo
    sem
    SEOサービスは費用対効果抜群です。
    PPC広告(Overture、AdWords)を中心に、広告の投資対効果を徹底します。
    June 16
    wrote:

    China Tour Select is a China travel agency providing  China toursBeijing Tours, Tibet Tours, and Yangtze river cruises. We specialize in "Tailor-Made" and "Customized" private and group tours to China. We'll try our best to offer you China tours with super value and most considerate service.          

    Apr. 3
    Feb. 14
    Picture of Anonymous
    will wrote:
    my concern about the 80% of information having a spatial component thing...is that you can put put a spin on most types of information claiming that there is a spatial component. 80% sounds like a great way to sell GIS. my question is how useful is information that is claimed to have a spatial component? My computer has a spatial component - it sits on my desk which is in my apartment, which is in my neighborhood. if a record in a table is tied to an address or location on the earth does that mean that it has a spatial component? GEEZ. I think that 80% is about 60% inflated. I mean how often do people use maps and spatial data in their jobs and at work? Ok, if you are a fireman or pizza delivery guy you probaby use maps a lot, but all you're interested in is getting from A to B as quickly as possible. My opinion is that about 20% of information has a useful spatial component. How did I derive that number? I didn't but it is just a guess from how I have seen people use things like maps and GIS. Most of the time people cannot make the connection that information is spatial or maps are more than just maps.
    Let's stop pushing this 80% thing. It makes me feel like I am trying to sell something.
    Nov. 15
    Picture of Anonymous
    (no name) wrote:
     Hilarious and relevant post!  I too, have often wondered about the 80% statistic that I've heard bantered around by others (and myself over the past 10+ years.  Working in local government since 2000, I've read the stat applied specifically to government records and datasets constantly.  I've always thought it was attributed to Tomlinson as well, but I can't confirm.  However, among the myriad GIS implementation plans I've read over the years, at least one had a reference.  I had hoped the reference would be in my "Beyond Maps-GIS and Decision Making in Local Government" book by ESRI press.  The introduction to Chapter One has the line "Because at least 70 to 80 percent of the average local government's work involves land or geographically related issues or tasks..."  Well, that sounds more specific at least, but alas, no reference listed.  Will continue to search! :-)
    Sept. 18

    Trackbacks

    The trackback URL for this entry is:
    http://virtualearth.spaces.live.com/blog/cns!2BBC66E99FDCDB98!9153.trak
    Weblogs that reference this entry
    • None