Is mrperolsson the same person as MrPerOlsson?

by Mr Per Olsson 24. January 2010 15:04

A thought came up when I built a simple website for displaying points in a indoor fotball tournament a couple of weeks ago and it was actually a mistake when I was stressed getting data into the system that rendered an error for me that is the source to the question in the title

I wrote Timmele GoiF in one place in the system (id of a team in the tournament) and Timmele GoIF (capital i in GoIF) and of course the system did not work since i used a bit of scripting in the page.
As a human I know, of course, that this is the same team since I know the team. But should a system know this?

Of course, I should not have used the teams name as an ID in the system, it would have been easier/better to have used a numeric id or something that is better for the database engine to index and for, in my case, scripting to understand.

But if names is an identifier in a system, the question arise:
Is a capital version of TIMMELE GOIF the same as the lower case timmele goif?
In older system, for example the ERP Garp, it is a distinction but in more modern SQL server based systems (depending on setup of course) there is no distinction.

How should it be? What is the best, integrationwise?

A lot of questions and, Im sure, a lot of answers.

In my world it should be the same if I want to integrate between two system and if it is a standalone system it could be different since I dont have to talk to others but myself (myself being the system)

Bookmark and Share DotnetKicks dotnetshoutout

If you enjoyed this post, make sure you subscribe to my RSS feed!

Tags: ,

Professional | Data quality

Comments

1/24/2010 3:44:16 PM #

Henrik Liliendahl Søresen

You just hit my favourite subjects Per.

Solving these kind of problems is a journey starting with the uppercase and lowercase problem (case sensitiveness) like:

mrperolsson vs MrPerOlsson

Next challenge may be blanks like:

“MrPerOlsson” vs “Mr Per Olsson”

Then phonetic similarities:

“Mr Per Olsson” vs “Mr Per Olson”

Atomic similarity (like with or without a salutation):

“Mr Per Olsson” vs “Per Olsson”

Fortunately you don’t have to reinvent the wheel. There are lots of algorithms out there dealing with this.

I have founded a LinkedIn group called “Data Matching” for people interested in the subject:

http://www.linkedin.com/groups?gid=2107798

Henrik Liliendahl Søresen Denmark | Reply

1/24/2010 4:08:25 PM #

Per

Thanks for the comment, I joined the group in LinkedIn so I can learn more in the subject.

Per Sweden | Reply

1/24/2010 4:13:44 PM #

trackback

Social comments and analytics for this post

This post was mentioned on Twitter by MrPerOlsson: Is mrperolsson the same as MrPerOlsson - http://snipurl.com/u6bjp  #dataquality

uberVU - social comments | Reply

1/25/2010 5:40:51 AM #

pingback

Pingback from topsy.com

Twitter Trackbacks for
        
        HRAHK - Heavens Restroom And Hells Kitchen - your creative partner | Is mrperolsson the same person as
        [hrahk.se]
        on Topsy.com

topsy.com | Reply

Add comment




  Country flag

biuquote
  • Comment
  • Preview
Loading