De-dupe and standardize data
I would prefer a focus on completely de-duping and standardizing the existing data before adding any new features.
Starting “The Big Clean” now!
This is a really big project and is going to consist of a bunch of small projects to make sure things stay cleaner and more standardized in the future.
And a lot of it will be behind the scenes so don’t be surprised if you aren’t seeing much happening.
I see so many duplicates of the same show… I think it would help if we had an option to flag them… is there one ?
Hi! Just joined, and am excited about logging shows.
I've been (what I would call) a frontline volunteer data quality checker for the FamilySearch genealogical project for nine years. Adding, editing, and a lot of dedup and standardizing are all involved. If dev(s) open up more micro-task crowdsourcing, I’d enjoy contributing to the "sweep.”
Also: I agree with the recs about an entry “merge” function, and the point made about one of the standardization perils being some user’s feelings of entry “ownership” (seen that a lot on FS.)
Concert Mojo commented
Masquerade (ATL) Heaven, Hell and Purgatory are different venues. They used to be in the same building but are now in different buildings (though right near each other). They must remain separate since there are sometimes concerts in all 3 on the same night. I have often seen shows in 2 or 3 on the same night. Not sure how important it is to reflect if it is OG Masquerade or new one in the Underground.
@Justine Baddeley, we're working on a way to separate out concerts that 'appear' to be duplicates (same band, date, venue, location) but really aren't. We'll be implementing a new option where you'll be able to mark a band as an 'Early Show', 'After Party', etc and that will keep them from being merged together. Hopefully we'll have that released in July.
Good idea to de-dup... until I've noticed it merged 2 separate concerts played at the same venue on the same day into 1, the bands played an all ages afternoon show then later an +18 only show the same day, I went to both so instead of saying that I went to 2 shows it's only counting as 1. Even updated the tour title to distinguish the 2 and they still merged
Jill Steiner McCall commented
Agree with Dan Curhan below IE: The Masquerade in Atlanta has an old locations from the 90's and a new locations stating in the 2000's - plus multiple locations for different stages Hell & Heaven.
Granted the old and new are 'technically' valid as they do have different street addresses if those were to be used, but the Heaven and Hell stage locations are definitely redundant.
Also the band Drivin' N' Cryin' is entered something like 4-5 different time with various different apostrophe options allowed. There should be a way to force people to use only one option for them.
dan curhan commented
It's not just concerts with duplicity - search "middle east" under Venues. There are THREE PAGES of results, all for what appear to be the same venue: The Middle East in Cambridge, MA, which has both an upstairs stage and a downstairs stage so should return, at most, two results.
I love the service and the website format and all, but the data is all over the place!
Greg Fasolino commented
I think the #1 priority should be merging all duplicates. Too many concerts have multiple entries for the exact same event. Even worse, we have some members who think their entries are "theirs" and belong to them uniquely. Each concert that occurred needs to have just one merged entry to avoid this kind of stuff.
Greg Fasolino commented
Love this website, the idea is fantastic, but it needs much more standardization, merging of duplicate entries, etc. Is there a way to make it so people cannot add in duplicates? (Setlist.fm does this automatically).
Matt Suda commented
The search has not been working lately. Using basic keywords as suggested on the search page comes back with no results found with only suggestions to import from SongKick. This has resulted in many duplicated concert listings lately.
Update: After contacting support this was fixed
Mark F. King commented
Consolidating multiple entries of the same concert would tidy things up. Once you figure out how exactly you are going to do this I would like to help with it. Let me know. Thanks.
Only two days since I'm scanning the data base and yes, there are many duplicates. I hope the hosts of this web place want it be tidied up. I think, it is about 20% maybe more of the concert numbers should be down.
There are a lot of duplicate entries for concerts or festivals and it makes it difficult to search through the database. If there was a button to suggest combining entires, or if we were able to manually do it that would be amazing. A way so the people won't lose access to that archived concert but everyone will be combined under one. For festivals in particular it's frustrating, because some either are lacking of majority of the artists attending, or the other minor information like location is missing. So if it was all combined under one it would be much cleaner and easier plus since we can pick which artists we actually saw there shouldn't be that much of an issue. AND! if someone opens up a concert archive and they can't recall most of the information other people would be able to add to that archive rather than having 10 different entries with none of them having complete information. (Also maybe being able to add a header photo to a concert page, like the tour poster)
Is there anyway you can stop people putting multiple venues, places, bands etc. Because they spell differently or use commas or different grammar. It’s annoying to find multiple concerts purely because entries have been misspelt
Troy C. commented
Completely agree with this. There are many concerts with 3+ entries because of slight naming differences. It also makes it tough to search by tour or venue
Davey Gravy commented
I wholeheartedly agree with this enhancement. There are too many duplicates and the data could use a clean sweep.
Jayson Hanks commented
Agreed, but i would add that you shouldnt use abbreviations such as St. paul, but instead Saint Paul, or Saint Louis, MO
Ability to merge a concert so that there aren't loads of different posts for the same concert
Love this site so far! Great job overall with plenty of room to grow. Here are a few ideas from a design/UX standpoint.
1. Generate a universal list for band names, venues etc so that when you enter new concert details the dropdown menu suggestions aren't the same band in 3 different spellings. For example, a band like St. Paul and the Broken Bones has a second instance as St. Paul & the Broken Bones. The only difference is the ampersand instead of the longform "and". It shows up like 2 different bands.
Thanks for reading. I really do think this is a great website overall! Keep it going.