Questions tagged [unique-data]

11 questions
1635
votes
11 answers

Which hashing algorithm is best for uniqueness and speed?

Which hashing algorithm is best for uniqueness and speed? Example (good) uses include hash dictionaries. I know there are things like SHA-256 and such, but these algorithms are designed to be secure, which usually means they are slower than…
Earlz
  • 22,658
  • 7
  • 46
  • 60
19
votes
2 answers

Is there an elegant way to check unique constraints on domain object attributes without moving business logic into service layer?

I have been adapting domain-driven design for about 8 years now, and even after all these years there is still one thing that has been bugging me: checking for a unique record in data storage against a domain object. In September 2013…
Andy
  • 10,238
  • 4
  • 25
  • 50
3
votes
2 answers

Should email be unique across application users?

I am working on a medical application that will be implemented for US patients and health providers. I want exactly one record per user, with no duplicates. And I believe that putting a unique constraint on email across the…
3
votes
1 answer

Reproducible uniqueness

So, I have a bunch of calls throughout the day, each generated with a UUID1. At the end of each call, the call is processed, and some metrics about it are generated and stored in RethinkDB/Cassandra. Each call will generate…
Jared Mackey
  • 163
  • 5
2
votes
4 answers

Algorithm for permutations of a string of characters, with no duplicate orders

I've seen a lot of good questions about similar problems, but not exactly what I'm looking for. Given: a non-unique set of symbols (or characters) (1, 1, 2, 3). Expected: an output similar to this (the order isn't important, only the elements…
triplethreat78
  • 223
  • 1
  • 8
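One way to approach the problem in this excerpt is to backtrack over symbol counts instead of positions, so that equal symbols are never swapped with each other. The following is only a minimal sketch under that assumption, not the method from any of the answers; the function name `unique_permutations` is hypothetical, and the symbols are assumed to be hashable and sortable.

```python
from collections import Counter


def unique_permutations(symbols):
    """Yield every distinct ordering of `symbols` exactly once."""
    counts = Counter(symbols)   # how many copies of each symbol remain unused
    total = len(symbols)
    current = []                # the ordering being built

    def backtrack():
        if len(current) == total:
            yield tuple(current)
            return
        # Choose each *distinct* symbol at most once per position,
        # which is what prevents duplicate orderings.
        for sym in sorted(counts):
            if counts[sym] == 0:
                continue
            counts[sym] -= 1
            current.append(sym)
            yield from backtrack()
            current.pop()
            counts[sym] += 1

    yield from backtrack()


perms = list(unique_permutations([1, 1, 2, 3]))
```

For the input (1, 1, 2, 3) this yields 4!/2! = 12 orderings, compared with the 24 that a position-based permutation routine would produce before deduplication.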
1
vote
4 answers

Does using multiple UUIDs decrease the chance of collisions exponentially?

For example, if you have a single UUID with a collision probability of x and you concatenate 2 UUIDs, does the collision probability become x^2? val0 = generate_uuid(); val1 = generate_uuid(); final_val = val0 + val1. So with each additional UUID,…
Joan Venge
  • 1,950
  • 2
  • 18
  • 24
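The arithmetic behind this question can be checked with exact fractions. The sketch below assumes version-4 (random) UUIDs drawn independently from a good random source, which gives 122 random bits each; time-based or otherwise correlated UUIDs would not satisfy that assumption. The function names are hypothetical.

```python
from fractions import Fraction

# Assumption: version-4 UUIDs, i.e. 128 bits minus 6 fixed version/variant bits.
RANDOM_BITS_PER_UUID4 = 122


def pair_collision_probability(uuids_per_id):
    """Probability that two specific IDs, each built from `uuids_per_id`
    independent random UUIDs, are identical."""
    return Fraction(1, 2 ** (RANDOM_BITS_PER_UUID4 * uuids_per_id))


def collision_upper_bound(n_ids, uuids_per_id):
    """Union (birthday) bound on any collision among `n_ids` IDs:
    number of pairs times the per-pair probability."""
    pairs = Fraction(n_ids * (n_ids - 1), 2)
    return pairs * pair_collision_probability(uuids_per_id)


x = pair_collision_probability(1)
x_squared = pair_collision_probability(2)
```

Under these assumptions `x_squared` equals `x ** 2`: two concatenated IDs collide only if both halves collide, and the halves are independent. Each additional UUID multiplies the per-pair probability by another factor of x.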
1
vote
3 answers

Deduplicating data during batch processing

I constantly struggle to solve data duplication problems efficiently (storing data from any source in RDBMSes). My main concern is speeding up inserts/batch processing. Scenario: I read data from different sources, mainly in JSON format, and I need…
appl3r
  • 149
  • 4
0
votes
2 answers

Ensure uniqueness of a 10-character alphanumeric string

I have a requirement to develop a service that must generate a few million random, unique alphanumeric strings of length 10 each day (I can't increase this length; it is a customer prerequisite). Given an ID, the next ID must not be…
Rouliboy
  • 137
  • 7
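A simple baseline for this kind of requirement is to draw random candidates and reject any that were already issued. The sketch below is only an illustration under assumptions that are not in the question: a 62-symbol alphabet (upper case, lower case, digits), an in-memory `set` standing in for whatever persistent store or unique constraint a real service would use, and the hypothetical function name `new_unique_id`.

```python
import secrets
import string

ALPHABET = string.ascii_letters + string.digits  # 62 symbols (assumed alphabet)
ID_LENGTH = 10                                   # fixed by the requirement


def new_unique_id(issued):
    """Return a random 10-character ID not present in `issued`, and record it.

    `issued` is a set here; in a real service it would more likely be a
    database table with a unique constraint.
    """
    while True:
        candidate = "".join(secrets.choice(ALPHABET) for _ in range(ID_LENGTH))
        if candidate not in issued:
            issued.add(candidate)
            return candidate


issued_ids = set()
sample = [new_unique_id(issued_ids) for _ in range(1000)]
```

With 62^10 (roughly 8.4 × 10^17) possible values, retries are rare at a few million IDs per day, but the uniqueness check is still what guarantees uniqueness; randomness alone does not.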
0
votes
1 answer

CRM and external systems: How to associate data with contacts?

This question is about Dynamics CRM but is valid for every system that allows merging entities. Having data related to a contact in multiple external systems, how would you know which contact the data is related to? Using ContactId works until that…
Jan Sommer
  • 170
  • 8
-1
votes
1 answer

How to maintain unique, customer-facing Booking or Reservation IDs

I am having trouble creating a unique, customer-facing Booking ID. It's a simple thing to do on the backend, where we can create a UUID and maintain uniqueness. But the UUID looks ugly from a frontend customer's perspective. I wouldn't want to keep…
blogbydev
  • 161
  • 1
  • 6
-1
votes
2 answers

Identifying similar customer data

I have a large database of customers. There's a need to identify customers who have two or more customer IDs. This comes mostly from people at the front-desk creating new customers instead of searching for their existing IDs or doing a bad data…
azathoth
  • 107