rhu | SQL gripe

SQL is a standard language for expressing database design. It has the following useful constructs:

* A datum may be assigned the value NULL, which means (roughly) "there is no meaningful value to put here." NULL is not considered equal to any other value, even another NULL.

* A column (such as "the mayors of all the cities") may be constrained to have unique values ("no person can be the mayor of more than one city at the same time").

But if you mark a column as unique, MSSQL (at least) won't allow more than one row to have NULL as its value, even though testing two NULLs for equality never returns true. So there's no way to express "If there is a value here, it must be unique; but any number of rows may have an unknown or empty value at the same time."
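Strictly speaking, comparing two NULLs yields neither true nor false but a third "unknown" result. Here is a minimal sketch of that behavior, using Python's built-in sqlite3 module purely as a convenient stand-in (an assumption on my part; the problem described here is about MSSQL, but this piece of NULL semantics is standard SQL). The helper name scalar is made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

def scalar(sql):
    """Run a one-row, one-column query and return its single value."""
    return conn.execute(sql).fetchone()[0]

# Comparisons involving NULL evaluate to "unknown", which sqlite3
# hands back to Python as None (not as 0/false).
null_eq_null = scalar("SELECT NULL = NULL")
null_ne_null = scalar("SELECT NULL <> NULL")
one_eq_null = scalar("SELECT 1 = NULL")

# IS NULL is the test that actually answers true or false.
null_is_null = scalar("SELECT NULL IS NULL")  # 1 (true)

# A WHERE clause keeps only rows whose predicate is true, so an
# "unknown" predicate filters the row out just as false would.
rows = conn.execute(
    "SELECT x FROM (SELECT 'kept' AS x) WHERE NULL = NULL"
).fetchall()
```

Because "unknown" behaves like false inside a WHERE clause, the two are easy to conflate in everyday use.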

Surely I'm not the first person to run into this problem. But I can't find any good solutions --- neither in my SQL books nor via Google.

Edited to add: Thanks to abbasegal for suggesting setting up a unique nonclustered index on a schemabound view.
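The idea behind that suggestion is to enforce uniqueness only over the rows where the column is non-NULL. SQLite has no indexed views, so the sketch below swaps in a different technique that expresses the same rule: a partial unique index. The table, column, and index names are hypothetical, and Python's sqlite3 is used only so that the example runs anywhere. Note that SQLite's plain UNIQUE already tolerates repeated NULLs, unlike MSSQL, so the WHERE clause here illustrates the technique rather than working around SQLite itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE cities (
        name  TEXT PRIMARY KEY,
        mayor TEXT              -- NULL means "no mayor right now"
    );
    -- Uniqueness is checked only among rows that actually have a mayor.
    CREATE UNIQUE INDEX one_city_per_mayor
        ON cities (mayor) WHERE mayor IS NOT NULL;
""")

def try_insert(name, mayor):
    """Insert a row; return True on success, False if a constraint rejects it."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute("INSERT INTO cities VALUES (?, ?)", (name, mayor))
        return True
    except sqlite3.IntegrityError:
        return False

results = [
    try_insert("City A", "Alice"),
    try_insert("City B", None),
    try_insert("City C", None),     # a second NULL is accepted
    try_insert("City D", "Alice"),  # a duplicate mayor is rejected
]
row_count = conn.execute("SELECT COUNT(*) FROM cities").fetchone()[0]
```

In this sketch any number of cities can be without a mayor, but no one can be mayor of two cities at once.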


From: sethg

You could also refactor the table into two separate tables, i.e., instead of

CREATE TABLE one_big_table (
    id INTEGER PRIMARY KEY,
    foo INTEGER UNIQUE -- may be null
    -- other fields
);

do something like this:

CREATE TABLE first_little_table (
    id INTEGER PRIMARY KEY
    -- other fields
);

CREATE TABLE second_little_table (
    id INTEGER NOT NULL REFERENCES first_little_table (id),
    foo INTEGER UNIQUE NOT NULL
);

CREATE VIEW one_big_view AS
    SELECT * FROM first_little_table LEFT JOIN second_little_table USING (id);
-- SQL99 outer join syntax

There are relational-theory purists (well, at least one purist that I know of) who would say that you should always refactor a schema this way instead of allowing fields in a table to be null.
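To see the refactoring behave as described, here is a small runnable sketch using Python's built-in sqlite3 module (my choice of engine, not the commenter's; other databases may differ in details). It mirrors the table and view names above, with the REFERENCES clause written as first_little_table (id); the sample ids and foo values are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK checks off by default
conn.executescript("""
    CREATE TABLE first_little_table (
        id INTEGER PRIMARY KEY
    );
    CREATE TABLE second_little_table (
        id  INTEGER NOT NULL REFERENCES first_little_table (id),
        foo INTEGER UNIQUE NOT NULL
    );
    CREATE VIEW one_big_view AS
        SELECT * FROM first_little_table
        LEFT JOIN second_little_table USING (id);
""")

# Three rows in the main table; only row 1 has a foo.
conn.executemany("INSERT INTO first_little_table VALUES (?)", [(1,), (2,), (3,)])
conn.execute("INSERT INTO second_little_table VALUES (1, 100)")
conn.commit()

def duplicate_foo_rejected():
    """Try to give row 2 the foo that row 1 already has."""
    try:
        with conn:
            conn.execute("INSERT INTO second_little_table VALUES (2, 100)")
    except sqlite3.IntegrityError:
        return True
    return False

rejected = duplicate_foo_rejected()

# Rows with no entry in second_little_table show up with foo = NULL.
view_rows = conn.execute(
    "SELECT id, foo FROM one_big_view ORDER BY id"
).fetchall()
```

Rows 2 and 3 both appear in the view with a NULL foo, while the UNIQUE constraint still blocks a repeated non-NULL value.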

From: 530nm330hz.livejournal.com

Tried that. But in this case there are other constraints that can be expressed only within one table.

The actual example is that my main table is a list of Engines, which have among their many properties a Status, which is an enumerated tinyint, and a CurrentTaskId, which is a foreign key into the Tasks table. I want to ensure that no task is simultaneously assigned to more than one engine, that no engine is simultaneously assigned to more than one task, and that an engine's CurrentTaskId is constrained to be NULL for certain status values (such as INITIALIZING or AVAILABLE) and non-NULL for other status values (such as ASSIGNED or WORKING). MSSQL, at least, does not allow check constraints to include SELECT statements, and it would be prohibitively expensive to wrap each update in a transaction with deferred checking. So I chose the DB design that lets me describe two out of my three constraints, and tried to ensure the third by careful programming. But apparently there's a race condition that I didn't catch, and I want to use SQL constraints to nail the thing.

In my experience, relational-theory purists don't concern themselves much with piddling details like throughput and preventing deadlock by eliminating the need for transactions.
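For concreteness, the three constraints on Engines described above can be sketched in a single table. This is only an illustration under stated assumptions: it runs on SQLite through Python's sqlite3 module, where UNIQUE permits repeated NULLs, so it does not reproduce the MSSQL limitation that prompted the gripe. Engines, Tasks, Status, and CurrentTaskId come from the description; EngineId, TaskId, and the numeric status codes are hypothetical.

```python
import sqlite3

# Hypothetical status codes; the statuses are named above but not their values.
INITIALIZING, AVAILABLE, ASSIGNED, WORKING = 0, 1, 2, 3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
    CREATE TABLE Tasks (TaskId INTEGER PRIMARY KEY);
    CREATE TABLE Engines (
        EngineId      INTEGER PRIMARY KEY,
        Status        INTEGER NOT NULL CHECK (Status IN (0, 1, 2, 3)),
        -- A single column, so an engine holds at most one task.
        -- UNIQUE, so a task belongs to at most one engine.
        CurrentTaskId INTEGER UNIQUE REFERENCES Tasks (TaskId),
        -- Idle statuses require NULL; busy statuses require a task.
        CHECK ((Status IN (0, 1) AND CurrentTaskId IS NULL)
            OR (Status IN (2, 3) AND CurrentTaskId IS NOT NULL))
    );
    INSERT INTO Tasks VALUES (10), (11);
""")

def accepted(engine_id, status, task_id):
    """Insert an engine row; return False if any constraint rejects it."""
    try:
        with conn:
            conn.execute(
                "INSERT INTO Engines VALUES (?, ?, ?)",
                (engine_id, status, task_id),
            )
        return True
    except sqlite3.IntegrityError:
        return False

outcomes = {
    "idle engine, no task": accepted(1, AVAILABLE, None),
    "second idle engine, no task": accepted(2, INITIALIZING, None),
    "busy engine on task 10": accepted(3, WORKING, 10),
    "second engine on task 10": accepted(4, ASSIGNED, 10),
    "busy engine, no task": accepted(5, WORKING, None),
    "idle engine holding a task": accepted(6, AVAILABLE, 11),
}
```

The first three inserts succeed and the last three are rejected, all without a transaction wrapped around a hand-written check. On MSSQL the CHECK part carries over directly, and the UNIQUE part is the piece that needs the workaround.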
