database design Database design is the organization of data according to a database model. The designer determines what data must be stored and how the data elements interrelate. With this information, they can begin to fit the data to the database model.Teorey, T ...

, a lossless join decomposition is a decomposition of a

relation Relation or relations may refer to: General uses * International relations, the study of interconnection of politics, economics, and law on a global level * Interpersonal relationship, association or acquaintance between two or more people * ...

r

into relations

r_1, r_2

such that a

natural join In relational algebra, a join is a binary operation, written as R \bowtie S where R and S represent relations, that combines their data where they have a common attribute. Natural join Natural join (⨝) is a binary operator that is written as ...

of the two smaller relations yields back the original relation. This is central in removing redundancy safely from

database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...

s while preserving the original data. Lossless join can also be called non-additive.

Definition

A relation

r

on schema

R

''decomposes losslessly'' onto schemas

R_1

and

R_2

\pi_(r) \bowtie \pi_(r) = r

, that is

r

is the natural join of its projections onto the smaller schemas. A pair

(R_1, R_2)

is a ''lossless-join decomposition'' of

R

or said to ''have a lossless join'' with respect to a set of functional dependencies

F

if any relation

r(R)

that satisfies

F

decomposes losslessly onto

R_1

and

R_2

. Decompositions into more than two schemas can be defined in the same way.

Criteria

A decomposition

R = R_1 \cup R_2

has a lossless join with respect to

F

if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...

the closure of

R_1 \cap R_2

includes

R_1 \setminus R_2

R_2 \setminus R_1

. In other words, one of the following must hold: *

(R_1 \cap R_2) \to (R_1 \setminus R_2) \in F^+

(R_1 \cap R_2) \to (R_2 \setminus R_1) \in F^+

Criteria for multiple sub-schemas

Multiple sub-schemas

R_1,R_2,...,R_n

have a lossless join if there is some way in which we can repeatedly perform lossless joins until all the schemas have been joined into a single schema. Once we have a new sub-schema made from a lossless join, we are not allowed to use any of its isolated sub-schema to join with any of the other schemas. For example, if we can do a lossless join on a pair of schemas

R_i, R_j

to form a new schema

R_

, we use this new schema (rather than

R_i

R_j

) to form a lossless join with another schema

R_k

(which may already be joined (e.g.,

R_

)).

Example

* Let

R = \

be the relation schema, with attributes , , and . * Let

F = \

be the set of functional dependencies. * Decomposition into

R_1 = \

and

R_2 = \

is lossless under because

R_1 \cap R_2 = A

and we have a functional dependency

A \rightarrow BC

. In other words, we have proven that

(R_1 \cap R_2 \rightarrow R_1 \setminus R_2) \in F^+

References

{{Reflist Databases Data modeling Database constraints Database normalization Relational algebra