- Fragments
- Fragment size
- Libraries
- Adaptors, vectors
- Reads
- Read length
- Single end
- Paired end
- Mate pairs
October 23, 2018
\[N\exp\left(-\frac{(L-T)N}{G}\right)\]
If \(T\) is too small, we risk confusing two parts of the genome
Before any biological considerations, the expected number of times we see a repeat of size \(T\) just by chance, is \(G/4^k\)
Thus, the probability that two reads match wrongly is \(4^{-T}\). And there are almost \(N^2\) pairs of reads
Expected number of chimeras: \(N^2 4^{-T}\)