Abstract.

This paper studies linear and affine error-correcting codes for correcting synchronization errors such as insertions and deletions. We call such codes linear/affine insdel codes. Linear codes that can correct even a single deletion are limited to having an information rate at most \(1/2\) (achieved by the trivial two fold repetition code). Previously, it was (erroneously) reported that more generally no nontrivial linear codes correcting \(k\) deletions exist, i.e., that the \((k+1)\) -fold repetition codes and its rate of \(1/(k+1)\) are basically optimal for any \(k\) . We disprove this and show the existence of binary linear codes of length \(n\) and rate just below \(1/2\) capable of correcting \(\Omega (n)\) insertions and deletions. This identifies rate \(1/2\) as a sharp threshold for recovery from deletions for linear codes and reopens the quest for a better understanding of the capabilities of linear codes for correcting insertions/deletions. We prove novel outer bounds and existential inner bounds for the rate vs. (edit) distance trade-off of linear insdel codes. We complement our existential results with an efficient synchronization-string-based transformation that converts any asymptotically good linear code for Hamming errors into an asymptotically good linear code for insdel errors. Last, we show that the \(\frac{1}{2}\) -rate limitation does not hold for affine codes by giving an explicit affine code of rate \(1-\epsilon\) which can efficiently correct a constant fraction of insdel errors.

Keywords

  1. error correcting codes
  2. linear code
  3. insertion/deletion errors

MSC codes

  1. 94B05
  2. 94B25
  3. 94B35
  4. 94B65

Get full access to this article

View all available purchase options and get full access to this article.

References

AEL95.
N. Alon, J. Edmonds, and M. Luby, Linear time erasure codes with nearly optimal recovery, in Proceedings of the 36th Annual IEEE Symposium on Foundations of Computer Science, 1995, pp. 512–519.
AGFC07.
K. A. S. Abdel-Ghaffar, H. C. Ferreira, and L. Cheng, On linear and cyclic codes for correcting deletions, in Proceedings of the 2007 IEEE International Symposium on Information Theory (ISIT), 2007, pp. 851–855.
AGHP92.
N. Alon, O. Goldreich, J. Håstad, and R. Peralta, Simple constructions of almost k-wise independent random variables, Random Structures Algorithms, 3 (1992), pp. 289–304.
BGH17.
B. Bukh, V. Guruswami, and J. Håstad, An improved bound on the fraction of correctable deletions, IEEE Trans. Inform. Theory, 63 (2017), pp. 93–103.
BGZ15.
J. Brakensiek, V. Guruswami, and S. Zbarsky, Efficient Low-Redundancy Codes for Correcting Multiple Deletions, CoRR, abs/1507.06175, 2015.
BGZ18.
J. Brakensiek, V. Guruswami, and S. Zbarsky, Efficient low-redundancy codes for correcting multiple deletions, IEEE Trans. Inform. Theory, 64 (2018), pp. 3403–3410.
CHL19.
K. Cheng, B. Haeupler, X. Li, A. Shahrasbi, and K. Wu, Synchronization strings: Efficient and fast deterministic constructions over small alphabets, in Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (SODA), 2019.
CJLW18.
K. Cheng, Z. Jin, X. Li, and K. Wu, Deterministic document exchange protocols, and almost optimal binary codes for edit errors, in Proceedings of the 59th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2018, Paris, IEEE Computer Society, 2018, pp. 200–211.
CJLW19.
K. Cheng, Z. Jin, X. Li, and K. Wu, Block edit errors with transpositions: Deterministic document exchange protocols and almost optimal binary codes, in Proceedings of the 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019), LIPIcs Leibniz Int. Proc. Inform. 132, C. Baier, I. Chatzigiannakis, P. Flocchini, and S. Leonardi, eds., Dagstuhl, Germany, 2019, pp. 37:1–37:15.
GHS20.
V. Guruswami, B. Haeupler, and A. Shahrasbi, Optimally resilient codes for list-decoding from insertions and deletions, in Proceedings of the 52nd Annual ACM Symposium on Theory of Computing, 2020, pp. 524–537.
GW17.
V. Guruswami and C. Wang, Deletion codes in the high-noise and high-rate regimes, IEEE Trans. Inform. Theory, 63 (2017), pp. 1961–1970.
Hae19.
B. Haeupler, Optimal document exchange and new codes for insertions and deletions, in Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science, 2019, pp. 334–347.
HER18.
S. K. Hanna and S. E. Rouayheb, Guess & check codes for deletions, insertions, and synchronization, IEEE Trans. Inform. Theory, 65 (2018), pp. 3–15.
HRS19.
B. Haeupler, A. Rubinstein, and A. Shahrasbi, Near-linear time insertion-deletion codes and \((1+\epsilon)\) -approximating edit distance via indexing, in Proceedings of the ACM Symposium on Theory of Computing (STOC), 2019, pp. 697–708.
HS17.
B. Haeupler and A. Shahrasbi, Synchronization strings: Codes for insertions and deletions approaching the Singleton bound, in Proceedings of the 49th Annual ACM Symposium on Theory of Computing, ACM, 2017, pp. 33–46.
HS18.
B. Haeupler and A. Shahrasbi, Synchronization strings: Explicit constructions, local decoding, and applications, in Proceedings of the 50th Annual ACM Symposium on Theory of Computing, ACM, New York, 2018, pp. 841–854.
HSS18.
B. Haeupler, A. Shahrasbi, and M. Sudan, Synchronization strings: List decoding for insertions and deletions, in Proceedings of the International Colloquium on Automata, Languages and Programming (ICALP), 2018, pp. 76:1–76:14.
HSV18.
B. Haeupler, A. Shahrasbi, and E. Vitercik, Synchronization strings: Channel simulations and interactive coding for insertions and deletions, in Proceedings of the International Colloquium on Automata, Languages and Programming (ICALP), 2018, pp. 75:1–75:14.
HY18.
T. Hayashi and K. Yasunaga, On the list decodability of insertions and deletions, in Proceedings of the IEEE International Symposium on Information Theory (ISIT), IEEE, 2018, pp. 86–90.
SAK01.
K. W. Shum, P. Vijay Kumar, I. Aleshnikov, H. Stichtenoth, and V. Deolalikar, A low-complexity algorithm for the construction of algebraic-geometric codes better than the Gilbert-Varshamov bound, IEEE Trans. Inform. Theory, 47 (2001), pp. 2225–2241.
SB19.
J. Sima and J. Bruck, Optimal k-deletion correcting codes, in Proceedings of the IEEE International Symposium on Information Theory, 2019, pp. 847–851.
Spi96.
D. A. Spielman, Linear-time encodable and decodable error-correcting codes, IEEE Trans. Inform. Theory, 42 (1996), pp. 1723–1731.
SZ99.
L. J. Schulman and D. Zuckerman, Asymptotically good codes correcting insertions, deletions, and transpositions, IEEE Trans. Inform. Theory, 45 (1999), pp. 2552–2557.
WZ17.
A. Wachter-Zeh, List decoding of insertions and deletions, IEEE Trans. Inform. Theory, 64 (2017), pp. 6297–6304.

Information & Authors

Information

Published In

cover image SIAM Journal on Discrete Mathematics
SIAM Journal on Discrete Mathematics
Pages: 748 - 778
ISSN (online): 1095-7146

History

Submitted: 5 January 2022
Accepted: 27 November 2022
Published online: 6 June 2023

Keywords

  1. error correcting codes
  2. linear code
  3. insertion/deletion errors

MSC codes

  1. 94B05
  2. 94B25
  3. 94B35
  4. 94B65

Authors

Affiliations

Center on Frontiers of Computing Studies, Computer Science Department, Peking University.
Venkatesan Guruswami
Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213 USA.
Bernhard Haeupler
Department of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213 USA.
Xin Li
Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218 USA.

Funding Information

National Science Foundation: CCF-1814603, CCF-1910588, CCF-1750808
National Science Foundation: CCF-1617713, CCF-1845349
Funding: The first author was supported in part by a start-up funding of Peking University, a Simons Investigator award (409864, David Zuckerman), and NSF award CCF-1617713. The second author’s research was supported in part by NSF grant CCF-1814603. The third author was supported in part by NSF awards CCF-1814603, CCF-1910588, NSF CAREER award CCF-1750808, and by a Sloan Research Fellowship. The fourth author was supported by NSF award CCF-1617713 and NSF CAREER award CCF-1845349.

Metrics & Citations

Metrics

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited By

View Options

View options

PDF

View PDF

Full Text

View Full Text

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share on social media