Knotting fingerprint

The topological fingerprint was introduced in [1] and motivated by [2]; for various applications see also [3]. It was successfully used to describe entanglement of all proteins [4] deposited in the PDB. Herein, we used it to describe the crucial information about the topology of the given chromosome structure. The fingerprint encodes information about locations of the "knot core", "knot tails", "slipknot loops", and "slipknot tails", whose definitions are given below.




  1. Matrix diagram - Knotting fingerprint
  2. Fingerprints based on different closure methods
  3. Remarks



Matrix diagram - Knotting fingerprint

The knotting fingerprint takes a form of a matrix diagram by using the procedure established by us before for proteins [4]. The matrix diagram is constructed as follows. For a given chain of length N, all its subchains spanning between residues k and l (for 1≤k<l≤N) are analyzed. If a subchain from k to l is knotted (as determined by the analysis described in section Knot detection), then a point with coordinates (k,l) is denoted in relevant color in a plot. For example, points representing knots 31, 41, 51, 52, 62, 63, 76 are respectively green, red, orange, grey, violet. All more complex knots are shown with different shades of grey. The intensity of a color represents a probability of detecting a given knot (knots are detected only with some probability, which is a consequence of a choice of its closure to a closed loop, as explained in section Knot detection). The figure below presents an example of a knotting fingerprint, which in this case contains areas representing a knot and two types of slipknots.

Fig. 1 Left panel example of a knotting fingerprint with S3151624176886352820 topological notation detected in chromosome o, model 01, cell 1 [5]. The matrix arises from the random closure method (20 randomly selected points on the sphere). The right panel shows the location of the knot core, knot tails, slipknot tail and slipknot loop for the sample slipknot of type 31. Please note that only the knot with the highest probability is shown.

In a given knot or slipknot several geometric elements can be distinguished. These elements are denoted in different colors along the diagonal of the matrix diagram as shown schematically in the figure below:

  • knot core (thick blue line in Fig. 1) the shortest subchain for which a knot is detected (i.e. after cutting beads from any terminal of such a subchain, just a trivial knot would be detected); note that "knot core" is defined also for a slipknot
  • knot tail (thin grey lines in Fig. 1, a segment between one terminal of a knotted chain and its "knot core"
  • slipknot tail (thin green line in Fig. 1): in a structure with a slipknot (i.e. a configuration in which the whole chain is unknotted, but it has a knotted subchain), the longest segment starting at one terminal, for which no change in topology is detected
  • slipknot loop (thin orange line in figures above): in a structure with a slipknot, a segment between a "slipknot tail" and a "knot core".

The matrix diagram is interactive: after choosing a knot type (if more than one knot type is detected) from the table, the data corresponding to this knot is shown in the diagram Fig. 2. By default the data corresponding to the knot formed by the whole chain (for knotted chromosomes), or the most complicated slipknot (for chromosomes with slipknots) is shown in the diagram.

Fig. 2 An example for the data presentation for a slipknotted chromosome. In this example the analysis reveals that the entire analyzed chain (chromosome r, model 01, cell 1 [4]) forms an unknot and has a subchain forming a 31 knot. Diagram in the top left: knotting fingerprint revealing the positions of the subchain forming a 31 knot. Top right: graphical representation of the chromosome in JSmol, with the knot core, slipknot and knot tails highlighted in various colors (knot core in blue, slipknot loop in orange, and the knot tail in grey). In grey the knot tail is shown. In the middle left buttons: iHelp - description how to use JSmol, Reload of data and Download - SVG maps and PNG map. Table at the bottom: detailed data about knots and slipknots formed by backbone subchains.




Fingerprints based on different closure methods

Fig. 3 presents the fingerprint of chromosome o, model 1, cell 1 determined based on three different closure methods: centre of mass, random closure and direct closure.

Fig. 3 Fingerprint for chromosome o, model 1, cell 1 determined based on three different closure methods: centre of mass, random closure and direct closure, respectively




Remarks

Remark 1: Please note that topological fingerprints based on the current available data [5] is usually much more complicated than this presented on the Fig. 1.

Remark 2: In the case of the random closure method, to speed up the time of calculation, the fingerprint is determined for every seventh configuration.




[1] Sulkowska, J.I., Rawdon, E.J., Millett, K.C., Onuchic, J.N. and Stasiak, A. (2012) Conservation of complex knotting and slipknotting patterns in proteins. Proc. Natl. Acad. Sci. U.S.A. 109, E1715-E1723
[2] King NP, Yeates EO, Yeates TO. (2007). Identification of rare slipknots in proteins and their implications for stability and folding. J. Mol. Biol. Oct 2007. 373(1):153-66.
[3] E.J. Rawdon, K.C. Millett, J.I. Sulkowska and A. Stasiak, Knot localization in proteins, Biochem. Soc. Trans., 41, 538-541, (2013).
[4] Jamroz M, Niemyska W, Rawdon EJ, Stasiak A, Millett KC, Sulkowski P, Sulkowska JI, KnotProt: a database of proteins with knots and slipknots, Nucleic Acids Res. (2014), 43: D306-D314
[5] Stevens, T.J.; Lando, D.; Basu, S.; Atkinson, L.P.; Cao, Y.; Lee, S.F.; Leeb, M.; Wohlfahrt, K.J.; Boucher, W.; O’Shaughnessy-Kirwan, A.; et al. 3D structures of individual mammalian genomes studied by single-cell Hi-C. Nature 2017, 544, 59-64.


KnotGenom | Interdisciplinary Laboratory of Biological Systems Modelling