Take a look at the Recent articles

A quantitative understanding of human sex chromosomal genes

Sk. Sarif Hassan

Department of Mathematics, University of Petroleum and Energy Studies, Dehradun, India

E-mail : bhuvaneswari.bibleraaj@uhsm.nhs.uk

Pabitra Pal Choudhury

MCKV Institute of Engineering, Liluah, Howrah, West Bengal, India

Antara Sengupta

MCKV Institute of Engineering, Liluah, Howrah, West Bengal, India

Binayak Sahu

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

Rojalin Mishra

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

Devendra Kumar Yadav

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

Saswatee Panda

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

Dharamveer Pradhan

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

Shrusti Dash

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

Gourav Pradhan

Visiting Student at Institute of Mathematics & Applications, Bhubaneswar 751003, Odisha, India

DOI: 10.15761/FGNAMB.1000137

Article
Article Info
Author Info
Figures & Data

Abstract

In the last few decades, the human allosomes are engrossed in an intensive attention among researchers. The allosomes are now already been sequenced and found there are about 2000 and 78 genes in human X and Y chromosomes respectively. The hemizygosity of the human X chromosome in males exposes recessive disease alleles, and this phenomenon has prompted decades of intensive study of X-linked disorders. By contrast, the small size of the human Y chromosome, and its prominent long-arm heterochromatic region suggested absence of function beyond sex determination. But the present problem is to accomplish whether a given sequence of nucleotides i.e. a DNA is a Human X or Y chromosomal genes or not, without any biological experimental support. In our perspective, a proper quantitative understanding of these genes is required to justify or nullify whether a given sequence is a Human X or Y chromosomal gene. In this paper, some of the X and Y chromosomal genes have been quantified in genomic and proteomic level through Fractal Geometric and Mathematical Morphometric analysis. Using the proposed quantitative model, one can easily make probable justification or deterministic nullification whether a given sequence of nucleotides is a probable Human X or Y chromosomal gene or not, without seeking any biological experiment. Of course, a further biological experiment is essential to validate it as the probable Human X or Y chromosomal gene homologue. This study would enable Biologists to understand these genes in more quantitative manner instead of their qualitative features.

Key words

human X and Y chromosomes, genes, fractal dimension, hurst exponent, protein analysis

Introduction

In the last ten years, Genomics has revolutionized the study of evolution. Evolution changes the sequence of DNA molecules, and comparing DNA sequences allow us to reconstruct evolutionary events from the past. The availability of DNA sequences from multiple vertebrates has confirmed that the process of sex chromosome evolution as envisioned by theorists has played out multiple times in the evolution of vertebrate sex chromosomes. However, complete, high-quality sequences of sex chromosomes have led to discoveries that were unanticipated by existing theory.  The next stage of genomic research will begin to derive meaningful knowledge from these genes. A quantitative genomic understanding will have a major impact in the fields of medicine, biotechnology, and the life sciences [1,2].

One of the most frontier contemporary challenges is to make a revolution in medical science by introducing Genetic Therapy [3]. Gene therapy is an experimental technique that uses genes to treat or prevent disease. This method of therapy would countenance us to treat a disorder by inserting a gene into a patient’s cells instead of using drugs or surgery. The most commonly practiced approaches of gene therapy include…

  • Replacing a mutated gene that causes disease with a healthy copy of the gene.
  • Inactivating, or “knocking out,” a mutated gene that is functioning improperly.
  • Introducing a new gene into the body to help fight a disease.

Although gene therapy is a promising treatment option for a number of diseases (including inherited disorders, some types of cancer, and certain viral infections), the technique remains risky and is still under study to make sure that it will be safe and effective. Gene therapy is currently being tested for the treatment of diseases that have no other cures [3-6]. Prior to gene therapy as a practical approach for treating diseases, we must overcome many technical challenges. One of the nontrivial challenges is to get quantitative insight of genes. This would help us in precise characterization of a particular DNA. This quantitative study of genes will be an add-on as the genetic signature of a gene.

In the present study, a mathematical quantification of human X and Y chromosomal genes [7-13] has been done by using Fractal Geometry [14-16]. So on using this proposed quantitative model, one can easily make probable justification (or deterministic nullification) whether a given sequence of nucleotides is a probable Human X/Y chromosomal genes or its homologue or not, without seeking any biological experiment. This study would help researchers in understanding these genes in differentiating from each other through the very nucleotide syntactical presentation.

Model decomposition and representation

(A) DNA 4-colored representation:  Let a DNA sequence is in the form of four-letter (ATGC) nucleotides sequence (Figure 1A). Such sequence shown in Figure 1A is converted as a function (Figure 1B) depicting colors Red, Blue, Green, and Yellow respectively for A, T, G, and C [17,18]. This allows having maximum of 4 colors, i.e. .

Figure 1a. A DNA string of four variables A T C and G of SRY.

Figure 1b. Function generated by proper colour coding for ATGC of SRY.

 (B) Binary representation: A DNA as a one dimensional nucleotide sequence, and is represented as a map such that . This mapping yields a DNA sequence in a binary string format. A portion of such a binary string is shown below of some fixed size (twice of the DNA sequence length). We call this representation of DNA as 2-adic string of DNA [18].

0011100001001010001111100000000000110000100000111100010001001111001111010111111100000100111110001011111101010100100111111110000010110010011110000000110000111100110011011001001100000000000111111110111100110011111111110100011111110111110011111111010000000000111100110000000011111010101110110000100001001111011111010011011001…..(some more 0, 1 are there in the string).

(C) 4-Adic representation: Also we consider a DNA as a string of four variables 0, 1, 2 and 3 (as shown below) corresponding to A T C and G respectively. We name this string as 4-adic string of DNA [17,18].

023010330223000002003002201010220221122200102230322211103122230032031230002002202021310200000122232202022222101222122022221000002202000022333232003010221220022120030000232230222231220212210232222202210022003301222233200010222312332322002322000030303223333000233023310233331212333003012112030200010122200303312…

(D) Threshold decomposition: We have decomposed the four-colored image into four binary images (Figures 2A-D) for a DNA through the threshold decomposition function defined as:

Figure 2. Threshold decomposed binary images of OR10AB1P (Black and white denotes complimentary space and one of the ATGC). (A) A (B) T (C) G, and (D) C.

Those decomposed binary images for one human X and Y are denoted as are shown in the following:

 (E) Skeleton decomposition: Morphological skeletons Figures 5 & 6 a-d for threshold decomposed binary images of SRY shown in (Figures 5 & 6 a-d) is obtained according to (3).

where B is a structuring element that is symmetric about the origin, and .

Density and intricacy of the skeleton for those decomposed binary images depend upon the frequency of occurrence of nucleotide chosen as threshold and their spatial distribution. The intricacy of the skeleton is proportional to the heterogeneity in the spatial distribution of the skeleton.

(F)  Protein plot representations: A number of fundamental properties namely percentage of Accessible Residues (AR), Buried Residues (BR), Alpha Helix (Chou & Fasman) (ALCF), Amino Acid Composition (ACC), Beta Sheet (Chou & Fasman) (BSCF), Beta Turn (Chou & Fasman) (BTCF), Coil (Deleage & Roux) (CDR), Hydrophobicity (Aboderin) (HA), Molecular Weight (MW),  and  Polarity (P) of the human X and Y chromosomal genes have been considered (Figure 3).

Figure 3. Skeletons of  

All protein plots are generated from the gene sequences using Matlab (bioinformatics toolbox) (Figure 4). Then box-counting dimension for each of the protein plot have been calculated through BENOITTM.

Figure 4. Accessible residues (AR-Protein Plot) of SRY

In the next section let us elaborate the methods applied to DNA string to extract the quantitative details.

Methods

The quantitative details of X and Y chromosomal genes have been studied in the light of fractal dimension. The very basic of one such fractal dimension method is Box Counting Dimension which is illustrated below.

Box-counting method: The most practical and commonly used method of calculation of fractal dimension is Box-counting dimension. This is mainly because it is easy to calculate mathematically and because it is easily estimated empirically. We note that the number of line segments of lengththat are needed to cover a line of length l is that the number of squares with side length that are needed to cover a square with area A is and that the number of cubes with side length   that are needed to cover a cube with volume V is .

So in general, the box-counting dimension (or just ``box dimension'') of a set S subset of  as follows:

For , let  be the minimum number of n-dimensional cubes of side-length ε needed to cover S. If there is a number D so that

The D is called the Box-Counting Dimension of S.

It is to be Noted that the Box-Counting Dimension is D if and only if there is some positive constant m so that

The above equation gives

Note that the log m term drops out, because it is constant while the denominator becomes infinite as ε→ 0. Also, since , is negative, so D is positive.

But in practice, this method computes the number of cells required to entirely cover an object, with grids of cells of varying size. Practically, this is performed by superimposing regular grids over an object and by counting the number of occupied cells. The logarithm of N(r), the number of occupied cells, versus the logarithm 1/r of , where r is the size of one cell, gives a line whose gradient corresponds to the box dimension [14].

FD of DNA walks of the genes

The DNA walk is defined as a sum of the progression which is the cumulative sum on the DNA string representation

Also we define . It has been resulted by plotting as we have defined two functions:

Here we compute the Fractal dimension of all DNA walk for the 4-adic string of all sex genes. The plot of the DNA walk for the SRY string is shown in Figure 5.

Figure 5. DNA Walk (Pn,Qn) for SRY

The box-counting dimension for the DNA walk of SRY is 1.94611. In the similar manner we have computed all the box counting dimension of all genes.

Hurst exponent of the DNA sequences

Hurst exponent is referred as the "index of dependence," and is the relative tendency of a time series either to regress strongly to the mean or to cluster in a direction. It is a measure of long range correlation of one-dimensional time series [19,20].

Let us consider a string

The Hurst exponent H is defined as: , where n is the length of the string. The range for which the Hurst exponent, H indicates negative, positive auto-correlation are 0<H<0.5 and 0.5<H<1 respectively. A value of H=0.5 indicates a true random walk, where it is equally likely that a decrease or an increase will follow from any particular value [20].

Here we consider 2-adic strings of DNA for computation of Hurst exponent.

The Hurst exponents of the 2-adic string of SRY are 0.69106. This is how we have computed Hurst exponent for all the human genes [18].

Succolarity

The degree of percolation of an image (how much a given fluid can flow through this image) can be measured through Succolarity, a fractal parameter [21,22].

The succolarity of a binary image is defined as

where ‘dir’ denotes direction; BS(n) where n is the number of possible divisions of a binary image in boxes. The occupation percentage (OP) is defined as, for each box size, k, the sum of the multiplications of the OP(BS(K)), where k is a number from 1 to n, by the pressure PR(BS(K),pc), where pc is the position on x or y of the centroid of the box on the scale of pressure) applied to the box are calculated. Therefore for any binary decomposed images of , the succolarity can be obtained.

Here we compute succolarity of the decomposed images for DNA as shown in the previous section. The succolarity of the four decomposed images of SRY are 0.000351, 0.000782, 0.000272 and 0.000267 respectively.

Similarly, we have computed the succolarity of the decomposed images for all sex genes.

Statistical autocorrelations

It is one of several descriptors, describing how far the values lie from the mean (expected value).

For a given sequence

and the variance at distance N-k is given as

It is easily computable that the variance for the string Figure 1(C) is 1.29.

Mean and SD ordering of gene sequences

A gene is a string constituting of different permutations of the base pairs A, C, T and G where repetition of a base pair is allowed. We can classify the miRNA sequences based on the ordering of poly-string mean of A, C, T, and G in the string. Given a string X, we calculate the mean of poly-strings consisting only of A, C, T and G separately [15,16].

and m is the length of the longest poly-string over the string.

According to the non-decreasing order of mean, we have classified all the genes into different classes. The mean order of sequence 1(A) is AUGC i.e. mean of poly-string of A is less than the same of U and so on.

Results and discussions

Let us now elaborate in detail, the result obtained for all sex genes using the above stated methods.  

Fractal dimension of DNA walk

For all the 92 human sex chromosomal genes along with their homologues in other species, the fractal dimension of the DNA walks lies in the interval (1.896, 1.948) as shown in Tables 1 and Figure 11. 

From Tables 1 and Figure 11 it is clear that the box counting dimension is cantered at 1.94. The human sex genes and their corresponding homologues have almost same box counting dimensions. Although the ordering and length of the gene sequences are different from each other but they share the same box counting dimension of DNA walk.

Table 1. Genes and Their FD of DNA Walk

Genes

FD of DNA Walk

DHRSX (Bos taurus)

1.94577

DHRSX (Homo sapiens)

1.94584

CD99 (Bos taurus)

1.94527

CD99 (Homo sapiens)

1.92993

CD99 (Pan troglodytes)

1.94588

CD9912 (Mus musculus)

1.94584

ZBED1 (Bos taurus)

1.94577

ZBED1 (Homo sapiens)

1.92829

PRKX (Homo sapiens)

1.94595

PRKX (Pan troglodytes)

1.94573

PRKY (Homo sapiens)

1.94595

NLGN4Y (Homo sapiens)

1.90746

NLGN4Y (Pan troglodytes)

1.94615

VCX2 (Homo sapiens)

1.89608

VCX2 (Pan troglodytes)

1.94592

TBL1Y (Homo sapiens)

1.94556

TBL1Y (Pan troglodytes)

1.94579

SC25A6 (Homo sapiens)

1.9298

SC25A6 (Mus muscullus)

1.9458

ASMTL (Gallus gallus domesticus)

1.9457

ASMTL (Homo sapiens)

1.92995

ASMTL (Canis lupus)

1.9458

ASMT (Homo sapiens)

1.9258

ASMT (Gallus gallus domesticus)

1.9459

IL3RA (Pan troglodytes)

1.9458

IL3RA (Mus muscullus)

1.9458

HSFX1 (Homo sapiens)

1.92828

HSFX1 (Pan troglodytes)

1.9454

HSFY2 (Homo sapiens)

1.92982

IL9R (Homo sapiens)

1.92982

IL9R (Pan troglodytes)

1.94576

IL9R (Macca mullata)

1.94571

SYBL1 (Homo sapiens)

1.92956

SYBL1 (Macca mullata)

1.9457

SPRY3 (Homo sapiens)

1.92964

SPRY3 (Macca mullata)

1.93826

Figure 11. Descriptive Statistics of FD of DNA Walk

Hurst exponent of 2-adic DNA strings

We have calculated the Hurst exponent of all 2-adic strings of DNA for all the human sex genes including their homologues. Hurst exponent lies in the interval (0.5559, 0.98187) and therefore the binary strings of all genes are positively auto-correlated (Figure 12).

Figure 12. Descriptive Statistics of Hurst exponent (2-adic)

The Hurst exponents for all human sex genes including their corresponding homologue have same as reflected in (Figure 12).

Succolarity indices

Succolarity measures how much a given fluid can flow through an image, considering as obstacles the set of pixels with a defined color (e.g. white) on 2D images analysis. In other words, it is a measure of continuous density of a 2D pattern.

The succolarity of A for all sex genes lies in the interval (0.000003, 0.2584) and so it is evident that the texture of A for each sex gene is having less density (Figure 1).

The succolarity indices of the genomic textures of T of all the human sex genes with their corresponding homologues are spread over the interval (0.000001, 0.3077) (1). It is seen that the succolarity i.e. the continuous density of the genomic texture of T is very low as it is seen in case of the genomic texture A. The succolarities for all these 93 genes are centred at 0.03 and there is no much deviation among the succolarity indices.

In case of the genomic texture G, The succolarity indices of all sex genes lie in the interval (0, 1.78) as shown in Figure 6. The succolarity indices of the genomic texture of C for all genes are computed. It is observed that the indices range from 0 to 0.2459.

Figure 6. Descriptive Statistics of Succolarities of genomic texture A, T, C and G.

It is seen that the human sex genes and their corresponding homologues share almost same succolarity. Also the succolarity indexes of human sex genes are higher than their corresponding homologues of other species. For an example, the succolarity of A of human sex gene SOY is greater than Macaca mulatta’s SOY of monkey and chimpanzee.

The correlation among succolarities of A, T, C and G are illustrated in the Table 2.

Table 2. Correlation coefficients for Succolarities of A, T, C and G

2021 Copyright OAT. All rights reserv

Variables

SUC-A

SUC-T

SUC-C

SUC-G

SUC-A

1

0.994016

0.974306

0.244261

SUC-T

0.994016

1

0.957539

0.239863

SUC-C

0.974306

0.957539

1

0.251441

SUC-G

0.244261

0.239863

0.251441

1

The correlation coefficient between the Suc-A and Suc-T is high and in contrast the correlation coefficient between the Suc-G and Suc-C is low.

Statistical autocorrelations

The statistical autocorrelations (σ) of the 4-adic representations of all sex genes of human along with their corresponding homologues are being determined. It has been found that the  values lie in the interval (1.06, 1.945).

The sigma values of all sex genes of human and their homologues are normally distributed as shown in the Figure 13.

Figure 13. Descriptive statistics of statistical autocorrelations

Fractal dimensions of threshold decomposition matrices

Here we considered the four different threshold decomposition matrices namely the template of A, T, C and G as we did in the 1.1 (D) for each of the sex-genes (Figure 6). Then we have determined the fractal dimension of the threshold decomposed matrices. 

In the Figure 14, it is seen that the fractal dimension of the template of A, T, G and C lies in the interval (1.71, 1.81), (1.72, 1.89), (1.70, 1.90) and (1.74, 1.87) respectively.

Figure 14. Descriptive statistics of fractal dimension of threshold decompositions

The fractal dimensions of the decomposed template of A and T follow normal distribution whereas fractal dimensions of the other templates do not follow the same.

From the Figure 7, it is seen that the fractal dimensions of the threshold decomposition matrices for A, T, G and C of genes ZFX (from Human X- chromosome) and ZFY (from Human Y-chromosome) are almost same although the genomic template are entirely different in terms of ordering of nucleotides. The aforesaid fact holds good for all the one to one corresponding genes from X and Y chromosomes.  

Figure 7. Histograms of Fractal dimension of threshold decompositions.

Fractal dimensions of skeleton of threshold decompositions

In the earlier subsection, the fractal dimensions of threshold decomposition matrices are found. Let us now find out the fractal dimension of the morphological skeleton of all decomposed threshold matrices. 

In the Figure 15, it is determined that the fractal dimensions of the skeletons of template of A, T, G and C lies in the interval (1.74, 1.80), (1.66, 1.78), (1.65, 1.77) and (1.66, 1.87) respectively. The fractal dimensions of the skeleton of decomposed templates of A and T are normally distributed whereas fractal dimensions of the others template do not follow the same as it happened for threshold decomposition matrices.

Figure 15. Descriptive statistics of fractal dimension of skeleton of threshold decompositions

In cases of ZFX and ZFY genes, the fractal dimensions of the morphological skeletons of the decomposed threshold matrices are almost same. Interestingly, the same is true for all human X and Y chromosomal genes namely as it is evident from the quantitative details (Supp. Met. 1) (Figure 8).

Figure 8. Histograms of Fractal dimension of threshold decompositions.

Fractal dimension of protein plots of genes

The fractal dimension of the protein plots of Accessible Residues (AR), Buried Residues (BR), Alpha Helix (Chou & Fasman) (ALCF), Amino Acid Composition (ACC), Beta Sheet (Chou & Fasman) (BSCF), Beta Turn (Chou & Fasman) (BTCF), Coil (Deleage & Roux) (CDR), Hydrophobicity (Aboderin) (HA), Molecular Weight (MW),  and  Polarity (P) of  the human X and Y chromosomal genes are lie in the interval (1.81, 1.94), (1.77, 1.94), (1.36, 1.94), (1.79, 1.94), (1.79, 1.94), (1.78, 1.94), (1.84, 1.94), (1.79, 1.94), (1.81, 1.94), (1.81, 1.94), (1.88, 1.94) and (1.81, 1.94) respectively as shown in the descriptive statistics Figure 9.

Figure 9. Descriptive Statistics of fractal dimension of Protein Plots

The fractal dimensions of the protein plots of Accessible Residues (AR), Buried Residues (BR), Amino Acid Composition (ACC), Beta Sheet (Chou & Fasman) (BSCF), Beta Turn (Chou & Fasman) (BTCF), Molecular Weight (MW) and Polarity (P) are following normal distribution (Figure 9).

In cases of the gene pairs (ZFX, ZFY) and (UTX, UTY), the fractal dimensions are agreed almost for all of the protein plots except one or two (Figure 10). In our conviction, these non-agreements make them different as X and Y chromosomal gene. Interestingly, this fact is true for all the X and Y chromosomal homologues of human as evident from the quantitative data (Supp. Met-2).

Figure 10. Fractal dimension of Protein Plots of ZFX, ZFY and UTX, UTY.

Quantitative classification of human sex genes

All human sex genes including their homologues are classified based on the order of polystring mean () and polystring standard deviation (). The genes are tabulated in the Suppl. Met. 3A and Supp. Met. 3B. The polystring mean ( ) and polystring standard deviation ()of the human sex gene pair (UTX, UTY) is  same as tabulated in the Suppl. Met. 3A. and Supp. Met. 3B. In contrary, the gene pair (ZFX, ZFY) has the same polystring mean order but polystring SD order of ZFX is TAGC whereas the same of ZFY is TACG.  Even for the homologues of CD99 in three different species namely Homo sapiens, Pan troglodytes and Bos taurus have the same polystring mean order namely TGCA but the polystring SD of CD99 of Pan troglodytes and Bos taurus same but CD99 for human is different from other two. This fact reveals that the ordering of nucleotides in a gene is most important feature to distinguish them from each other and make them unique. The above phenomena are equally true for other human sex genes and their homologues.

Conclusion and future endeavours

In this paper, a quantitative and deterministic detail is adumbrated through which a given string of nucleotides can be inferred as a human X or Y chromosomal gene without seeking any biological experiment. This would help us in screening any given stretch of nucleotides of specific length as a Human sex-gene homologue. This quantitative detail of genomic imprints of sex genes would enable biologist to understand them in more precise way from the very genomic composition level and these understanding are the next challenge of current Genomics. It is noted that the proposed deterministic model is not only meant for human sex genes or its homologue but also can be treated as a standard prototype for other genes and genomes. In our future endeavours we would like to validate the model through the biological experiment.

Authors contributions

Sk. S. Hassan conceptualized the problem and experiments and performed entire research with the rest of the authors of the article.  

Acknowledgements

The authors are indebted to his colleague Dr. Sudhakar Sahoo and the former Director Professor Swadheenananda Pattanayak of Institute of Mathematics & Applications, Bhubaneswar for their kind help and suggestions.

Conflicts of interest

We all authors of the manuscript certify that there is no conflict of interest with any financial organization regarding the material discussed in the manuscript.

References

  1. Watson JD (1990) The human genome project: past, present, and future. Science 248: 44-49. [Crossref]
  2. Sawicki MP, Samara G, Hurwitz M, Passaro E Jr (1993) Human Genome Project. Am J Surg 165: 258–264.
  3. Collins FS, Morgan M, Patrinos A (2003) The Human Genome Project: Lessons from Large-Scale Biology. Science 300: 286-290. [Crossref]
  4. Collins FS, McKusick VA (2001) Implications of the Human Genome Project for medical science. JAMA 285: 540-544. [Crossref]
  5. Friedmann T, Roblin R (1972) Gene therapy for human genetic disease? Science 175: 949-955. [Crossref]
  6. U. S. National Library of medicine (2012) “Genetics Home Reference”.
  7. Andrea Ballabio, David Nelson and Steve Rozen (2006) Genetics of disease: The sex chromosomes and human disease. Curr Opin Genet Dev 16: 1–4. [Crossref]
  8. Zhao L, Koopman P (2012) SRY protein function in sex determination: thinking outside the box. Chromosome Res 20: 153-162. [Crossref]
  9. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, et al. (2001) Initial sequencing and analysis of the human genome. Nature 409: 860-921. [Crossref]
  10. Chandra HS (1985) Is human X chromosome inactivation a sex-determining device? Proc Natl Acad Sci U S A 82: 6947-6949. [Crossref]
  11. Darling SM, Banting GS, Pym B, Wolfe J, Goodfellow PN (1986) Cloning an expressed gene shared by the human sex chromosomes. Proc Natl Acad Sci U S A 83: 135-139. [Crossref]
  12. Lu X, Wu CI (2011) Sex, sex chromosomes and gene expression. BMC Biol 9: 30. [Crossref]
  13. Kent J, Wheatley SC, Andrews JE, Sinclair AH, Koopman P (1996) A male-specific role for SOX9 in vertebrate sex determination. Development 122: 2813-2822. [Crossref]
  14. Mandelbrot BB (1982) "The fractal geometry of nature". New York, ISBN 0-7167-1186-9.
  15. Avnir D (1998) "Is the geometry of Nature fractal". Science 279: 39.
  16. Develi K, Babadagli T (1998) Quantification of natural fracture surfaces using fractal geometry. Math Geology 30: 971-998.
  17. Hassan SS, Choudhury PP, Goswami A (2013) Underlying mathematics in diversification of human olfactory receptors in different loci. Interdiscip Sci 5: 270-273. [Crossref]
  18. Hassan SS, Pal Choudhury P, Daya Sagar BS, Chakraborty S, Guha R, et al. (2011) Quantitative Description of Genomic Evolution of Olfactory Receptors, (Communicated).
  19. Carlo C (2010) Fractals and Hidden Symmetries in DNA. Math Prblm in Engng 507056.
  20. Zu-Guo Y (2002) Fractals in DNA sequence analysis. Chinese Physics 11: 1313-1318.
  21. de Melo RHC, Conci A (2008) Succolarity: Defining a Method to calculate this Fractal Measure." ISBN: 978-80-227-2856-0 291-294.
  22. Deschavanne PJ, Giron A, Vilain J, Fagot G (1999) Genomic signature: characterization and classification of species assessed by chaos game representation of sequences. Mol Biol Evol 16: 1391-1399. [Crossref]

Editorial Information

Editor-in-Chief

Bianciardi Giorgio
University of Siena

Article Type

Research Article

Publication history

Received date: October 25, 2016
Accepted date: December 16, 2016
Published date: December 19, 2016

Copyright

© 2016 Sarif Hassan SK. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Citation

Sarif Hassan SK, Pal Choudhury P, Sengupta A, Sahu B, Mishra R, et al. (2016) A quantitative understanding of human sex chromosomal genes. Fractal Geometry and Nonlinear Anal in Med and Biol 1: DOI: 10.15761/FGNAMB.1000137

Corresponding author

Sk Sarif Hassan

Department of Mathematics, University of Petroleum and Energy Studies, Dehradun, India.

E-mail : bhuvaneswari.bibleraaj@uhsm.nhs.uk

Figure 1a. A DNA string of four variables A T C and G of SRY.

Figure 1b. Function generated by proper colour coding for ATGC of SRY.

Figure 2. Threshold decomposed binary images of OR10AB1P (Black and white denotes complimentary space and one of the ATGC). (A) A (B) T (C) G, and (D) C.

Figure 3. Skeletons of  

Figure 4. Accessible residues (AR-Protein Plot) of SRY

Figure 5. DNA Walk (Pn,Qn) for SRY

Figure 6. Descriptive Statistics of Succolarities of genomic texture A, T, C and G.

Figure 7. Histograms of Fractal dimension of threshold decompositions.

Figure 8. Histograms of Fractal dimension of threshold decompositions.

Figure 9. Descriptive Statistics of fractal dimension of Protein Plots

Figure 10. Fractal dimension of Protein Plots of ZFX, ZFY and UTX, UTY.

Figure 11. Descriptive Statistics of FD of DNA Walk

Figure 12. Descriptive Statistics of Hurst exponent (2-adic)

Figure 13. Descriptive statistics of statistical autocorrelations

Figure 14. Descriptive statistics of fractal dimension of threshold decompositions

Figure 15. Descriptive statistics of fractal dimension of skeleton of threshold decompositions

Table 1. Genes and Their FD of DNA Walk

Genes

FD of DNA Walk

DHRSX (Bos taurus)

1.94577

DHRSX (Homo sapiens)

1.94584

CD99 (Bos taurus)

1.94527

CD99 (Homo sapiens)

1.92993

CD99 (Pan troglodytes)

1.94588

CD9912 (Mus musculus)

1.94584

ZBED1 (Bos taurus)

1.94577

ZBED1 (Homo sapiens)

1.92829

PRKX (Homo sapiens)

1.94595

PRKX (Pan troglodytes)

1.94573

PRKY (Homo sapiens)

1.94595

NLGN4Y (Homo sapiens)

1.90746

NLGN4Y (Pan troglodytes)

1.94615

VCX2 (Homo sapiens)

1.89608

VCX2 (Pan troglodytes)

1.94592

TBL1Y (Homo sapiens)

1.94556

TBL1Y (Pan troglodytes)

1.94579

SC25A6 (Homo sapiens)

1.9298

SC25A6 (Mus muscullus)

1.9458

ASMTL (Gallus gallus domesticus)

1.9457

ASMTL (Homo sapiens)

1.92995

ASMTL (Canis lupus)

1.9458

ASMT (Homo sapiens)

1.9258

ASMT (Gallus gallus domesticus)

1.9459

IL3RA (Pan troglodytes)

1.9458

IL3RA (Mus muscullus)

1.9458

HSFX1 (Homo sapiens)

1.92828

HSFX1 (Pan troglodytes)

1.9454

HSFY2 (Homo sapiens)

1.92982

IL9R (Homo sapiens)

1.92982

IL9R (Pan troglodytes)

1.94576

IL9R (Macca mullata)

1.94571

SYBL1 (Homo sapiens)

1.92956

SYBL1 (Macca mullata)

1.9457

SPRY3 (Homo sapiens)

1.92964

SPRY3 (Macca mullata)

1.93826

Table 2. Correlation coefficients for Succolarities of A, T, C and G

Variables

SUC-A

SUC-T

SUC-C

SUC-G

SUC-A

1

0.994016

0.974306

0.244261

SUC-T

0.994016

1

0.957539

0.239863

SUC-C

0.974306

0.957539

1

0.251441

SUC-G

0.244261

0.239863

0.251441

1