back to publications overview

Origins of Scaling in Genetic Code


Oliver Obst, Mikhail Prokopenko, and Daniel Polani. Origins of Scaling in Genetic Code. In Proceedings of the European Conference on Artificial Life (ECAL), Lecture Notes in Artificial Intelligence, Budapest, 2009. In press.


Download

[PDF] (141 kb)


Abstract

The principle of least effort in communications has been shown, by Ferrer i Cancho and Solé, to explain emergence of power laws (e.g., Zipf's law) in human languages. This paper applies the principle and the information-theoretic model of Ferrer i Cancho and Solé to genetic coding. The application of the principle is achieved via equating the ambiguity of signals used by ``speakers'' with codon usage, on the one hand, and the effort of ``hearers'' with needs of amino acid translation mechanics, on the other hand. The re-interpreted model captures the case of the typical (vertical) gene transfer, and confirms that Zipf's law can be found in the transition between referentially useless systems (i.e., ambiguous genetic coding) and indexical reference systems (i.e., zero-redundancy genetic coding). As with linguistic symbols, arranging genetic codes according to Zipf's law is observed to be the optimal solution for maximising the referential power under the effort constraints. Thus, the model identifies the origins of scaling in genetic coding --- via a trade-off between codon usage and needs of amino acid translation. Furthermore, the paper extends Ferrer i Cancho -- Solé model to multiple inputs, reaching out toward the case of horizontal gene transfer (HGT) where multiple contributors may share the same genetic coding. Importantly, the extended model also leads to a sharp transition between referentially useless systems (ambiguous HGT) and indexical reference systems (zero-redundancy HGT). Zipf's law is also observed to be the optimal solution in the HGT case.


BiBTeX Entry


@InProceedings{   OPP09,
  address	= {Budapest},
  author	= {Oliver Obst and Mikhail Prokopenko and Daniel Polani},
  booktitle	= {Proceedings of the European Conference on Artificial Life
		   (ECAL)},
  note		= {In press.},
  number	= {5777, 5778},
  series	= {Lecture Notes in Artificial Intelligence},
  title 	= {Origins of Scaling in Genetic Code},
  year		= {2009},
  abstract	= {The principle of least effort in communications has been
		   shown, by Ferrer i Cancho and Sol{\'e}, to explain emergence of power laws
		   (e.g., Zipf's law) in human languages. This paper applies the principle and
		   the information-theoretic model of Ferrer i Cancho and Sol{\'e} to genetic
		   coding. The application of the principle is achieved via equating the
		   ambiguity of signals used by ``speakers'' with codon usage, on the one hand,
		   and the effort of ``hearers'' with needs of amino acid translation
		   mechanics, on the other hand. The re-interpreted model captures the case of
		   the typical (vertical) gene transfer, and confirms that Zipf's law can be
		   found in the transition between referentially useless systems (i.e.,
		   ambiguous genetic coding) and indexical reference systems (i.e.,
		   zero-redundancy genetic coding). As with linguistic symbols, arranging
		   genetic codes according to Zipf's law is observed to be the optimal solution
		   for maximising the referential power under the effort constraints. Thus, the
		   model identifies the origins of scaling in genetic coding --- via a
		   trade-off between codon usage and needs of amino acid translation.
		   Furthermore, the paper extends Ferrer i Cancho -- Sol{\'e} model to multiple
		   inputs, reaching out toward the case of horizontal gene transfer (HGT) where
		   multiple contributors may share the same genetic coding. Importantly, the
		   extended model also leads to a sharp transition between referentially
		   useless systems (ambiguous HGT) and indexical reference systems
		   (zero-redundancy HGT). Zipf's law is also observed to be the optimal
		   solution in the HGT case.},
}