Friday, May 15, 2020

20 - 019 Has the Human on a Smartphone got nearer?


The idea that a human could be fully defined digitally on a Smartphone, the thesis of the Digital Human Book, just got a little nearer. But only a little.

NVIDIA an American advanced chip designer previously focussed upon graphics processing chips and now entering the artificial intelligence marketplace has released their DGX A100 AI chip containing 54 billion transistors with a 5 petaflops performance. They are in the same marketplace as the other major chip designers and manufacturers Intel, AMD, Qualcomm and ARM.

It uses their Ampere architecture which is a single micro architecture for its Graphics Processing Unit’s (GPU) supporting both commercial AI and consumer graphics usage. Their specialist capabilities in support of graphics processing have now been applied to the broader capabilities required by artificial intelligence (AI) processing. At the end of the day it is all about the processing of on-off switches be they used for graphics or AI.

The DGX A100 is a 7 nanometer chip produced to this higher concentration of transistors but still on a 2D wafer. Significantly the current Apple A12 chip fitted to the iPhone XS, XS Max and XR and the 2019 iPad Air and 5th Generation iPad Mini is also a 7 nanometer chip produced by the Taiwan Semiconductor Manufacturing Company (TSMC) for Apple in Taiwan. But this only has 6.9 billion transistors. But it does show how near to the leading edge chip technologies your smartphone is getting in terms of its capabilities. When applied to a Smartphone these can be in your pocket now.

But to emulate a Digital Human on a Smartphone we are still probably a factor of several 1000’s if not millions off the processing power that will be needed. But it does show how amazing the biological human is in terms of sheer digital processing power. Unless how we humans actually do it takes us beyond digital (just on-off) into another dimension. Certainly we are also not limited to a 2D wafer like current chips with our storage running to three dimensions (3D) as per the brain. But our digital world is progressing although no doubt in 50 year’s time (2070) we will look back amazed at how crude our digital technologies were in 2020. We will be asking what’s a chip? What’s a Smartphone?

20 - 018 Something different - Consilience


Today we need to leave the biological sciences to zoom upwards and take a much higher perspective. Or oddly enough in this case zoom down to a much lower perspective. All will be revealed.

The book title A Digital Human already eludes to this way of thinking. It looks at the biological human as a digital device. In fact consilience means a unity of knowledge. In truth this is an old word that has gone out of fashion in the 21st Century belonging more to the 19th Century when the sciences were racing ahead establishing their credibility. But it turns out that the book A Digital Human is really about consilience. Maybe a more relevant term maybe consolidation. It is the consolidation of everything to the digital foundation. Put simply the consolidation of everything to the on- off switches that are the building blocks of all things electronic. So why the shift towards the digital?



It is necessary for me as the author of this book, currently a work in progress, to stop and remind myself of the book’s objectives. To me the clear objective of a book is to take you as the reader and lead you through a number mental change processes so by the end of the book you think differently on the subject covered. To do this here I want to briefly consider three authors who cemented in my mind the subject of consilience. They changed my thought processes let us see if they change yours.

Firstly Edward O. Wilson an American biologist born 1929. He published a book called “Consilience” (1998). He quotes from William Whewell in his 1840 synthesis “The Philosophy of Inductive Sciences” as the first to speak of consilience.

To quote:-

“The Consilience of Inductions takes place when an Induction, obtained from a class of facts, coincides with an Induction, obtained from another different class. This Consilience is a test of the truth of the Theory in which it occurs.” Replace the word “induction” with “experiment” to appreciate a 21st Century interpretation.

Then to quote from Wilson’s book:-

“Disciplinary boundaries within the natural sciences are disappearing, to be replaced by shifting hybrid domains in which consilience is implicit. These domains reach across many levels of complexity, from chemical physics and physical chemistry to molecular genetics, chemical ecology and ecological genetics.”

Secondly Bruce F.Katz with his book “Neuroengineering The Future” (2008) in which he quotes William of Occam in the 11th Century where “entities should not be multiplied beyond necessity”. He also quotes Einstein’s famous dictum that a theory should be as simple as possible, but no simpler.

Then to quote fromKatz’s book:-

“…the simpler the theory, the greater its ability to generalise to new phenomena. Complex theories arise because they tend to have many special cases, each encompassing a disparate set of results. Each aspect of the theory covers one and only one such set. However, if a commonality can be found that ranges over these results, it will often be the case that it also explains as yet unseen empirical data.”

This is where Artificial Intelligence (AI) is going to be able to take us once everything is successfully transformed into their digital foundations.

Thirdly William Sims Bainbridge with his book “Nanoconvergence” (2007) brings it all together with the quote in the book taken from his 2000 report “Societal Implications of Nanoscience and Nanotechnology” edited by Mike Roco and himself :-

“A revolution is occurring in science and technology, based on the recently developed ability to measure, manipulate and organise matter on a nanoscale – 1 to 100 billionths of a meter. At the nanoscale, physics, chemistry, biology, materials science and engineering converge toward the same principles and tools. As a result progress in nanoscience will have a very far reaching impact.”

Followed by a more comprehensive statement in a 2003 Converging Technologies for Improving Human Performance Report:-

“We stand at the threshold of a new renaissance in science and technology, based on a comprehensive understanding of the structure and behaviour of matter from the nanoscale up to the most complex system yet discovered, the human brain. Unification of science based unity in nature and its holistic investigation will lead to technological convergence and a more efficient societal structure for reaching human goals. In the early decades of the twenty-first century, concentrated effort can bring together nano-technology, biotechnology, information technology and the new technologies based upon cognitive science. With proper attention to ethical issues and societal needs, the result can be a tremendous improvement in human abilities, new industries and products, societal outcomes, and quality of life.”

So as an author have I started to change your thinking processes towards everything moving towards a digitally based foundation where artificial intelligence (AI) can be used to seek out many of the unknowns spanning the different scientific disciplines? Will it all dissolve down to some simple rules as Einstein predicted. Or not?

Saturday, May 9, 2020

20 - 017 Human Gene Codes


As the armchair biologist reading all manner of resources on genetics you soon keep coming across codes like FMR1, CCR5 and FOXP2 referred to in many of the articles. What is going on? These are what I have termed Human Gene Codes. 

Well this is how all the genes within the human genome have been named individually. They are little bits of coherent code that define how something is finally made in your body. But just to set the context correctly these types of codes are also used for other genomes not just humans. They exist for mice, fruit flies and peas in fact for any living organism. Anything where DNA exists defining the master plan for making that organism. Mice, fruit flies and peas have been listed because, believe it or not, most of the early genetic research focussed on these three living things. Mainly because all three reproduced quickly best supporting this type of research that needed to be undertaken over many of the living thing generations.

Now just deviating slightly off from the subject of Gene Codes we need to just give ourselves an appreciation of how small everything is in the world of genetics. Watching an ant running around a paving slab in the hot sun measuring between 2-3 mm in length reminds you that stored within all his living cells is DNA that defines the building of the ant. In fact the ant has all the same capabilities as a human. Head, brain, nervous system, digestive system, legs, eyes and so forth. The majority of the DNA will be identical to that of a human except in the process of evolution the DNA went down the insect branch rather than the mammal branch so some differences will exist. So this should allow you to better visualise how big a gene is since in the ant and the human the actual gene is the same size.

So what about the documentation of genes that go across all species. This is a specialist area in bioinformatics called Gene Ontology (GO). So let us just confirm at bioinformatics is the interdisciplinary field that develops methods and software tools for understanding biological data. So bioinformatics is the study of the Digital Human in an informal way. Gene Ontology (GO) looks to maintain and develop a controlled vocabulary (the FMR letter codes) of genes and gene products attributes. Now just to complicate matters the Gene Ontology (GO) is part of a larger classification effort, the Open Biomedical Ontologies, being one of the initial Candidate Members of the Open Biological and Biomedical Ontologies (OBO) Foundry. Currently there are more than hundred ontologies that follow the OBO Foundry principles. (http://obofoundry.org) .

Now do not get too confused by all these names of bodies looking to mark up biological data. Firstly the term ontology is not anything to do with biology. It is more to do with philosophy. It is about defining entities (ie things) that exist by giving them a name or label. The grouping of like things is part of the process then often establishing hierarchies of related things. So for the terms used say in banking or retailing or manufacturing or logistics you could create an ontology. Think of it being like an index or glossary. Now the fact biology is now getting around to drawing up ontologies is significant because it is a prerequisite to establishing data processing, information processing and artificial intelligence processing. It is getting the building blocks of data analysis in place. But with the biological sciences it is much more complex than anything in the commercial computing world.

So next time you see say “FOXP2” can I suggest you use the Genetics Home Reference site search to look it up at http://ghr.nlm.nih.gov . So try looking at FM03 which causes Trimethylaminuria the genic disorder that makes you smell like a fish. This site contains a huge list of these Gene Codes. So next time you read something and a code like “FREM2” comes up use this website to look up the details.

Friday, May 8, 2020

20 - 016 DNA - RNA - Proteins


It is very important that when you are trying to understand something that you first establish in your mind a simple model that you can then drill down from to more detailed lower level models. You can call these models, diagrams, frameworks or process flows. To support understanding the best ones are hierarchical because at the top level it is like a summary or precise which is then linked to lower models which serve to expand on the detail. The author is from a professional computer background where the design of computer systems has always been dependent on the use of such modelling or process design techniques. From the top level Enterprise Architectures (EA) down through to the Unified Modelling Language (UML) these are the established techniques for designing and building computer systems. Now being only an armchair biologist and chemist for me to understand these subjects I have to apply my digital methodologies to them. This is why it is called the Digital Human.

So being an armchair biologist and chemist I have a preference for locating models that have been created by experts within these professions that coincide with the types of model I would have been familiar with or created in my digital profession. So I have located just such a diagram below which is copyrighted to Nature Education (2014). 








So in this one diagram it at a high level establishes the process flow from DNA through RNA and to Protein creation. The protein is the living cell. It does not go into the detail of DNA transcription to RNA nor the Translation from RNA to the Proteins. At this stage we do not want any more detail or complexity (eg mRNA). We just want simplicity. So in simple terms the DNA which is like a roadmap is transcriped into a simpler linear sequences of RNA which become the recipes for making proteins by sequencing the amino acids within these proteins. It is important that you get this very simple model understood. We will enlarge on the detail as we drill down into this model later in the book.

Sunday, May 3, 2020

20 - 015 Is cononavirus like a computer virus?





Is coronavirus like a computer virus?

So is coronavirus like a computer virus? Well obviously not.  Coronavirus is obviously biological in that it consists of living bits of chemical called proteins. Whilst a computer virus  is a non-living object. Now the thesis of this Digital Human Book is that everything biological and living can be represented in digital code. Essentially thousands of bits (like light bulbs) that are either switched on or off in the computer’s memory. Now if you projected all these bits on to the side of a building you could see many different patterns in it. When these small patterns look the same you have within these masses of bits identified a common object. It could be the digital representation of your face used in face recognition. Or for coronavirus it would be the digital pattern that has been identified firstly in China and subsequently at many locations throughout the world. In fact all the current coronavirus testing takes the test swab and through a process called the “polymerase chain reaction test” detects the virus genetic information. Now the interesting challenge would be to reverse engineer the process where this genetic digital information could be generated back into a biological virus. From computer code back to a living thing. 












Friday, May 1, 2020

20 - 014 Coronavirus Networking and Data Sharing


What a changed world we now live in with the internet now supporting global networking and data sharing. It allows anyone to become involved in what would have previously been hidden areas of research. Coronavirus is the best example to date of the benefits that the internet offers in terms of instant internet networking and data sharing. The openness now applied to this type of “save the human race” research with the removal of all commercial and governmental secrecy (as far as I know) has allowed the ordinary individual to become party to these events in real time. Now it is accepted that what you get to read and research being “first hand” is often far too technical to understand. This in my opinion is a good thing since it is not always good for you to only read what the media have decided to simplify for your consumption. The simplification detracts from the complexity which allows social media pundits to draw very simplified conclusions and propose their own direction of travel without knowing the underlying detail.

So in the case of coronavirus what can we get access to that may have been previously kept away from the public view?


One of my popular websites on viruses is http://virological.org where I can gather around the coffee machine with the world’s virologists for a morning chat. Unfortunately like many a chat what I get to read one day maybe gone the next. Best to remember the internet has its own fluidity. Someone on the other end is always moving things around to confuse you. Always look to capture it on own your own digital hardware. I am not even a full convert to the Cloud Architecture. Remember they are businesses running them. Sorry for about the gripe. But it is a blog !!!. So moving on with the subject in hand.


So I got to see the original posting from edward_holmes as introduced below :-


“10th January 2020. This posting is communicated by Edward C. Holmes, University of Sydney on behalf of the consortium led by Professor Yong-Zhen, Fudan University, Shanghai.”

It opens with “The Shanghai Public Health Clinical & School of Public Health, in collaboration with the Central Hospital of Wuhan etc etc …………is releasing a coronavirus genome from a case of respiratory disease from the Wuhan outbreak. The sequence has also been deposited on GenBank (accession MN908947 20.2k) and will be released as soon as possible.”


So this was post by Professor Edward Holmes working at the University of Sydney. He is an expert on infectious diseases particularly RNA viruses that jump boundaries to infect humans. He can be located on http://scholar.google.com.au giving you a link to his paper published in 2020 “A new coronavirus associated with human respiratory disease in China” with excellent links to published articles based upon the paper.


Now the link on the post lead to go to http://ncbi.nlm.nih.gov so then follow the path NCBI>DNA7RNA>Nucleotide Database where you can actually view the complete underlying genome code for “Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1 complete genome”. 

This site is the National Centre for Biotechnology Information in the USA. Now GenBank is the NIH genetic sequence database of publicly available DNA sequences. GenBank is part of the International Nucleotide Sequence Database Collaboration with Japan and Europe. These international centres process such data on a daily basis so it is real time research.



Now if you like searching through data that that means nothing to you this is the place to visit. Maybe it is my computer background but I enjoy just searching through data and the less I understand it the better. The internet now allows you to become involved in any subject and at any level of research. So today I am an amateur virologist. But what it does achieve is to make you aware of the complexity that the normal media looks to avoid in their published stories. In my case it always triggers me to want to learn more so once again the internet becomes invaluable. Just Google or Wikipedia every word you do not understand. Even buy a book or even if you are digitally dependant an eBook off Amazon. But alongside this information the key difference these days is you can have access to the first hand sources on the subject and in real time see how things are unfolding. In fact it pays to pick a few experts to follow where their bio and writings make you want to make them your new best research friend.


The big difference is with coronavirus as the current subject matter under the spotlight it is uncharted and things are unfolding in real time. This along with it being relevant to everyone, because anyone can catch it and die, makes it currently the ideal subject to have access to this inside information over the internet. Fortunately the scientists working on solving the problems are adopting currently a none commercial open systems approach freely sharing their information on open access websites. Within 12 days of the Chinese sharing the digital genome of coronavirus over the internet an American team had shared an analysis showing the mechanism by which the coronavirus entered human cells. Then within 12 hours scientists in China had shown there analysis was correct. The speed of data interchange about the virus was starting to match the speed at which the virus was spreading. But unfortunately developing the solutions would take a lot longer.