Movie Genome
Encyclopedia
The Movie Genome is an approach to indexing movies based on attributes in order to create movie catalogs with extensive, detailed data about each title.
, a scientific project to identify and map all human genes
. Similarly, a Movie Genome, as used by semantic movie discovery engine Jinni , identifies and indexes multiple “genes” (elements and aspects) of a movie.
A comparable initiative is the Music Genome Project
, intended to "capture the essence of music
at the fundamental level.” The Music Genome technology is used by Pandora
to play music for Internet users based on their preferences.
Movie Genome attributes might include mood, tone, plot, and structure. Jinni’s Movie Genome has a taxonomy
created by film professionals, while titles are automatically indexed using a mixture of metadata
and reviews and a proprietary Natural Language Processing solution to assign semantic tags to content and users.
, which takes a meaning-based approach to interpreting queries by identifying concepts within the content, rather than keywords. The data about each title in a Movie Genome can also support an item-based recommendation engine that recommends based on similarities between content items and users’ preferred “genes.” By contrast, collaborative filtering
is used to make recommendations based on statistical similarities in preferences between users.
The concept of “genes” or “DNA” has also been applied to other types of entertainment. For example, GamerDNA
has a database of games that locates games based on gameplay elements such as setting, tone and game mechanics.
. The technology is used to power a semantic discovery engine for movies and TV shows.
Description
The Movie Genome concept is borrowed from the Human Genome ProjectHuman Genome Project
The Human Genome Project is an international scientific research project with a primary goal of determining the sequence of chemical base pairs which make up DNA, and of identifying and mapping the approximately 20,000–25,000 genes of the human genome from both a physical and functional...
, a scientific project to identify and map all human genes
Gênes
Gênes is the name of a département of the First French Empire in present Italy, named after the city of Genoa. It was formed in 1805, when Napoleon Bonaparte occupied the Republic of Genoa. Its capital was Genoa, and it was divided in the arrondissements of Genoa, Bobbio, Novi Ligure, Tortona and...
. Similarly, a Movie Genome, as used by semantic movie discovery engine Jinni , identifies and indexes multiple “genes” (elements and aspects) of a movie.
A comparable initiative is the Music Genome Project
Music Genome Project
The Music Genome Project was first conceived by Will Glaser and Tim Westergren in late 1999. In January 2000, they joined forces with Jon Kraft to found Pandora Media to bring their idea to market...
, intended to "capture the essence of music
Music
Music is an art form whose medium is sound and silence. Its common elements are pitch , rhythm , dynamics, and the sonic qualities of timbre and texture...
at the fundamental level.” The Music Genome technology is used by Pandora
Pandora (music service)
Pandora Radio is an automated music recommendation service and custodian of the Music Genome Project available only in the United States. The service plays musical selections similar to song suggestions entered by a user...
to play music for Internet users based on their preferences.
Movie Genome attributes might include mood, tone, plot, and structure. Jinni’s Movie Genome has a taxonomy
Taxonomy
Taxonomy is the science of identifying and naming species, and arranging them into a classification. The field of taxonomy, sometimes referred to as "biological taxonomy", revolves around the description and use of taxonomic units, known as taxa...
created by film professionals, while titles are automatically indexed using a mixture of metadata
Metadata
The term metadata is an ambiguous term which is used for two fundamentally different concepts . Although the expression "data about data" is often used, it does not apply to both in the same way. Structural metadata, the design and specification of data structures, cannot be about data, because at...
and reviews and a proprietary Natural Language Processing solution to assign semantic tags to content and users.
Applications
The Movie Genome has several applications in the area of movie discovery. It can power search engines, notably semantic searchSemantic search
Semantic search seeks to improve search accuracy by understanding searcher intent and the contextual meaning of terms as they appear in the searchable dataspace, whether on the Web or within a closed system, to generate more relevant results. Author Seth Grimes lists "11 approaches that join...
, which takes a meaning-based approach to interpreting queries by identifying concepts within the content, rather than keywords. The data about each title in a Movie Genome can also support an item-based recommendation engine that recommends based on similarities between content items and users’ preferred “genes.” By contrast, collaborative filtering
Collaborative filtering
Collaborative filtering is the process of filtering for information or patterns using techniques involving collaboration among multiple agents, viewpoints, data sources, etc. Applications of collaborative filtering typically involve very large data sets...
is used to make recommendations based on statistical similarities in preferences between users.
The concept of “genes” or “DNA” has also been applied to other types of entertainment. For example, GamerDNA
GamerDNA
gamerDNA Inc. is a social media company for computer and video game players founded on 2006-09-21, acquired by Crispy Gamer in December 2009. The name is usually spelled with a lower case g: gamerDNA...
has a database of games that locates games based on gameplay elements such as setting, tone and game mechanics.
Examples
Jinni has created a Movie Genome by taking a taxonomic approach to cataloging titles, analyzing user reviews and metadataMetadata
The term metadata is an ambiguous term which is used for two fundamentally different concepts . Although the expression "data about data" is often used, it does not apply to both in the same way. Structural metadata, the design and specification of data structures, cannot be about data, because at...
. The technology is used to power a semantic discovery engine for movies and TV shows.