Talend Open Studio
Encyclopedia
Talend Open Studio is an open source
data integration
product developed by Talend
and designed to combine, convert and update data in various locations across a business.
Talend also provides Talend Integration Suite, a commercial extension to Talend Open Studio with additional features, technical support and IP indemnification.
(since v2.0) or Perl
(since v1.0). Its GUI
is made of a metadata
repository and a graphical designer. The metadata repository contains the definitions and configuration for each job - but not the actual data being transformed or moved. The information in the metadata repository is used by all of the components of Talend Open Studio.
The product is based on Eclipse RCP. Most of its contributors work for commercial open source vendor Talend
.
Individual jobs are designed using graphical components , of which over 400 are available, for transformation, connectivity, or other operations. The jobs created can be executed from within the studio or as standalone scripts
.
An organization might typically use Talend Open Studio for:
Open source
The term open source describes practices in production and development that promote access to the end product's source materials. Some consider open source a philosophy, others consider it a pragmatic methodology...
data integration
Data integration
Data integration involves combining data residing in different sources and providing users with a unified view of these data.This process becomes significant in a variety of situations, which include both commercial and scientific domains...
product developed by Talend
Talend
Talend is an open source software vendor that provides data integration, data management and enterprise application integration software and solutions. Headquartered in Suresnes, France and Los Altos, California, Talend has offices in North America, Europe and Asia, and a global network of...
and designed to combine, convert and update data in various locations across a business.
History
Talend Open Studio is distributed under GPLv2 and was launched in October 2006. In January 2008, it had been downloaded over 1 million times. In July 2009, the product totaled 5 million downloads and over 300,000 users.Talend also provides Talend Integration Suite, a commercial extension to Talend Open Studio with additional features, technical support and IP indemnification.
Product Description
Talend Open Studio operates as a code generator allowing data transformation scripts and underlying programs to be generated either in JavaJava (programming language)
Java is a programming language originally developed by James Gosling at Sun Microsystems and released in 1995 as a core component of Sun Microsystems' Java platform. The language derives much of its syntax from C and C++ but has a simpler object model and fewer low-level facilities...
(since v2.0) or Perl
Perl
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Perl was originally developed by Larry Wall in 1987 as a general-purpose Unix scripting language to make report processing easier. Since then, it has undergone many changes and revisions and become widely popular...
(since v1.0). Its GUI
Gui
Gui or guee is a generic term to refer to grilled dishes in Korean cuisine. These most commonly have meat or fish as their primary ingredient, but may in some cases also comprise grilled vegetables or other vegetarian ingredients. The term derives from the verb, "gupda" in Korean, which literally...
is made of a metadata
Metadata
The term metadata is an ambiguous term which is used for two fundamentally different concepts . Although the expression "data about data" is often used, it does not apply to both in the same way. Structural metadata, the design and specification of data structures, cannot be about data, because at...
repository and a graphical designer. The metadata repository contains the definitions and configuration for each job - but not the actual data being transformed or moved. The information in the metadata repository is used by all of the components of Talend Open Studio.
The product is based on Eclipse RCP. Most of its contributors work for commercial open source vendor Talend
Talend
Talend is an open source software vendor that provides data integration, data management and enterprise application integration software and solutions. Headquartered in Suresnes, France and Los Altos, California, Talend has offices in North America, Europe and Asia, and a global network of...
.
Individual jobs are designed using graphical components , of which over 400 are available, for transformation, connectivity, or other operations. The jobs created can be executed from within the studio or as standalone scripts
Scripting language
A scripting language, script language, or extension language is a programming language that allows control of one or more applications. "Scripts" are distinct from the core code of the application, as they are usually written in a different language and are often created or at least modified by the...
.
An organization might typically use Talend Open Studio for:
- Synchronization or replication of databases
- Right-time or batch exchanges of data
- ETLExtract, transform, loadExtract, transform and load is a process in database usage and especially in data warehousing that involves:* Extracting data from outside sources* Transforming it to fit operational needs...
(Extract Transform Load) for analytics - Data migrationData migrationData migration is the process of transferring data between storage types, formats, or computer systems. Data migration is usually performed programmatically to achieve an automated migration, freeing up human resources from tedious tasks...
- Complex data transformation and loading
- Data qualityData qualityData are of high quality "if they are fit for their intended uses in operations, decision making and planning" . Alternatively, the data are deemed of high quality if they correctly represent the real-world construct to which they refer...