CloverETL

CloverETL
Developer(s) Javlin Inc.
Initial release 2002
Stable release 4.1.1 / December 3, 2015
Development status Active
Operating system Cross-platform
Type ETL tools
License dual LGPL, commercial
Website http://www.cloveretl.com/

CloverETL is a Java-based data integration ETL platform for rapid development and automation of data transformations, data cleansing, data migration and distribution of data into applications, databases, cloud and data warehouses. The product family starts with an open source runtime engine and limited Community edition of visual data transformation Designer. CloverETL's commercial offerings include a fully featured Designer and Server and Cluster platforms. The Server adds automation and workflow orchestration, allowing users to deploy fully automated production environments, with the possibility to scale to a cluster for added performance and robustness. Its goal is to be flexible and light-footed, so that it can be customized and embedded into third party applications. The open source and commercial products are developed and supported by Javlin, a data integration software and solutions provider.

Javlin's offices are located in the Washington DC area; London, UK; Frankfurt, Germany and Prague, Czech Republic and serve customers all over the world. With approximately 60 employees, Javlin serves more than 3,000 customers, including five OEM partners.[1] Parts of the CloverETL platform – the Engine, Designer, and Server – can be embedded on an OEM basis.

Customers include Oracle, Initiate Systems/IBM, Comcast, SUNY, and other Fortune 500 companies.

History

In 2002, the CloverETL project – named jETeL – was launched as the first Java-based open source ETL tool. In 2006, it was renamed to clover.ETL, followed by CloverETL, now a registered trademark, in 2009. Starting out as a proof of concept, its purpose was to bring the performance and functionality of big enterprise ETL tools to regular users who, at the time, did not have access to enterprise-level systems. Over time, it evolved into a data integration toolset ranging from the original core library (CloverETL Engine) to a full-fledged enterprise platform.

The CloverETL Engine is offered for free under LGPL with vendor support for the open-source ETL community.[2] In 2010, a visual data transformation designer was also made public for free use.

Javlin, the official developer and support of CloverETL, was founded in 2005 under the name “Javlin Consulting”. The company’s founder and president, David Pavlis, is also the creator of CloverETL.

Architecture

CloverETL is a Java-based ETL tool with open source components. It is either used in standalone mode – as a command-line or server application – or embedded in other applications – as a Java library. CloverETL is accompanied by the CloverETL Designer graphical user interface available as either an Eclipse plug-in or standalone application.

A data transformation in CloverETL is represented by a transformation dataflow, or graph, containing a set of interconnected components joined by edges. A component can either be a source (reader), a transformation (reformat, sort, filter, joiner, etc.) or a target (writer). The edges act as pipes, transferring data from one component to another. Each edge has a certain metadata assigned to it that describes the format of the data it transfers. The transformation graphs are represented in XML files and can be dynamically generated.

Each component runs in a separate thread and acts either as a consumer or a producer. This is used to drive data through the transformation for both simple and complex graphs and makes the platform extendable by building custom components, connections etc. Transformation graphs can then be combined into a jobflow, which defines the sequence in which the individual graphs are executed.

Fundamental aspects

The Server version of CloverETL supports parallel execution of transformations and runs inside a JavaEE application container.

Suite of Products

Commercial Extras

These come packaged with all commercial licenses.

Open Source solutions typically appeal to independent software vendors (ISVs) and systems integrators (SIs) who see these solutions as attractive alternatives to writing code.[3] Products can be embedded into solutions for Enterprise Service Bus (ESB), Business Intelligence (BI), etc.[4] [5]

CloverETL is embedded in the Oracle Endeca Information Discovery Integrator as well as GoodData CloudConnect[4][6][7][8]

CloverETL Community Edition

The CloverETL Community Edition is based on the Open Source transformation engine and also includes a limited CloverETL Designer. It is for users with modest data transformations and ETL requirements. The CloverETL Community Edition is free. The current version of CloverETL Community comes with a Graphic User Interface (GUI). In the past, the Community Edition used a command line style prompt to create and design data management projects.

CloverETL Community is Java-based and has been deployed on the following Operating System platforms: Linux both 32 & 64 bit), Windows (both 32 & 64 bit), HP-UX, AIX, AS/400 (IBM System I), Solaris, and Mac OS X. The Community edition contains connectors for the following data sources: text file delimited, fix-length and combined, XML, XLS, RDBMS through JDBC, WebServices through REST/SOAP protocols, JMS, LDAP, dBase/FoxBase/FoxPro, bulk-loaders for Oracle, DB2, MS SQL, Informix, MySQL and PostgreSQL, and QuickBase.[9]

With the Community Edition, users have access to the transformation components that allow them to accomplish common data transformations tasks such as reformatting, filtering, and sorting data. Users also can use available components for aggregating, merging, or deduplicating data. The CloverETL Community Edition provides the Hash Join component and allows use of the DBExecute, System Execute, and HTTPConector components as well.

Partners

Technical specifications

Connectors

Competitors

Other ETL frameworks include:[3]

References

  1. Topsy. N.p., n.d. Web. 20 June 2013. <http://topsy.com/s/cloveretl>.
  2. Roy, Krishna. "Javlin Elucidates CloverETL Strategy as It Continues to Take Aim at Data Integration." MIS Impact Report (2013): 1–4.
  3. 1 2 "Data Integration Vendors Comparison." Adeptia. N.p., n.d. Web. 20 June 2013. <https://adeptia.com/products/etl_vendor_comparison.html>.
  4. 1 2 "GoodData Selects CloverETL to Enrich Data Integration – GoodData." GoodData. N.p., 6 December 2012. Web. 20 June 2013. <http://www.gooddata.com/in-the-news/gooddata-selects-cloveretl-to-enrich-data-integration/>
  5. Wang, Qian. "Research of ETL on University Data Exchange Platform." IEEE Xplore. N.p., n.d. Web. 20 June 2013. <http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true>.
  6. "Oracle Endeca Information Discovery- CloverETL." OBIEE, Endeca and ODI. N.p., 25 Oct. 2012. Web. 20 June 2013. <http://www.varanasisaichand.com/2012/10/oracle-endeca-information-discovery.html>.
  7. "Introduction – Oracle Identity Analytics Business Administrator's Guide." Oracle. N.p., n.d. Web. 20 June 2013. <http://docs.oracle.com/cd/E27119_01/doc.11113/e23124/businessadministratorsguideprintable23.html>.
  8. "Endeca – Information Discovery Integrator (CloverETL)." GerardNicocom Weblog RSS. N.p., n.d. Web. 20 June 2013. <http://gerardnico.com/wiki/cloveretl/cloveretl>.
  9. Gutierrez, Jeremiah, Kent Lawson, Eddie Molina, Nestor Rodriguez. “Data Warehousing Tool Evaluation – ETL Focused." Southwest Decision Sciences Institute. 2012. 8-9. <http://www.swdsi.org/swdsi2012/proceedings_2012/papers/Papers/PA151.pdf>

External links

This article is issued from Wikipedia - version of the Tuesday, February 02, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.