If Jason Stamper, Yves de Montcheuil and Mile Sheridan set out the challenges of ‘a data world in chaos’ at Talend Connect 2013, Ciaran Dynes offered practical answers. After a morning of overviews on modern data analytics, Dyne’s Product Roadmap presentation got down to the nitty-gritty details of Talend’s future software.

The unique selling point of Talend is their single platform for completing a variety of different tasks. Data integration can be tackled alongside application integration and data governance. In the third presentation of the conference, Dynes mapped out the road ahead for the latest version of software, Talend 5.3:
F Front 2 Liv-Flo 002

Talend 5.3

Hadoop

For analytical applications it’s often necessary to move large batches of data at once, using ETL tools. Hadoop is also designed as an engine for this kind of ‘batch analytics’, but for a different type of data and many existing ETL developers aren’t equipped with the skills to use it.  Talend’s 5.3 release claims to make Hadoop easily accessible to ETL developers without them needing to know how to program in Java.

MapReduce

Talend 5.3 purports to support MapReduce at full cluster scale.

Oakland

Oakland software has now been acquired by Talend to enhance their ability to handle complex XML.

 

Talend 5.4

With big disclaimers, Dynes also offered a tentative outline of the possibilities for the next version of Talend.


Hadoop and YARN

Talend 5.4 will be compatible with the new version of Hadoop. This will include support for YARN, a feature which manages cluster resources and solves the problems of a vulnerable master node. It is also expected to be possible to visually optimize MapReduce jobs.

KERBEROS

Talend 5.4 will support the security software KERBEROS, a feature which was met with cheers from the Talend Connect 2013 audience.

NoSQL

Talend 5.4 is also expected to support the ‘bulk’ features of Not Only SQL databases such as MongoDB and Cassandra.

Reuse of Data Integration Code

For tasks where it’s necessary to copy coding to use it many times, Talend 5.4 will support the reuse of complex sets of Data Integration code.

‘Embrace the future’

Attendees at the Talend Connect 2013 were encouraged to ‘embrace the future’ of the big data community. Dynes’ presentation revealed how Talend plan to do this within their own software. The Talend Roadmap outlines precise technical changes, adapting and improving software for the chaotic data world.

 

Learning more about Hadoop, Big Data and Talend

To understand more about Hadoop, ‘Big Data’, Talend or other data jargon, contact us via enquiries@red-olive.co.uk or by phone +44 1256 83 11 00.  You could also have a look at our previous posts: ‘ “The data world is in chaos”: A review of Talend Connect 2013’ and ‘Practical experience implementing Talend Open Studio data management platform (community edition)’.  We’d love to hear from you.