Data Integration Futurethink - tdwi.1105uat.com/media/DADC6FFD0664499BB... · Data Integration...

13
WHITE PAPER Data Integration Futurethink How Talend Addresses the Changing Needs of Data Integration

Transcript of Data Integration Futurethink - tdwi.1105uat.com/media/DADC6FFD0664499BB... · Data Integration...

WHITE PAPER

Data Integration Futurethink

How Talend Addresses the Changing Needs

of Data Integration

2

WHITE PAPER Data Integration Futurethink

Table of Contents INTRODUCTION ................................................................................................... 3  

TREND ONE: BIG DATA AND UNLIMITED SCALABILITY ............................... 4  

TREND TWO: A TOTAL DATA MANAGEMENT APPROACH ........................... 5  

TREND THREE: AVAILABILITY OF POWERFUL, FREE APPS ........................ 6  

TREND FOUR: SIMPLE, TRANSPARENT SOFTWARE PROCUREMENT ....... 7  

TREND FIVE: FAST LEARNING CURVE ............................................................ 8  

TREND SIX: COMMODITY SERVERS AND MORE CPUS ................................. 9  

TREND SEVEN: SOCIAL MEDIA AND COMMUNITIES ................................... 10  

TREND EIGHT: LOWERING TOTAL COST OF OWNERSHIP ......................... 11  

CONCLUSION .................................................................................................... 12  

ABOUT TALEND ................................................................................................ 13  Contact Us ........................................................................................................... 13  

Talend, the recognized market leader in open source data integration, leverages the open

source model to make data management available to all types of organizations, regardless of

their size, level of expertise or budgetary constraints. Talend’s solutions connect to all source

and target systems.

Talend’s solutions are used primarily for integration between operational systems, as well as

for ETL (Extract, Transform, Load) for Business Intelligence and Data Warehousing, for

migration, and for data quality management.

3

WHITE PAPER Data Integration Futurethink

Introduction

“Even the most brilliant thinkers of every age are stuck in their mental trenches. Thomas Edison believed that widespread use of alternating current (AC) would burn down a city. Before railroads came along, the thinking of ‘astute’ and ‘knowledgeable’ experts was that humans could not travel faster than 15 miles per hour, because if they did, blood would spurt from their noses and ears. Steve Jobs and the other personal computer pioneers recognized the promise of home computers, at a time when computer experts saw them as mere toys.” FutureThink: How to Think Clearly in a Time of Change, Edie Weiner, Arnold Brown 2005

The purpose of this whitepaper is to highlight eight key data integration trends that necessitate

every firm to rethink their current assumptions and the business value they are providing.

Data integration technology continues to evolve from a method to extract, transform, and load

information between a few systems, to integrating, normalizing, and cleansing vast amounts

of distributed data for real-time business intelligence. Failure is not an option.

When choosing an integration solution that best meets their needs, companies must ask

themselves key questions, such as:

• What is the scope of the project?

• Is this a one-off project or recurring? What can I leverage across multiple projects?

• What data sources need to be connected, both today and tomorrow?

• Will the solution maximize my business agility, and not lock me into a limited technology?

• Can I get started quickly and scale as fast as the business requires?

• How can I enlist the vast data integration community to help?

This whitepaper presents new data integration trends and why customers are choosing

Talend, the recognized leader in open source data management.

 

 

4

WHITE PAPER Data Integration Futurethink

Trend One: Big Data and Unlimited Scalability All organizations—whatever their size—are faced with managing increasing volumes of data.

It is estimated that the amount of data doubles every year, fueled by the digitalization of

production lines and distribution systems (RFID), automated ERP and CRM processes, office

and communications software, business intelligence tools, social media, etc. Data integration

solutions need to adapt to big data and the ever-increasing volume, variety, and velocity of

data.

As firms move to execute business processes and

transactions in near real-time, IT systems need to be

hardened to provide high resilience and fast, scalable

performance. No matter if the targets/sources are big

data, enterprise data or text files, data integration

processes need to be reliable, available and scalable.

Solutions must support your evolving data needs

without significant reachitecture.

Talend provides an easy-to-use graphical environment

that allows developers to visually map big data sources

and targets without the need to learn and write

complicated code. Running 100% natively on Hadoop,

Talend Big Data provides massive scalability. Once a

big data connection is configured the underlying code is automatically generated and can be

deployed remotely as a job that runs natively on your big data cluster - HDFS, Pig, HCatalog,

HBase, Sqoop or Hive. It is a similar story if you’re doing traditional enterprise data

integration; however, the underlying code might be JAVA or SQL. Talend’s code generation

engine is unique in the data integration market and gives Talend customers a competitive

productivity edge.

With Talend solutions, you can reuse existing developments in new projects, shortening time-

to-market while increasing productivity.

“Among the main gains achieved using the Talend solution is that we have attained greater scalability and manageability of data integration processes, better transferring of internal developer skills (permanent and temporary employees) and improved collaboration and sharing of experiences both internally and externally.”

Delphine Steinbach Vice-Head Scientist and Coordinator of the URGI platform for INRA

5

WHITE PAPER Data Integration Futurethink

Trend Two: A Total Data Management Approach Data comes in all formats and sizes today, from packaged apps, to flat flies, to spreadsheets,

to streaming social media, to huge data warehouses. You may still have business users using

Excel for databases and your partners are still sending files as e-mail. A project may start by

integrating CRM, ERP and SCM apps with a data warehouse, but then needs to expand to

connect Hadoop, MongoDB and Excel. You want to handle the high volume transactions that

often come with big data, but you still have relational databases and unstructured data

sources to deal with. Your solution needs to be able to connect everything.

This year has seen a meteoric rise of big data projects. We are beginning to see standards

and patterns emerge in the implementation of big data. Recent surveys show that most

technologists and business users feel that big data is an off-shoot of data management, not a

branch of technology in itself. Whether you have to deal with big data, enterprise data or

spread-marts, data needs to be managed no matter what size. The tides are turning for a total

data management approach.

Dealing with all of these processes and data types may

mean that you will need to access data quality, master

data management or application integration

technologies. You may need to upgrade your technology

as your company matures its data governance strategy.

Wouldn’t it be helpful if data integration, data quality,

application integration and MDM were controlled from

one familiar user interface, not a series of unrelated

ones?

Talend provides a holistic approach to integration, enabling IT organizations to converge

traditionally disparate integration efforts and practices through a common set of products,

tools and best practices. Talend provides a unified platform for managing data, no matter if

data resides in text files, enterprise databases, spreadmarts or big data clusters like Hadoop.

The product set incorporates tools for all your data management needs including: big data

integration, data quality and data governance, master data management, business process

management and application integration.

“Talend brings together data and services integration in a single unified platform, making it easy and fast for kleertjes.com to develop and deploy data services for all our integration needs.”

- Fabio Zuccato, ICT Manager, kleertjes.com

6

WHITE PAPER Data Integration Futurethink

Trend Three: Availability of Powerful, Free Apps The introduction of the iPhone and App Store (or the Android and Google Play) brought an

entirely new ecosystem of application development, distribution, and payment. Users can

download fully functioning apps that are not disabled in any way. Premium apps are often also

available to those who want support, or just want to support the mission of the app.

This has changed the game for enterprise applications.

Many in IT are now expecting this same experience for

business applications – you should be able to quickly

download the software for free and use it. The software

should be functional to get the job done, and provide a

simple and seamless upgrade to advanced versions

and/or support.

For Talend, we meet this demand by offering our

Talend Open Studio product. Talend Open Studio for

data integration is a complete product comprising

many features and a wide range of connectors. It is the

most open, innovative and powerful data integration

solution available today. Talend Open Studio is not a

lightweight product or “trialware” - it contains all the features required for building powerful

data integration processes, and is freely downloadable and usable. Talend Open Studio is in

use in thousands of projects today, supported by an open source community of users.

In addition, Talend Enterprise Data Integration and Talend Platform for Data Management are

enhanced versions of Talend Open Studio, providing additional enterprise-level functionality.

These solutions, which are simple to upgrade, include high-level technical support to respond

to corporate issues and legal guarantees of intellectual property protection (IP indemnification).

The Features Comparison Matrix provides an excellent overview of the features of the

different products.

“After testing several vendors' products, we selected Talend for both their open source roots, making it compatible with our budget constraints, as well as for the service package that allows us to ramp up and become self-sufficient very quickly. The speed with which the product can be deployed, combined with the level of support delivered from the company, were the two defining reasons we chose Talend.”

Jean Gaignebet, IT director for SNCF Rail Testing Agency.

7

WHITE PAPER Data Integration Futurethink

Trend Four: Simple, Transparent Software Procurement In the download app society in which we live, we can download cool new business

applications at the airport, at home, or in the office with no restrictions. This trend is powerful

because it helps us solve problems quicker than ever before. Need to track a flight? Download

an app. Need to know the name of a song? Download an app.

Companies need this same agility. Firms cannot afford

year-long product acquisition cycles, i.e. research,

shortlist, RFI, RFP, proof-of-concept, purchase,

training and consultation. (then you start the project!)

You want to get started quickly with a proof-of-concept

that can then become a full enterprise deployment.

Download an app.

Implementing Talend Open Studio is quick and easy—

just download the latest version from Talend’s website and install it. If you have any problems

(or just want more information), the community forums can help you out. The product is free,

which means that you do not need to justify it to management or start a formal procurement

process before solving your integration issues. You will not spend any time on administration

tasks and will not need to meet with software vendors. You can use the product in an

unlimited mode and, of course, keep it as long as you like. You will also get free product

upgrades by downloading new versions as they become available. No need to negotiate with

the vendor.

“We downloaded Talend Open Studio from the Web, got the user documentation and went through a couple of online tutorials. In two hours we were up and running and developing our first data integration jobs.”

Rock Blanco, President Prime Numbers Technology

8

WHITE PAPER Data Integration Futurethink

Trend Five: Fast Learning Curve Project deployment cycles have shortened as well. Developers do sprints and produce new

versions weekly. Teams need to get up-to-speed quickly on data integration development,

testing, deployment and management functions. Products need to be easy to use, manage

and collaborate on.

Talend tools are very easy to use. The graphical user

interface is intuitive and does not require formal

training. Talend’s Job Designer provides both a

graphical and a functional view of the actual integration

processes using a graphical palette of components and

connectors—the Component Library. Integration

processes are built by simply dragging and dropping

components and connectors onto the workspace,

drawing connections and relationships between them,

and setting their properties (most properties are

inherited from the metadata).

Talend’s Business Modeler leverages a top-down

approach, allowing line-of-business stakeholders to get

involved in the design of integration processes and to monitor development progress.

With Talend, we provide a wide range of online help on our help.talend.com web site.

“We’re a marketing group, not a software development shop. While we have some technology-savvy people, I was initially concerned that we’d get into too much coding and not enough support, but Talend Open Studio really had all the features we needed to develop the data integration processes quickly. And we were even able to extend it with a few custom Java routines to cleanse the incoming data.”

Joe Burns, Marketing Analytics, Program Manager, Major Automotive Manufacturer

9

WHITE PAPER Data Integration Futurethink

Trend Six: Commodity Servers and More CPUs Can you predict how many CPUs you will need for data integration tasks 3 years from now?

Proprietary vendors charge a “data tax” which increases the cost of processing additional data,

i.e. when adding servers, data sources/targets, or even transitioning to multi-core CPUs

requires the purchase of additional licenses. Thus, infrastructure costs are not predictable as

it is difficult to determine when you need to expand and future connectivity requirements.

With Talend, the cost of the solution is based on the

number of developers of data integration processes.

You can access new data as needed. For example,

when setting up a new application or acquiring a new

business—operations sometimes hard to predict in

advance—you do not need to buy additional licenses.

Moreover, if a company is moving from development

mode to maintenance mode, it does not have to keep

all its licenses.

“For our data integration project, Talend charged by the number of developers and not by CPU. Charging by CPU dramatically raises costs when deploying dual- or quad-core servers. The per-developer license protects our budget even as our business grows."

Frédéric Chauvat, IT Director Cdiscount

10

WHITE PAPER Data Integration Futurethink

Trend Seven: Social Media and Communities Joining a community has become the way we live today. Whether it is a community of friends

and family on Facebook or a community of business associates on LinkedIn, communities

allow us to share ideas and obtain needed services. In software, the open source community

is an effective crowdsourcing community providing support and product development.

Talend’s online community of over 100,000 registered

users, offers many resources that facilitate

implementing and maintaining Talend solutions

including forums, wiki, tutorials, blogs, and BugTracker.

Talend encourages contributions from its users, who

develop connectors, features or modify the source code,

giving back their contributions to the larger community.

For example, four months after launching Babili—

Talend’s first community user interface translation

application—Talend introduced nine language packs

which localize the UI of its products. Talend’s tools are

the only data integration solutions on the market today

that provide localization options for users and

developers, extending Talend’s reach around the world.

“The collaborative spirit among Talend users is excellent, and the talendforge.org forum is a great example of this collaborative work. Not only is this forum already very rich with information, but many community members are actively involved in providing answers and help to newbie users who, in turn, quickly become contributors. We have also used the Wiki and the BugTracker, and Talend’s documentation is of excellent quality.”

Maik Böttcher, IT Manager, Pokolm

11

WHITE PAPER Data Integration Futurethink

Trend Eight: Lowering Total Cost of Ownership In an age when IT is constantly ask to do more with fewer people and flat budgets, it is

important to squeeze as much value out of your IT investments. An investment may have a

rapid return (ROI); however, when looking at the total

cost of ownership (TCO) over a 3 or 5 year span, the

solution costs (resources, software, hardware,

training, etc) may not be viable. Moving to a lower

TCO data integration solution enables funding for new

strategic projects.

Talend's solutions are much more cost effective than

equivalent proprietary solutions offered on the market.

They are also less expensive to deploy, maintain, and

support providing excellent TCO. In addition, they

facilitate faster development compared to proprietary

tools and hand-coding. For example, the Business

Modeler streamlines communication between

business users and the development teams, reducing

the time a company typically takes to identify needs

and launch new product development.

Talend’s unified platform across all data management and application functions means that

your resources can leverage their Talend skills across multiple projects. The Talend

community provides an abundance of resources to help keep your costs down.

“We were looking for the best solution to meet our business requirements at an optimal cost. We selected an open source solution, not only because of the budget considerations, but also for power and extensibility. Talend Open Studio proved to be the right decision for us. The product is feature-rich, stable, and robust. It doesn’t take a lot of time to build data integration processes, which is important in our fast moving business environment.”

Marcin Kierdelewicz, Head of Systems, Sitelynx

12

WHITE PAPER Data Integration Futurethink

Conclusion The data integration market is very large—more than $13 billion annually and growing. More

and more, open source vendors are taking this business from the high-priced proprietary

incumbents. This is partly because of the value-add that open source brings to the table—high

value instead of high price with no vendor lock-in and a faster time-to-market.

Another important factor is how a company responds to three key players: its community, its

partners, and its customers. Many open source companies tend to be strong in one or two of

these areas, but Talend is unique in excelling in all three. Today, Talend has a strongly

involved and committed community with give-and-take in both directions (over 100,000

community members); over 1,800 active customers worldwide; and an extensive partner

network.

13

WHITE PAPER Data Integration Futurethink

About Talend

Talend provides integration that truly scales. From small projects to enterprise-wide

implementations, Talend’s highly scalable data, application and business process integration

platform maximizes the value of an organization’s information assets and optimizes return on

investment through a usage-based subscription model. Ready for big data environments,

Talend’s flexible architecture easily adapts to future IT platforms. And a common set of easy-

to-use tools implemented across all Talend products enable teams to scale developer skillsets,

too.

More than 1,800 active subscribers worldwide leverage Talend’s solutions and services. The

company has major offices in North America, Europe and Asia, and a global network of

technical and services partners. For more information, please visit www.talend.com.

Contact Us

www.talend.com/contact

[email protected]

[email protected]

[email protected]

© Talend 2013 WP177-EN