Data Integration Futurethink - tdwi.1105uat.com/media/DADC6FFD0664499BB... · Data Integration...
Transcript of Data Integration Futurethink - tdwi.1105uat.com/media/DADC6FFD0664499BB... · Data Integration...
WHITE PAPER
Data Integration Futurethink
How Talend Addresses the Changing Needs
of Data Integration
2
WHITE PAPER Data Integration Futurethink
Table of Contents INTRODUCTION ................................................................................................... 3
TREND ONE: BIG DATA AND UNLIMITED SCALABILITY ............................... 4
TREND TWO: A TOTAL DATA MANAGEMENT APPROACH ........................... 5
TREND THREE: AVAILABILITY OF POWERFUL, FREE APPS ........................ 6
TREND FOUR: SIMPLE, TRANSPARENT SOFTWARE PROCUREMENT ....... 7
TREND FIVE: FAST LEARNING CURVE ............................................................ 8
TREND SIX: COMMODITY SERVERS AND MORE CPUS ................................. 9
TREND SEVEN: SOCIAL MEDIA AND COMMUNITIES ................................... 10
TREND EIGHT: LOWERING TOTAL COST OF OWNERSHIP ......................... 11
CONCLUSION .................................................................................................... 12
ABOUT TALEND ................................................................................................ 13 Contact Us ........................................................................................................... 13
Talend, the recognized market leader in open source data integration, leverages the open
source model to make data management available to all types of organizations, regardless of
their size, level of expertise or budgetary constraints. Talend’s solutions connect to all source
and target systems.
Talend’s solutions are used primarily for integration between operational systems, as well as
for ETL (Extract, Transform, Load) for Business Intelligence and Data Warehousing, for
migration, and for data quality management.
3
WHITE PAPER Data Integration Futurethink
Introduction
“Even the most brilliant thinkers of every age are stuck in their mental trenches. Thomas Edison believed that widespread use of alternating current (AC) would burn down a city. Before railroads came along, the thinking of ‘astute’ and ‘knowledgeable’ experts was that humans could not travel faster than 15 miles per hour, because if they did, blood would spurt from their noses and ears. Steve Jobs and the other personal computer pioneers recognized the promise of home computers, at a time when computer experts saw them as mere toys.” FutureThink: How to Think Clearly in a Time of Change, Edie Weiner, Arnold Brown 2005
The purpose of this whitepaper is to highlight eight key data integration trends that necessitate
every firm to rethink their current assumptions and the business value they are providing.
Data integration technology continues to evolve from a method to extract, transform, and load
information between a few systems, to integrating, normalizing, and cleansing vast amounts
of distributed data for real-time business intelligence. Failure is not an option.
When choosing an integration solution that best meets their needs, companies must ask
themselves key questions, such as:
• What is the scope of the project?
• Is this a one-off project or recurring? What can I leverage across multiple projects?
• What data sources need to be connected, both today and tomorrow?
• Will the solution maximize my business agility, and not lock me into a limited technology?
• Can I get started quickly and scale as fast as the business requires?
• How can I enlist the vast data integration community to help?
This whitepaper presents new data integration trends and why customers are choosing
Talend, the recognized leader in open source data management.
4
WHITE PAPER Data Integration Futurethink
Trend One: Big Data and Unlimited Scalability All organizations—whatever their size—are faced with managing increasing volumes of data.
It is estimated that the amount of data doubles every year, fueled by the digitalization of
production lines and distribution systems (RFID), automated ERP and CRM processes, office
and communications software, business intelligence tools, social media, etc. Data integration
solutions need to adapt to big data and the ever-increasing volume, variety, and velocity of
data.
As firms move to execute business processes and
transactions in near real-time, IT systems need to be
hardened to provide high resilience and fast, scalable
performance. No matter if the targets/sources are big
data, enterprise data or text files, data integration
processes need to be reliable, available and scalable.
Solutions must support your evolving data needs
without significant reachitecture.
Talend provides an easy-to-use graphical environment
that allows developers to visually map big data sources
and targets without the need to learn and write
complicated code. Running 100% natively on Hadoop,
Talend Big Data provides massive scalability. Once a
big data connection is configured the underlying code is automatically generated and can be
deployed remotely as a job that runs natively on your big data cluster - HDFS, Pig, HCatalog,
HBase, Sqoop or Hive. It is a similar story if you’re doing traditional enterprise data
integration; however, the underlying code might be JAVA or SQL. Talend’s code generation
engine is unique in the data integration market and gives Talend customers a competitive
productivity edge.
With Talend solutions, you can reuse existing developments in new projects, shortening time-
to-market while increasing productivity.
“Among the main gains achieved using the Talend solution is that we have attained greater scalability and manageability of data integration processes, better transferring of internal developer skills (permanent and temporary employees) and improved collaboration and sharing of experiences both internally and externally.”
Delphine Steinbach Vice-Head Scientist and Coordinator of the URGI platform for INRA
5
WHITE PAPER Data Integration Futurethink
Trend Two: A Total Data Management Approach Data comes in all formats and sizes today, from packaged apps, to flat flies, to spreadsheets,
to streaming social media, to huge data warehouses. You may still have business users using
Excel for databases and your partners are still sending files as e-mail. A project may start by
integrating CRM, ERP and SCM apps with a data warehouse, but then needs to expand to
connect Hadoop, MongoDB and Excel. You want to handle the high volume transactions that
often come with big data, but you still have relational databases and unstructured data
sources to deal with. Your solution needs to be able to connect everything.
This year has seen a meteoric rise of big data projects. We are beginning to see standards
and patterns emerge in the implementation of big data. Recent surveys show that most
technologists and business users feel that big data is an off-shoot of data management, not a
branch of technology in itself. Whether you have to deal with big data, enterprise data or
spread-marts, data needs to be managed no matter what size. The tides are turning for a total
data management approach.
Dealing with all of these processes and data types may
mean that you will need to access data quality, master
data management or application integration
technologies. You may need to upgrade your technology
as your company matures its data governance strategy.
Wouldn’t it be helpful if data integration, data quality,
application integration and MDM were controlled from
one familiar user interface, not a series of unrelated
ones?
Talend provides a holistic approach to integration, enabling IT organizations to converge
traditionally disparate integration efforts and practices through a common set of products,
tools and best practices. Talend provides a unified platform for managing data, no matter if
data resides in text files, enterprise databases, spreadmarts or big data clusters like Hadoop.
The product set incorporates tools for all your data management needs including: big data
integration, data quality and data governance, master data management, business process
management and application integration.
“Talend brings together data and services integration in a single unified platform, making it easy and fast for kleertjes.com to develop and deploy data services for all our integration needs.”
- Fabio Zuccato, ICT Manager, kleertjes.com
6
WHITE PAPER Data Integration Futurethink
Trend Three: Availability of Powerful, Free Apps The introduction of the iPhone and App Store (or the Android and Google Play) brought an
entirely new ecosystem of application development, distribution, and payment. Users can
download fully functioning apps that are not disabled in any way. Premium apps are often also
available to those who want support, or just want to support the mission of the app.
This has changed the game for enterprise applications.
Many in IT are now expecting this same experience for
business applications – you should be able to quickly
download the software for free and use it. The software
should be functional to get the job done, and provide a
simple and seamless upgrade to advanced versions
and/or support.
For Talend, we meet this demand by offering our
Talend Open Studio product. Talend Open Studio for
data integration is a complete product comprising
many features and a wide range of connectors. It is the
most open, innovative and powerful data integration
solution available today. Talend Open Studio is not a
lightweight product or “trialware” - it contains all the features required for building powerful
data integration processes, and is freely downloadable and usable. Talend Open Studio is in
use in thousands of projects today, supported by an open source community of users.
In addition, Talend Enterprise Data Integration and Talend Platform for Data Management are
enhanced versions of Talend Open Studio, providing additional enterprise-level functionality.
These solutions, which are simple to upgrade, include high-level technical support to respond
to corporate issues and legal guarantees of intellectual property protection (IP indemnification).
The Features Comparison Matrix provides an excellent overview of the features of the
different products.
“After testing several vendors' products, we selected Talend for both their open source roots, making it compatible with our budget constraints, as well as for the service package that allows us to ramp up and become self-sufficient very quickly. The speed with which the product can be deployed, combined with the level of support delivered from the company, were the two defining reasons we chose Talend.”
Jean Gaignebet, IT director for SNCF Rail Testing Agency.
7
WHITE PAPER Data Integration Futurethink
Trend Four: Simple, Transparent Software Procurement In the download app society in which we live, we can download cool new business
applications at the airport, at home, or in the office with no restrictions. This trend is powerful
because it helps us solve problems quicker than ever before. Need to track a flight? Download
an app. Need to know the name of a song? Download an app.
Companies need this same agility. Firms cannot afford
year-long product acquisition cycles, i.e. research,
shortlist, RFI, RFP, proof-of-concept, purchase,
training and consultation. (then you start the project!)
You want to get started quickly with a proof-of-concept
that can then become a full enterprise deployment.
Download an app.
Implementing Talend Open Studio is quick and easy—
just download the latest version from Talend’s website and install it. If you have any problems
(or just want more information), the community forums can help you out. The product is free,
which means that you do not need to justify it to management or start a formal procurement
process before solving your integration issues. You will not spend any time on administration
tasks and will not need to meet with software vendors. You can use the product in an
unlimited mode and, of course, keep it as long as you like. You will also get free product
upgrades by downloading new versions as they become available. No need to negotiate with
the vendor.
“We downloaded Talend Open Studio from the Web, got the user documentation and went through a couple of online tutorials. In two hours we were up and running and developing our first data integration jobs.”
Rock Blanco, President Prime Numbers Technology
8
WHITE PAPER Data Integration Futurethink
Trend Five: Fast Learning Curve Project deployment cycles have shortened as well. Developers do sprints and produce new
versions weekly. Teams need to get up-to-speed quickly on data integration development,
testing, deployment and management functions. Products need to be easy to use, manage
and collaborate on.
Talend tools are very easy to use. The graphical user
interface is intuitive and does not require formal
training. Talend’s Job Designer provides both a
graphical and a functional view of the actual integration
processes using a graphical palette of components and
connectors—the Component Library. Integration
processes are built by simply dragging and dropping
components and connectors onto the workspace,
drawing connections and relationships between them,
and setting their properties (most properties are
inherited from the metadata).
Talend’s Business Modeler leverages a top-down
approach, allowing line-of-business stakeholders to get
involved in the design of integration processes and to monitor development progress.
With Talend, we provide a wide range of online help on our help.talend.com web site.
“We’re a marketing group, not a software development shop. While we have some technology-savvy people, I was initially concerned that we’d get into too much coding and not enough support, but Talend Open Studio really had all the features we needed to develop the data integration processes quickly. And we were even able to extend it with a few custom Java routines to cleanse the incoming data.”
Joe Burns, Marketing Analytics, Program Manager, Major Automotive Manufacturer
9
WHITE PAPER Data Integration Futurethink
Trend Six: Commodity Servers and More CPUs Can you predict how many CPUs you will need for data integration tasks 3 years from now?
Proprietary vendors charge a “data tax” which increases the cost of processing additional data,
i.e. when adding servers, data sources/targets, or even transitioning to multi-core CPUs
requires the purchase of additional licenses. Thus, infrastructure costs are not predictable as
it is difficult to determine when you need to expand and future connectivity requirements.
With Talend, the cost of the solution is based on the
number of developers of data integration processes.
You can access new data as needed. For example,
when setting up a new application or acquiring a new
business—operations sometimes hard to predict in
advance—you do not need to buy additional licenses.
Moreover, if a company is moving from development
mode to maintenance mode, it does not have to keep
all its licenses.
“For our data integration project, Talend charged by the number of developers and not by CPU. Charging by CPU dramatically raises costs when deploying dual- or quad-core servers. The per-developer license protects our budget even as our business grows."
Frédéric Chauvat, IT Director Cdiscount
10
WHITE PAPER Data Integration Futurethink
Trend Seven: Social Media and Communities Joining a community has become the way we live today. Whether it is a community of friends
and family on Facebook or a community of business associates on LinkedIn, communities
allow us to share ideas and obtain needed services. In software, the open source community
is an effective crowdsourcing community providing support and product development.
Talend’s online community of over 100,000 registered
users, offers many resources that facilitate
implementing and maintaining Talend solutions
including forums, wiki, tutorials, blogs, and BugTracker.
Talend encourages contributions from its users, who
develop connectors, features or modify the source code,
giving back their contributions to the larger community.
For example, four months after launching Babili—
Talend’s first community user interface translation
application—Talend introduced nine language packs
which localize the UI of its products. Talend’s tools are
the only data integration solutions on the market today
that provide localization options for users and
developers, extending Talend’s reach around the world.
“The collaborative spirit among Talend users is excellent, and the talendforge.org forum is a great example of this collaborative work. Not only is this forum already very rich with information, but many community members are actively involved in providing answers and help to newbie users who, in turn, quickly become contributors. We have also used the Wiki and the BugTracker, and Talend’s documentation is of excellent quality.”
Maik Böttcher, IT Manager, Pokolm
11
WHITE PAPER Data Integration Futurethink
Trend Eight: Lowering Total Cost of Ownership In an age when IT is constantly ask to do more with fewer people and flat budgets, it is
important to squeeze as much value out of your IT investments. An investment may have a
rapid return (ROI); however, when looking at the total
cost of ownership (TCO) over a 3 or 5 year span, the
solution costs (resources, software, hardware,
training, etc) may not be viable. Moving to a lower
TCO data integration solution enables funding for new
strategic projects.
Talend's solutions are much more cost effective than
equivalent proprietary solutions offered on the market.
They are also less expensive to deploy, maintain, and
support providing excellent TCO. In addition, they
facilitate faster development compared to proprietary
tools and hand-coding. For example, the Business
Modeler streamlines communication between
business users and the development teams, reducing
the time a company typically takes to identify needs
and launch new product development.
Talend’s unified platform across all data management and application functions means that
your resources can leverage their Talend skills across multiple projects. The Talend
community provides an abundance of resources to help keep your costs down.
“We were looking for the best solution to meet our business requirements at an optimal cost. We selected an open source solution, not only because of the budget considerations, but also for power and extensibility. Talend Open Studio proved to be the right decision for us. The product is feature-rich, stable, and robust. It doesn’t take a lot of time to build data integration processes, which is important in our fast moving business environment.”
Marcin Kierdelewicz, Head of Systems, Sitelynx
12
WHITE PAPER Data Integration Futurethink
Conclusion The data integration market is very large—more than $13 billion annually and growing. More
and more, open source vendors are taking this business from the high-priced proprietary
incumbents. This is partly because of the value-add that open source brings to the table—high
value instead of high price with no vendor lock-in and a faster time-to-market.
Another important factor is how a company responds to three key players: its community, its
partners, and its customers. Many open source companies tend to be strong in one or two of
these areas, but Talend is unique in excelling in all three. Today, Talend has a strongly
involved and committed community with give-and-take in both directions (over 100,000
community members); over 1,800 active customers worldwide; and an extensive partner
network.
13
WHITE PAPER Data Integration Futurethink
About Talend
Talend provides integration that truly scales. From small projects to enterprise-wide
implementations, Talend’s highly scalable data, application and business process integration
platform maximizes the value of an organization’s information assets and optimizes return on
investment through a usage-based subscription model. Ready for big data environments,
Talend’s flexible architecture easily adapts to future IT platforms. And a common set of easy-
to-use tools implemented across all Talend products enable teams to scale developer skillsets,
too.
More than 1,800 active subscribers worldwide leverage Talend’s solutions and services. The
company has major offices in North America, Europe and Asia, and a global network of
technical and services partners. For more information, please visit www.talend.com.
Contact Us
www.talend.com/contact
© Talend 2013 WP177-EN