To Heck with 'Big Data,' 'Little Data' Is the Problem Most Face

'Big Data' gets the press, but 'little data' is the big problem

"Big data" gets all the press - but for the vast majority of people who work with data, it's the proliferation of "little data" that impacts us the most. What do I mean by little data? I'm referring to the spread of various SaaS and Cloud-based applications, on-premises applications, databases, spreadsheets, log files, data files and so forth. Many organizations are plagued with multiple instances of the same applications, or with multiple applications from different vendors that do essentially the same thing. These are the applications and data that run today's enterprise - and they're a mess.

A week doesn't go by without some major vendor issuing a press release about unlocking the value in the mountains of structured and unstructured data that companies love to accumulate. For most of us, though, it's not getting value out of the petabytes that causes us heartburn - it's getting answers out of the megabytes or gigabytes that are distributed across handfuls, dozens or even hundreds of unintegrated systems, applications and data sources.

As I mentioned recently on ebizQ's Integration Edge, the average enterprise uses at least 397 Cloud/SaaS applications in addition to all of the on-premises applications in play. Add to that the various data stores (for example, SQL databases), and it's not unrealistic to say that a typical enterprise has around 1,000 different data-related systems of one sort or another. Apart from the concerns for security, compliance and backup/recovery, one obvious question should stand out: how can I "get value" out of all that data - data locked up in all those different locations and different formats?

Traditionally these types of problems were solved with DBAs, programmers and business analysts (with liberal amounts of "money" and "time" tossed in).  This approach works.  It's time-tested.  It's also expensive, and not particularly scalable or flexible.  Sometimes it can take years to actually get a working solution.

Not every organization has the luxury of taking the "traditional approach" to solving the little data problem.  With the pressure to deliver results faster, better and less expensively, some businesses have found new and innovative ways to get the value they need from this sea of disparate data.

For most of us, a few systems - perhaps a dozen - constitute this disparate data mess. But what would it be like to have to make sense out of thousands of different data sources with different data semantics? And to know that "if things go well", there might be twice as many in a year or two? I recently met up with such a person - and it's quite a compelling story.

This past week, I was fortunate enough to have coffee and "talk about data issues" for a few hours with Jason Haskins - a Data Architect at Alchemy Systems, a rapidly growing international company that delivers innovative technologies and services for the global food industry that increase productivity, ensure regulatory compliance, foster safe working environments, and produce quality products.  In short, they help ensure the safety and quality of our food supply chain.

I've met quite a few data architects in my day, but Jason actually is an architect - he has two architecture degrees, including graduate work at Columbia University and a Master's from the University of Texas at Austin. Talking data architecture with an architect (in the A.I.A. sense of the word) - that's a new one for me. What made it compelling was the way that Jason drew parallels between the challenges of architecture (especially workplace-design architecture) and data - and how his training as a design architect taught him to examine highly complex systems at the systemic level, with usability, flexibility and scalability always at the top of the stack.

Jason's rather unusual background proved valuable when he inherited a wildly disparate and rapidly growing data infrastructure featuring the dreaded "A times B times C times D" problem - 500 clients with 2,000 installations.  Further complicating the situation - many of these installations require customized Alchemy Systems solutions and data models to support multiple product lines, multiple market segments and geographies that can span multiple countries and many regulatory environments.
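To make the "A times B times C times D" phrase concrete, here is a small Python sketch of how independent dimensions multiply into distinct configurations. Every value in it is a hypothetical placeholder chosen for illustration, not a figure from Alchemy Systems.

```python
from itertools import product

# Hypothetical dimensions; the real product lines, segments, countries
# and regulatory environments are not listed in this article.
product_lines = ["line_1", "line_2"]
market_segments = ["segment_1", "segment_2", "segment_3"]
countries = ["country_1", "country_2", "country_3"]
regulatory_regimes = ["regime_1", "regime_2"]

# Each combination is a potentially distinct data model to support.
variants = list(product(product_lines, market_segments,
                        countries, regulatory_regimes))

print(len(variants))  # 2 * 3 * 3 * 2 = 36
```

Even with only two or three values per dimension, the number of possible variants grows quickly, which is why a per-installation integration approach struggles to keep up.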

Because of this rapid growth, Alchemy Systems found itself in a situation where it was simply unable to get value from all of this disparate data.  This was not a "big data" problem - it was a deluge of little data.

Jason's architectural training led him to propose a "meta" layer above the Alchemy Systems applications and data - something that would not interfere with existing data models, would meet the needs of Alchemy's customer success managers and would provide the kind of flexibility and scalability needed to support Alchemy's rapid growth. It would essentially be a software bridge between the customer, Alchemy's development team and the customer-success group at Alchemy. One of Jason's key design requirements was to provide a level of abstraction across the different data sets with no loss of resolution - a rather challenging goal, as "abstraction" and "resolution" are often at odds with each other.
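One way to picture "abstraction without loss of resolution" is a mapping layer that exposes a common vocabulary while keeping every original record intact. The Python sketch below is only an illustration under assumed names: the source labels, column names, `FIELD_MAPS`, `UnifiedRecord`, `unify` and `total_completed` are all hypothetical, and none of this reflects how Flux or Alchemy's systems are actually built.

```python
from dataclasses import dataclass
from typing import Any, Dict, List

@dataclass(frozen=True)
class UnifiedRecord:
    source: str              # which installation the row came from
    fields: Dict[str, Any]   # abstracted view in a shared vocabulary
    raw: Dict[str, Any]      # untouched original row (full resolution)

# Hypothetical mappings: common field name -> source-specific column name.
FIELD_MAPS: Dict[str, Dict[str, str]] = {
    "install_a": {"client": "cust_name", "completed": "courses_done"},
    "install_b": {"client": "Customer", "completed": "CompletedTrainings"},
}

def unify(source: str, row: Dict[str, Any]) -> UnifiedRecord:
    """Project a source row onto the common vocabulary, keeping the original."""
    mapping = FIELD_MAPS[source]
    fields = {common: row[col] for common, col in mapping.items() if col in row}
    return UnifiedRecord(source=source, fields=fields, raw=dict(row))

def total_completed(records: List[UnifiedRecord]) -> int:
    """Aggregate across sources using only the abstracted fields."""
    return sum(int(r.fields.get("completed", 0)) for r in records)

records = [
    unify("install_a", {"cust_name": "Acme", "courses_done": 12, "shift": "night"}),
    unify("install_b", {"Customer": "Acme", "CompletedTrainings": "8"}),
]
print(total_completed(records))  # 20
print(records[0].raw["shift"])   # night
```

The existing data models are never modified: a dashboard can aggregate over `fields`, while source-specific detail such as the unmapped `shift` column remains available through `raw`.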

There are commercially available products out there that Jason might have turned to in order to solve the data mapping problem that he faced. But Jason's belief was that "usability" extended far beyond just a data mapping layer - he wanted to deliver an integrated solution that united the customer data and also provided data visualization capabilities to the customer managers at Alchemy. And it was his belief that Alchemy would be best served by a product that would do the data management and the data visualization in a single product stack.

What Jason found was that there are some products out there that integrate and unite data very well.  And there are other products out there that do data visualization very well.  But finding a single product that did both of those well turned out to be a challenge.

At South by Southwest Interactive, after attending a session on data visualization and integration, Jason got involved in a discussion with Gaute Solaas - an Austin-based technologist and CEO whose company, IQumulus, was developing a Cloud-based data management technology called Flux.

In a conversation with me, Gaute reflected on his interactions with Jason: "The more I spoke with Jason about the problems he was facing, the more I realized that our new product needed to solve the data visualization problem as well as the data management problem in a single Cloud-based product that provides a business intelligence solution for large quantities and varieties of small data. So we worked with Alchemy Systems on the product requirements and quickly delivered the enhancements they needed for a pilot project."

Jason drew up plans for an Account Management Dashboard pilot project using Flux that would allow customer managers to view important success indicators and statistics for their clients, and he was able to deliver the project in 30 days - an impressive feat. I asked Jason whether he was nervous about using a new software product. He replied, "I found something flexible and it fit into the paradigm I was working with. Gaute and his company share my belief that the meaningful integration and proper utilization of numerous smaller and distributed data sets is a problem currently not adequately addressed by existing products."

Gaute added some perspective: "The real challenge for most organizations is not only managing the various distributed data sources, but also enabling the productive presentation of those data sets to the relevant stakeholders across the enterprise. Finding ways to do cost-effective data aggregation and presentation in an ever-changing environment is a very challenging thing - it's what we set out to do with the Flux platform."

When asked about the future of data at Alchemy, Jason declined to mention any specifics but hinted at interesting things to come: "We've created a neutral and adaptable layer with Flux. It's designed for flexibility and scalability - we can take it as far as we want to."

More Stories By Hollis Tibbetts

Hollis Tibbetts, or @SoftwareHollis as his 50,000+ followers know him on Twitter, is listed on various "top 100 expert lists" for a variety of topics, ranging from Cloud to Technology Marketing. By day, Hollis is Evangelist & Software Technology Director at Dell Software. By night and on weekends, he is a commentator, speaker and all-round communicator about Software, Data and Cloud in their myriad aspects. You can also reach Hollis on LinkedIn at linkedin.com/in/SoftwareHollis. His latest online venture is OnlineBackupNews, a free reference site to help organizations protect their data, applications and systems from threats. Every year, IT downtime costs $26.5 billion in lost revenue. Even with such high costs, 56% of enterprises in North America and 30% in Europe don't have a good disaster recovery plan. Online Backup News aims to make sure you have the news and tips needed to keep your IT costs down and your information safe by providing best practices, technology insights, strategies, real-world examples and various tips and techniques from a variety of industry experts.

Hollis is a regularly featured blogger at ebizQ, a venue focused on enterprise technologies, with over 100,000 subscribers. He is also an author on Social Media Today "The World's Best Thinkers on Social Media", and maintains a blog focused on protecting data: Online Backup News.
He tweets actively as @SoftwareHollis

Additional information is available at HollisTibbetts.com

All opinions expressed in the author's articles are his own personal opinions vs. those of his employer.
