Welcome!

Industrial IoT Authors: JP Morgenthal, Derek Weeks, Liz McMillan, Elizabeth White, William Schmarzo

Related Topics: Industrial IoT, @BigDataExpo

Industrial IoT: Article

To Heck with 'Big Data,' 'Little Data' Is the Problem Most Face | @BigDataExpo

'Big Data' gets the press, but 'little data' is the big problem

To Heck with “Big Data”, “Little Data” is the Problem Most Face

"Big data" gets all the press - but for the vast majority of people who work with data, it's the proliferation of "little data" that impacts us the most. What do I mean by little data?  I'm referring to the proliferation of various SaaS and Cloud-based applications, on-premises applications, databases, spreadsheets, log files, data files and so forth. Many organizations are plagued with multiple instances of the same applications or multiple applications from different vendors that do essentially the same thing.    These are the applications and data that run today's enterprise - and they're a mess.

A week doesn't go by without some major vendor doing a press release that discusses unlocking the value in the mountains of structured and unstructured data that companies love to accumulate. For most of us, though, it's not getting all that value out of the Petabytes that cause us heartburn - it's getting answers out of the megabytes or gigabytes that are distributed across handfuls, dozens or even hundreds of unintegrated systems, applications and data sources.

As I mentioned recently on ebizQ's Integration Edge, the average enterprise uses at least 397 Cloud/SaaS applications in addition to all of the on-premises applications in play.  Add to that the various data stores (for example SQL Databases), and it's not unrealistic to say that a typical enterprise has around 1,000 different data-related systems of one sort or another.  Apart from the concerns for security, compliance and backup/recovery, one obvious question should stand out: how can I "get value" out of all that data - data locked up in all those different locations and different formats.

Traditionally these types of problems were solved with DBAs, programmers and business analysts (with liberal amounts of "money" and "time" tossed in).  This approach works.  It's time-tested.  It's also expensive, and not particularly scalable or flexible.  Sometimes it can take years to actually get a working solution.

Not every organization has the luxury of taking the "traditional approach" to solving the little data problem.  With the pressure to deliver results faster, better and less expensively, some businesses have found new and innovative ways to get the value they need from this sea of disparate data.

For most of us, a few systems - perhaps a dozen constitutes this disparate data mess.  But what would it be like to have to make sense out of thousands of different data sources with different data semantics?  And to know that "if things go well", there might be twice as many in a year or two?  I recently met up with such a person - and it's quite a compelling story.

This past week, I was fortunate enough to have coffee and "talk about data issues" for a few hours with Jason Haskins - a Data Architect at Alchemy Systems, a rapidly growing international company that delivers innovative technologies and services for the global food industry that increase productivity, ensure regulatory compliance, foster safe working environments, and produce quality products.  In short, they help ensure the safety and quality of our food supply chain.

I've met quite a few data architects in my day, but Jason actually is an architect - he has 2 architecture degrees - including graduate work at Columbia University and a Masters from the University of Texas at Austin.  Talking data architecture with an architect (in the A.I.A-sense of the word) - that's a new one for me.  What made it compelling was the way that Jason drew parallels between the challenges of Architecture (especially workplace-design architecture) and data - and how his training as a design architect taught him to examine highly complex systems at the systemic level, with usability, flexibility and scalability always at the top of the stack.

Jason's rather unusual background proved valuable when he inherited a wildly disparate and rapidly growing data infrastructure featuring the dreaded "A times B times C times D" problem - 500 clients with 2,000 installations.  Further complicating the situation - many of these installations require customized Alchemy Systems solutions and data models to support multiple product lines, multiple market segments and geographies that can span multiple countries and many regulatory environments.

Because of this rapid growth, Alchemy Systems found itself in a situation where it was simply unable to get value from all of this disparate data.  This was not a "big data" problem - it was a deluge of little data.

Jason HaskinsJason's architectural training led him to propose a "meta" layer above the Alchemy Systems applications and data - something that would not interfere with existing data models, would meet the needs of Alchemy's customer success managers and would provide the kind of flexibility and scalability to support Alchemy's rapid growth - essentially a software bridge between the Customer, Alchemy's development team and the customer-success group at Alchemy.  One of Jason's key design requirements was to provide a level of abstraction across the different data sets, yet no loss of resolution - a rather challenging goal as "abstraction" and "resolution" are often at odds with each other.

There are commercially available products out there that Jason might have turned to in order to solve the data mapping problem that he faced.  But Jason's belief was that "usability" extended far beyond just a data mapping layer - he wanted to deliver an integrated solution that united the customer data and also provided data visualization capabilities to the customer managers at Alchemy.  And it was his belief that Alchemy would be best served by a product which would do the data management and the data visualization all in one single product stack

What Jason found was that there are some products out there that integrate and unite data very well.  And there are other products out there that do data visualization very well.  But finding a single product that did both of those well turned out to be a challenge.

At South by Southwest Interactive, after attending a session on data visualization and integration, Jason got involved in a discussion with Gaute Solaas - an Austin-based technologist/CEO who´s company IQumulus was developing a Cloud-based data management technology called Flux.

In a conversation with Gaute, he reflected on his interactions with Jason, "the more I spoke with Jason about the problems he was facing, the more I realized that our new product needed to solve the data visualization problem as well as the data management problem in a single Cloud-based product that provides a business intelligence solution for large quantities and varieties of small data.  So we worked with Alchemy Systems on the product requirements and quickly delivered the enhancements they needed for a pilot project."

Jason drew up plans for an Account Management Dashboard pilot project using Flux that would allow customer managers to view various important success indicators and statistics for their clients and was able to deliver the project in 30 days, an impressive feat.  I asked Jason a top-of-mind question - asking him if he was nervous using a new software product, to which he replied, "I found something flexible and it fit into the paradigm I was working with. Gaute and his company share my belief  that the meaningful integration and proper utilization of numerous smaller and distributed data sets is a problem currently not adequately addressed by existing products."

Gaute added some additional perspective, "the real challenge for most organizations is not only managing the various distributed data sources, but also enabling the productive presentation of those data sets to the relevant stakeholders across the enterprise.  Finding ways to do cost-effective data aggregation and presentation in an ever-changing environment is a very challenging thing - it's what we set out to do with the Flux platform".

When asked about the future of data at Alchemy, Jason declined to mention any specifics - but hinted at interesting things to come - "we've created a neutral and adaptable layer with Flux.  It's designed for flexibility and scalability - we can take it as far as we want to".

More Stories By Hollis Tibbetts

Hollis Tibbetts, or @SoftwareHollis as his 50,000+ followers know him on Twitter, is listed on various “top 100 expert lists” for a variety of topics – ranging from Cloud to Technology Marketing, Hollis is by day Evangelist & Software Technology Director at Dell Software. By night and weekends he is a commentator, speaker and all-round communicator about Software, Data and Cloud in their myriad aspects. You can also reach Hollis on LinkedIn – linkedin.com/in/SoftwareHollis. His latest online venture is OnlineBackupNews - a free reference site to help organizations protect their data, applications and systems from threats. Every year IT Downtime Costs $26.5 Billion In Lost Revenue. Even with such high costs, 56% of enterprises in North America and 30% in Europe don’t have a good disaster recovery plan. Online Backup News aims to make sure you all have the news and tips needed to keep your IT Costs down and your information safe by providing best practices, technology insights, strategies, real-world examples and various tips and techniques from a variety of industry experts.

Hollis is a regularly featured blogger at ebizQ, a venue focused on enterprise technologies, with over 100,000 subscribers. He is also an author on Social Media Today "The World's Best Thinkers on Social Media", and maintains a blog focused on protecting data: Online Backup News.
He tweets actively as @SoftwareHollis

Additional information is available at HollisTibbetts.com

All opinions expressed in the author's articles are his own personal opinions vs. those of his employer.

@ThingsExpo Stories
Web Real-Time Communication APIs have quickly revolutionized what browsers are capable of. In addition to video and audio streams, we can now bi-directionally send arbitrary data over WebRTC's PeerConnection Data Channels. With the advent of Progressive Web Apps and new hardware APIs such as WebBluetooh and WebUSB, we can finally enable users to stitch together the Internet of Things directly from their browsers while communicating privately and securely in a decentralized way.
"Matrix is an ambitious open standard and implementation that's set up to break down the fragmentation problems that exist in IP messaging and VoIP communication," explained John Woolf, Technical Evangelist at Matrix, in this SYS-CON.tv interview at @ThingsExpo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
"A lot of times people will come to us and have a very diverse set of requirements or very customized need and we'll help them to implement it in a fashion that you can't just buy off of the shelf," explained Nick Rose, CTO of Enzu, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
Web Real-Time Communication APIs have quickly revolutionized what browsers are capable of. In addition to video and audio streams, we can now bi-directionally send arbitrary data over WebRTC's PeerConnection Data Channels. With the advent of Progressive Web Apps and new hardware APIs such as WebBluetooh and WebUSB, we can finally enable users to stitch together the Internet of Things directly from their browsers while communicating privately and securely in a decentralized way.
Who are you? How do you introduce yourself? Do you use a name, or do you greet a friend by the last four digits of his social security number? Assuming you don’t, why are we content to associate our identity with 10 random digits assigned by our phone company? Identity is an issue that affects everyone, but as individuals we don’t spend a lot of time thinking about it. In his session at @ThingsExpo, Ben Klang, Founder & President of Mojo Lingo, discussed the impact of technology on identity. Sho...
"Operations is sort of the maturation of cloud utilization and the move to the cloud," explained Steve Anderson, Product Manager for BMC’s Cloud Lifecycle Management, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
"I think that everyone recognizes that for IoT to really realize its full potential and value that it is about creating ecosystems and marketplaces and that no single vendor is able to support what is required," explained Esmeralda Swartz, VP, Marketing Enterprise and Cloud at Ericsson, in this SYS-CON.tv interview at @ThingsExpo, held June 7-9, 2016, at the Javits Center in New York City, NY.
The buzz continues for cloud, data analytics and the Internet of Things (IoT) and their collective impact across all industries. But a new conversation is emerging - how do companies use industry disruption and technology enablers to lead in markets undergoing change, uncertainty and ambiguity? Organizations of all sizes need to evolve and transform, often under massive pressure, as industry lines blur and merge and traditional business models are assaulted and turned upside down. In this new da...
Bert Loomis was a visionary. This general session will highlight how Bert Loomis and people like him inspire us to build great things with small inventions. In their general session at 19th Cloud Expo, Harold Hannon, Architect at IBM Bluemix, and Michael O'Neill, Strategic Business Development at Nvidia, discussed the accelerating pace of AI development and how IBM Cloud and NVIDIA are partnering to bring AI capabilities to "every day," on-demand. They also reviewed two "free infrastructure" pr...
With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo 2016 in New York. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be! Internet of @ThingsExpo, taking place June 6-8, 2017, at the Javits Center in New York City, New York, is co-located with 20th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry p...
It is one thing to build single industrial IoT applications, but what will it take to build the Smart Cities and truly society changing applications of the future? The technology won’t be the problem, it will be the number of parties that need to work together and be aligned in their motivation to succeed. In his Day 2 Keynote at @ThingsExpo, Henrik Kenani Dahlgren, Portfolio Marketing Manager at Ericsson, discussed how to plan to cooperate, partner, and form lasting all-star teams to change the...
SYS-CON Events announced today that delaPlex will exhibit at SYS-CON's @CloudExpo, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. delaPlex pioneered Software Development as a Service (SDaaS), which provides scalable resources to build, test, and deploy software. It’s a fast and more reliable way to develop a new product or expand your in-house team.
SYS-CON Events announced today that IoT Now has been named “Media Sponsor” of SYS-CON's 20th International Cloud Expo, which will take place on June 6–8, 2017, at the Javits Center in New York City, NY. IoT Now explores the evolving opportunities and challenges facing CSPs, and it passes on some lessons learned from those who have taken the first steps in next-gen IoT services.
SYS-CON Events announced today that WineSOFT will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Based in Seoul and Irvine, WineSOFT is an innovative software house focusing on internet infrastructure solutions. The venture started as a bootstrap start-up in 2010 by focusing on making the internet faster and more powerful. WineSOFT’s knowledge is based on the expertise of TCP/IP, VPN, SSL, peer-to-peer, mob...
The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
The Internet of Things can drive efficiency for airlines and airports. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Sudip Majumder, senior director of development at Oracle, discussed the technical details of the connected airline baggage and related social media solutions. These IoT applications will enhance travelers' journey experience and drive efficiency for the airlines and the airports.
With billions of sensors deployed worldwide, the amount of machine-generated data will soon exceed what our networks can handle. But consumers and businesses will expect seamless experiences and real-time responsiveness. What does this mean for IoT devices and the infrastructure that supports them? More of the data will need to be handled at - or closer to - the devices themselves.
SYS-CON Events announced today that Dataloop.IO, an innovator in cloud IT-monitoring whose products help organizations save time and money, has been named “Bronze Sponsor” of SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Dataloop.IO is an emerging software company on the cutting edge of major IT-infrastructure trends including cloud computing and microservices. The company, founded in the UK but now based in San Fran...
Big Data, cloud, analytics, contextual information, wearable tech, sensors, mobility, and WebRTC: together, these advances have created a perfect storm of technologies that are disrupting and transforming classic communications models and ecosystems. In his session at @ThingsExpo, Erik Perotti, Senior Manager of New Ventures on Plantronics’ Innovation team, provided an overview of this technological shift, including associated business and consumer communications impacts, and opportunities it m...
In his keynote at @ThingsExpo, Chris Matthieu, Director of IoT Engineering at Citrix and co-founder and CTO of Octoblu, focused on building an IoT platform and company. He provided a behind-the-scenes look at Octoblu’s platform, business, and pivots along the way (including the Citrix acquisition of Octoblu).