02
Sat, Nov
2 New Articles

Commercial Software Can Help You Get Started with AI Now

Typography
  • Smaller Small Medium Big Bigger
  • Default Helvetica Segoe Georgia Times

If AI software sounds like something that could help your enterprise, a good way to start is to perform analysis on a single business function and expand from there. Solutions exist that can handle a small analysis project and be useful if commitment to AI expands later.

In some ways, artificial intelligence (AI) still seems like a science-fiction concept, but it's not. AI packages exist today that can help enterprises do a better job of spotting problems, analyzing business conditions to figure out better strategies going forward, and automating predictive processes to provide better outcomes than simply relying on, for example, some CEO's "gut instinct." It may be helpful to those thinking about dipping a toe into this new technology to take a brief look at some of the major AI software packages already available.

The foundation for nearly all AI packages is machine learning (ML), a procedure by which a computer system is fed datasets and is trained to process them. The goal of ML is to teach computer systems how to learn and improve on their own, without a human having to explicitly program all the learning steps. Instead, a computer learns by experience.

The Major Types of Machine Learning

There are three major types of ML: supervised, unsupervised, and semi-supervised. Most ML systems can use all three, as specified by a human user.

Supervised ML starts with a known dataset that the system is trained to analyze via a model. The system uses an inferred function to predict output values of the particular dataset it's analyzing at the time. Those projected values are compared to the actual expected values, after which the system modifies the model it will use to analyze the next dataset. After numerous iterations, the system learns to modify its learning model to become more and more accurate until the model is trusted enough to analyze fresh data.

In unsupervised ML, the system is given a large amount of "unlabeled" data with no particular outcome already known. Labeled data includes some kind of embedded descriptive information (for example, data specified as "medical records") while unlabeled data might, for instance, simply be a series of x-ray images. By iterative methods similar to supervised learning, the system learns to draw inferences that let it predict hidden structures in uncategorized data.

Semi-supervised is a blend of the two other methods. Most often, inputs are a limited amount of labeled data and a large amount of unlabeled data. As such, this method tries to mimic human learning, in which children are given some information by parents and teachers and then go on to draw inferences from the large amount of unstructured data they encounter as a result of normal experiences.

Commercial AI Software Products

What follows is a brief overview of some of the major commercially available packages for data analysis and other functions that can be used standalone or incorporated into more-specific AI apps. A few term definitions will ease your way through the descriptions.

"Structured" data contains numbers and dates while "unstructured" data refers to textual sources. "Data transformation" refers to moving data from one format to a different supported format. "Predictive analytics" is the process of using historical data to forecast possible future outcomes. "Streaming data" is collecting data coming in concurrently from a large variety of sources. "Data engineering" is the practical process of collecting and validating datasets. "Data cleansing" is the process of preparing data for analysis by removing data from a dataset that may be incorrect, incomplete, corrupted, or otherwise possibly invalid.

A "data lake" refers to a single, large repository of data at any scale, and without a requirement of having to move all the data to a structured format before analysis can take place. In effect, data lakes can draw information from social media, Internet-connected devices, log files, and click streams in addition to structured databases. This makes data lakes potentially more flexible than data warehouses, which primarily pull information from business applications and transaction systems.

None of the descriptions should be interpreted as a complete view of each product's capabilities, nor any sort of relative ranking of each product's value.

Alteryx lets users blend together multiple sources to form a data-analysis platform. The algorithm is PC-based and runs on Windows 7 or higher to validate "the health, quality and statistical distribution" of data and is supported by an online community site. Experienced users can employ the data to build advanced analytics data models by using more than 50 prebuilt tools that function without coding, or by using self-written scripts in R and Python. Analysis can be augmented by geospatial intelligence to build and envision location-based calculations. There's an extensive tool set to build reports in a wide array of document formats (e.g., PDF, HTML, DOCX, XLSX) and deliver data directly to visualization formats like Microsoft Power BI, Tableau, or Qlik. Alteryx also enables natural-language inputs in English, French, German, Japanese, Portuguese, and Spanish.

Google Cloud AI Platform is a means of carrying out ML. Once this learning program is implemented, it's referred to as a "trained model." Trained models help users identify the validity of the data used to build them. The Cloud AI Platform helps users train, evaluate, tune, deploy, manage, monitor, and extract predictions from resulting data models. Via a "Cloud Console" feature, the Cloud AI Platform provides a UI for controlling ML functions and operations, making predictions from data, and issuing other commands to the system. The product has APIs for interfacing with programs written in Python.

H2O is an open-source ML platform that also provides other AI-related services, such as search and visualization tools for ad hoc data analysis, automatic modeling, report and dashboard generation, and the ability for users to construct their own AI apps that use augmented datasets. H2O is geared primarily for the financial, insurance, healthcare, marketing, manufacturing, and telecom industries. It includes modules that are currently being used for COVID-19 research.

IBM Decision Optimization Center is a central location to learn about IBM's family of products that help users build mathematical optimization models of business situations in order to make better decisions more quickly. Optimization models display the most important characteristics of problems users might be trying to solve by looking at the objective function of a business decision, variables that might affect that decision, and business constraints. The IBM ILOG CPLEX Optimization Studio provides a built-in Optimization Programming Language (OPL) (or alternatively works with other programs built in C, C++, C# APIs, Java, or Python) to build and deploy models that help identify the best actions in given business situations. IBM Decision Optimization for Watson Studio includes features such as a modeling assistant and visual dashboards to facilitate model building and what-if analysis tools to sort out results of multiple scenarios. IBM Watson Studio Premium for IBM Cloud Pak for Data is a combination of IBM products that help enterprises predict business outcomes, partly by submitting constructed models to the Apache Hadoop Engine for further analysis. The IBM Decision Optimization in IBM Watson Machine Learning helps users build and deploy optimization models in cloud environments.

IBM SPSS Modeler runs on PCs in either a client/server configuration or a standalone desktop. It mines data and analyzes text sources for information, as well as provides predictive analytics capabilities. The front end runs on PCs and the back end runs on servers using UNIX variants, Linux, or Windows. The modeler analyzes structured and unstructured data from sources such as files, survey data, operational databases, the IBM Cognos 8 Business Intelligence framework, and flat files such as IBM SPSS Statistics, SAS, and MS Excel files. Users can access predictive, data-transformative, testing, and reporting characteristics from the same interface. Professional edition includes tools for analyzing existing data. Premium edition adds a text-mining feature for retrieving concepts, relationships, and sentiments from text data, as well as converting unstructured data to a structured format.

IBM Watson Studio is available via the cloud and automates many data-preparation tasks, enables preparation of predictive models with a mix of visual tools, draws from most common data sources (e.g., spreadsheets, flat files, relational databases), and enables data display by facilitating data export into presentations that can use dozens of prebuilt chart types. Watson Studio also integrates with the IBM SPSS Modeler to access that product's features.

Infosys Nia is an open-source platform that absorbs information about business processes and represents them in a summarizing structure. Available via public, private, or hybrid clouds, Nia works with browsers such as IE V7-11, Firefox, Chrome, Safari V9 or better, and Microsoft Edge. Nia has a discovery subsystem that can summarize what information is available from enterprise databases, as well as a learning subsystem that can take inputs about new apps from potential users via natural language documents (such as memos), as well as machine-learning that can assimilate information from other data sources. Other subsystems provide runtime capabilities, self-healing responses to problems, and an automation platform for predictive automation (building models of automation processes), cognitive automation (building processes that mimic human behavior), and robotic process automation (programming robotic machinery functions).

MathWorks MATLAB "combines a desktop environment tuned for iterative analysis and design processes with a programming language that expresses matrix and array mathematics directly," according to Mathorks. The product lets developers, for example, run different sample algorithms against available data to see what results they produce. The Live Editor function generates scripts that can blend code, output, and formatted text. A Data Analysis function can compile, cleanse, and analyze multiple datasets; includes prebuilt widgets for signal processing, machine learning, and statistical analysis; and generates sharable analysis reports. There's also a drag-and-drop App Builder for generating GUIs and specifying app behavior, a Plot Gallery with dozens of standard and customizable means of graphing results, and APIs that let MATLAB be called by other apps written in C, C++, Fortran, Java, Python, and apps using some COM components, such as Visual C# .NET and Visual Basic .NET.

Qubole works with data lakes. It's cloud-based and provides end-to-end services that help users with ML, ad hoc data analytics, and streaming data. Data-management tools help users manage metadata and infrastructure, reveal statistics and data dependencies, and automate control of clustered resources, all across cloud environments. The platform also offers automated continuous data engineering.

RapidMiner, from the company of the same name, is a platform that aims at helping both data scientists and less-technical corporate end users. It's an automated data-science platform that provides functions such as data analysis, data cleansing and transformation, model deployment and optimization, prebuilt use templates, and tutorials for the inexperienced user. RapidMiner integrates with custom code written in Python or R, supports any third-party ML libraries, and includes more than 1500 scripts that provide individual data-science and data-preparation functions.

Symphony AyasdiAI is an application framework designed primarily for financial applications and uses an engine called Topological Data Analysis (TDA), which is based on the mathematical concept of topology. Topology is the idea that data has an underlying shape that gives the data meaning. TDA adapts this method to analyze highly complex data. Ayasdi builds compressed diagrams of data points to display important patterns as a way of showing users geometric relationships that may exist between data points. It combines with other machine-learning algorithms to find patterns in data to generate insights for users. Ayasdi is currently used in situations such as combatting money laundering and bank fraud, promoting healthcare institution cost reductions and reducing health insurance claim refusals, and improving general performance of other software applications.

TensorFlow is an open-source library of ML and neural networking algorithms that functions as an end-to-end platform for developing and training ML models. It provides tools for managing ML environments and APIs for defining and training ML models, as well as using the data to make predictions. TensorFlow uses Python to help users build front-ends for analytical apps and builds the apps themselves in C++. Its apps can run on local PCs, iOS and Android devices, a public cloud, and other CPUs. By handling algorithm deployment and implementation, as well as connecting outputs to the next function looking for an input, TensorFlow frees developers to concentrate on an application's overall design instead.

Wipro HOLMES is a proprietary AI and ML platform that helps accelerate existing business processes by automating them. Its name is a reference to IBM's Watson, but it's an independent platform that can run above existing business applications to coordinate business functions such as finance, human resources, legal, marketing, operations, procurement, and regulatory compliance. It contains separate modules for use by COOs, CFOs, CLOs, and procurement officers.

There's No One-Size-Fits-All AI Product

Like any kind of software, none of these alternatives is likely to be perfect for your situation. All of them offer something but many are specialized for certain kinds of businesses. Adding to the potential confusion is that your enterprise may not know today what kind of AI would best suit its particular profile and ways of doing business. As is true in ML, an enterprise's learning curve about AI will only get to a good outcome via experience. That experience will start with a first step in some direction, and any of the products above will help you begin that learning process.

John Ghrist

John Ghrist has been a journalist, programmer, and systems manager in the computer industry since 1982. He has covered the market for IBM i servers and their predecessor platforms for more than a quarter century and has attended more than 25 COMMON conferences. A former editor-in-chief with Defense Computing and a senior editor with SystemiNEWS, John has written and edited hundreds of articles and blogs for more than a dozen print and electronic publications. You can reach him at This email address is being protected from spambots. You need JavaScript enabled to view it..

BLOG COMMENTS POWERED BY DISQUS

LATEST COMMENTS

Support MC Press Online

$

Book Reviews

Resource Center

  • SB Profound WC 5536 Have you been wondering about Node.js? Our free Node.js Webinar Series takes you from total beginner to creating a fully-functional IBM i Node.js business application. You can find Part 1 here. In Part 2 of our free Node.js Webinar Series, Brian May teaches you the different tooling options available for writing code, debugging, and using Git for version control. Brian will briefly discuss the different tools available, and demonstrate his preferred setup for Node development on IBM i or any platform. Attend this webinar to learn:

  • SB Profound WP 5539More than ever, there is a demand for IT to deliver innovation. Your IBM i has been an essential part of your business operations for years. However, your organization may struggle to maintain the current system and implement new projects. The thousands of customers we've worked with and surveyed state that expectations regarding the digital footprint and vision of the company are not aligned with the current IT environment.

  • SB HelpSystems ROBOT Generic IBM announced the E1080 servers using the latest Power10 processor in September 2021. The most powerful processor from IBM to date, Power10 is designed to handle the demands of doing business in today’s high-tech atmosphere, including running cloud applications, supporting big data, and managing AI workloads. But what does Power10 mean for your data center? In this recorded webinar, IBMers Dan Sundt and Dylan Boday join IBM Power Champion Tom Huntington for a discussion on why Power10 technology is the right strategic investment if you run IBM i, AIX, or Linux. In this action-packed hour, Tom will share trends from the IBM i and AIX user communities while Dan and Dylan dive into the tech specs for key hardware, including:

  • Magic MarkTRY the one package that solves all your document design and printing challenges on all your platforms. Produce bar code labels, electronic forms, ad hoc reports, and RFID tags – without programming! MarkMagic is the only document design and print solution that combines report writing, WYSIWYG label and forms design, and conditional printing in one integrated product. Make sure your data survives when catastrophe hits. Request your trial now!  Request Now.

  • SB HelpSystems ROBOT GenericForms of ransomware has been around for over 30 years, and with more and more organizations suffering attacks each year, it continues to endure. What has made ransomware such a durable threat and what is the best way to combat it? In order to prevent ransomware, organizations must first understand how it works.

  • SB HelpSystems ROBOT GenericIT security is a top priority for businesses around the world, but most IBM i pros don’t know where to begin—and most cybersecurity experts don’t know IBM i. In this session, Robin Tatam explores the business impact of lax IBM i security, the top vulnerabilities putting IBM i at risk, and the steps you can take to protect your organization. If you’re looking to avoid unexpected downtime or corrupted data, you don’t want to miss this session.

  • SB HelpSystems ROBOT GenericCan you trust all of your users all of the time? A typical end user receives 16 malicious emails each month, but only 17 percent of these phishing campaigns are reported to IT. Once an attack is underway, most organizations won’t discover the breach until six months later. A staggering amount of damage can occur in that time. Despite these risks, 93 percent of organizations are leaving their IBM i systems vulnerable to cybercrime. In this on-demand webinar, IBM i security experts Robin Tatam and Sandi Moore will reveal:

  • FORTRA Disaster protection is vital to every business. Yet, it often consists of patched together procedures that are prone to error. From automatic backups to data encryption to media management, Robot automates the routine (yet often complex) tasks of iSeries backup and recovery, saving you time and money and making the process safer and more reliable. Automate your backups with the Robot Backup and Recovery Solution. Key features include:

  • FORTRAManaging messages on your IBM i can be more than a full-time job if you have to do it manually. Messages need a response and resources must be monitored—often over multiple systems and across platforms. How can you be sure you won’t miss important system events? Automate your message center with the Robot Message Management Solution. Key features include:

  • FORTRAThe thought of printing, distributing, and storing iSeries reports manually may reduce you to tears. Paper and labor costs associated with report generation can spiral out of control. Mountains of paper threaten to swamp your files. Robot automates report bursting, distribution, bundling, and archiving, and offers secure, selective online report viewing. Manage your reports with the Robot Report Management Solution. Key features include:

  • FORTRAFor over 30 years, Robot has been a leader in systems management for IBM i. With batch job creation and scheduling at its core, the Robot Job Scheduling Solution reduces the opportunity for human error and helps you maintain service levels, automating even the biggest, most complex runbooks. Manage your job schedule with the Robot Job Scheduling Solution. Key features include:

  • LANSA Business users want new applications now. Market and regulatory pressures require faster application updates and delivery into production. Your IBM i developers may be approaching retirement, and you see no sure way to fill their positions with experienced developers. In addition, you may be caught between maintaining your existing applications and the uncertainty of moving to something new.

  • LANSAWhen it comes to creating your business applications, there are hundreds of coding platforms and programming languages to choose from. These options range from very complex traditional programming languages to Low-Code platforms where sometimes no traditional coding experience is needed. Download our whitepaper, The Power of Writing Code in a Low-Code Solution, and:

  • LANSASupply Chain is becoming increasingly complex and unpredictable. From raw materials for manufacturing to food supply chains, the journey from source to production to delivery to consumers is marred with inefficiencies, manual processes, shortages, recalls, counterfeits, and scandals. In this webinar, we discuss how:

  • The MC Resource Centers bring you the widest selection of white papers, trial software, and on-demand webcasts for you to choose from. >> Review the list of White Papers, Trial Software or On-Demand Webcast at the MC Press Resource Center. >> Add the items to yru Cart and complet he checkout process and submit

  • Profound Logic Have you been wondering about Node.js? Our free Node.js Webinar Series takes you from total beginner to creating a fully-functional IBM i Node.js business application.

  • SB Profound WC 5536Join us for this hour-long webcast that will explore:

  • Fortra IT managers hoping to find new IBM i talent are discovering that the pool of experienced RPG programmers and operators or administrators with intimate knowledge of the operating system and the applications that run on it is small. This begs the question: How will you manage the platform that supports such a big part of your business? This guide offers strategies and software suggestions to help you plan IT staffing and resources and smooth the transition after your AS/400 talent retires. Read on to learn: