Obecne kazda aplikace pro nakladani s daty (ETL - extract transform load, business inteligenci, master data management, …) se sklada z mnoha komponenet, pricemz jedna z komponent je:
aplikacni server a aplikace v nem, ktera zajistuje webovy management
a dalsi komponenty, ktere provadi primo manipulaci s daty.
Big Data and Analytics University na learning2.atlanta.ibm.com
IBM BigData & Analytics Hub
<dl> <dd>Infosphere Datastage (Infosphere Information Server)</dd> <dd>Oracle Warehouse Builder</dd> <dd>Oracle Data Integrator</dd> <dd>Informatica PowerCenter Express - https://community.informatica.com/solutions/pcexpress</dd> <dd>Informatica PowerCenter - http://etl-tools.info/informatica/components.html http://etl-tools.info/informatica/tutorial.html</dd> <dd>TalenD ETL - OpenSource ETL</dd> </dl>
<dl> <dd>IBM Cognos BI - Business Intelligence - Reporty, analyzy dashboardy a skorovani z dat</dd> <dd>IBM Cognos TM1</dd> <dd>IBM Cognos Insight - Analyzy a vizualizace a sdileni dat</dd> <dd>IBM Cognos Express</dd> <dd>IBM SPSS</dd> <dd>IBM SPPS Modeler - prediktivni analyza</dd> <dd>IBM SPPS Data Collection - </dd> <dd>IBM SPSS Statistics - Statistiky na porozumeni dat a trendu</dd> <dd>IBM SPSS Collaboration and Deployment Services - automatizace analytickeho procesu a bezpecne sdileni</dd> <dd>IBM Predictive Maintenance and Quality</dd> <dd>IBM Algorithmics software - financial analytics tool for decisions based on risk management</dd> <dd>IBM OpenPages - tool for risk and compliance management</dd> <dd>DB2 Query Management Facility</dd> <dd>Microstrategy</dd> <dd>Pentaho</dd> <dd>Oracle BI</dd> <dd>Oracle Business Intelligence Discoverer (Oracle Discoverer)</dd> <dd>Oracle Hyperion … Business Intelligence - rozhodavaci software a mereni vykonu podniku</dd> </dl>
<dl> <dd>InfoSphere MDM CS - Master Data Management Colaboration Server (Colaboration Edition), Drive WebSphere Product Center nebo PIM</dd> <dd>Physical MDM - drive Transactional MDM, WebSphere Customer Center - <!–a href=“wcc.html”–>WCC<!–/a–>, MDM Server</dd> <dd>Virtual MDM - drive MDM Hub, nebo MDS - Master Data Service</dd> <dd>informaticamdm.htmlInformatica MDM</dd> </dl>
<dl> <dd>IBM Watson Explorer - IBM InfoSphere Data Explorer … Aplikace na prochazeni dat - objevuje, spojuje, nabizi a umoznuje prohledavani dat</dd> <dd>ELK (Elastic stack) - Log management platforma - prijimani logu a eventu, hledani v logach, analyza logu (bezpecnost, perfomance, ..). Vice na qbox</dd> <dd>Elasticsearch … vyhledavaci engine postaveny na Apache Lucene</dd> <dd>Kibana … vizualizacni plugin do Elasticsearch</dd> <dd>Logstash … Nastroj na managovani udalosti a logu</dd> <dd>Apache Lucene … java knihovna na ziskavani informaci</dd> <dd>Apache Soir … Vyhledavaci engine postaveni na Apache Lucene</dd> <dd>Apache Kafka … stream processing platforma (streamy jako logy, audio, video, …) - pekny tutorial</dd> <dd>Apache Spark … cluster framework pouzitelny napr. s Kafka. Komponenty - Spark SQL, Spark streaming, GraphX, MLib Machine Learning</dd> </dl>
<dd>IBM InfoSphere Optim Grow … Aplikace na spravu politiky rustu dat - presouvani dat na levnejsi media, zalohovani, mazani atd.</dd>
<dd>Infosphere Guardium Data Encryption - GDE … Nastroj na enkrypci a zpristupneni dat na zaklade politik</dd>
<dl> <dd>Netezza</dd> <dd>Teradata Warehouse - datovy sklad pro strukturovana data v zadu peta bajtu</dd> <dd>Hadoop … clusterovy filesystem - spojovani datovych ulozist na tisicovkach nodu - datove uloziste na nestrukturovana data</dd> <dd>FPA … Portable Format for Analytics … metoda k vyuziti hadoop a jinych nastroju na analyzu dat</dd> <dd>Aster DB - discovery platvorm - reseni na polostrukturovana data - ukladani, transformace nestrukturovanycn na polostrukturovana</dd> <dd>Hortonworks - Nahravani nestrukturovany dat pomoci Aster do datoveho skladu</dd> </dl>