====Architecture====
In general, every data-handling application (ETL - extract, transform, load; business intelligence; master data management; ...) consists of many components. One of them is an [[../web/|application server]] hosting an application that provides web-based management; the other components manipulate the data directly.
====Links====
[[http://learning2.atlanta.ibm.com/qpg/university.nsf/analytics/BDA-Lobby|Big Data and Analytics University]] at learning2.atlanta.ibm.com\\
[[http://www.ibmbigdatahub.com/|IBM Big Data & Analytics Hub]]\\
====ETL tools (Extract, Transform, Load)====
[[datastage.html|InfoSphere DataStage]] (InfoSphere Information Server)
[[owp.html|Oracle Warehouse Builder]]
[[odi.html|Oracle Data Integrator]]
Informatica PowerCenter Express - https://community.informatica.com/solutions/pcexpress
[[informatica.html|Informatica PowerCenter]] - http://etl-tools.info/informatica/components.html http://etl-tools.info/informatica/tutorial.html
[[https://www.talend.com/products/talend-open-studio/|Talend ETL]] - open-source ETL
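The three ETL phases that the tools above implement can be illustrated with a minimal sketch. It uses only the Python standard library and is not tied to any of the listed products; the sample data, the ''sales'' table, and the function names are all assumptions made for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical source data standing in for an export from an operational system.
RAW_CSV = """id,name,amount
1, Alice ,10.50
2,Bob,n/a
3, Carol ,7.25
"""


def extract(text):
    """Extract: read raw rows from the CSV source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim names, parse amounts, skip rows whose amount is not a number."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # unparseable amount, drop the row
        clean.append((int(row["id"]), row["name"].strip(), amount))
    return clean


def load(rows, conn):
    """Load: write the cleaned rows into the target table (hypothetical schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (id INTEGER PRIMARY KEY, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(text=RAW_CSV):
    """Run extract -> transform -> load against an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    load(transform(extract(text)), conn)
    return conn
```

Calling ''run_etl()'' returns a connection whose ''sales'' table holds only the rows that survived the transform step. Real ETL tools add scheduling, parallelism, metadata, and error handling on top of this basic flow.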
====Continuous data ingestion - processing====
[[steams.html|IBM Streams]]\\
====Business Intelligence, WCC, MDM Server====
Virtual MDM - formerly MDM Hub, or MDS (Master Data Service)
[[informaticamdm.html|Informatica MDM]]
====Data Explorers====
[[dataexplorer.html|IBM Watson Explorer]] - IBM InfoSphere Data Explorer ... an application for browsing data - it discovers, connects, and presents data and makes it searchable
[[https://www.quora.com/What-is-the-ELK-stack|ELK (Elastic stack)]] - log management platform - ingestion of logs and events, log search, log analysis (security, performance, ...). More at [[https://qbox.io/blog/welcome-to-the-elk-stack-elasticsearch-logstash-kibana|qbox]]
[[https://en.wikipedia.org/wiki/Elasticsearch|Elasticsearch]] ... search engine built on [[https://en.wikipedia.org/wiki/Apache_Lucene|Apache Lucene]]
[[https://en.wikipedia.org/wiki/Kibana|Kibana]] ... visualization plugin for Elasticsearch
[[https://wikitech.wikimedia.org/wiki/Logstash|Logstash]] ... tool for managing events and logs
[[https://en.wikipedia.org/wiki/Apache_Lucene|Apache Lucene]] ... Java library for information retrieval
[[https://en.wikipedia.org/wiki/Apache_Solr|Apache Solr]] ... search engine built on [[https://en.wikipedia.org/wiki/Apache_Lucene|Apache Lucene]]
[[https://en.wikipedia.org/wiki/Apache_Kafka|Apache Kafka]] ... stream processing platform (streams such as logs, audio, video, ...) - [[https://www.tutorialspoint.com/apache_kafka/apache_kafka_integration_spark.htm|a nice tutorial]]
[[https://en.wikipedia.org/wiki/Apache_Spark|Apache Spark]] ... cluster computing framework usable e.g. with Kafka. Components - Spark SQL, Spark Streaming, GraphX, MLlib (machine learning)
====Data growth management tools====
IBM InfoSphere Optim Grow ... an application for managing data growth policy - moving data to cheaper media, backup, deletion, etc.
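A data growth policy of the kind described above can be thought of as a rule that maps the age of a record to an action. The sketch below is only an illustration of that idea, not how Optim works; the thresholds, the action names, and the ''retention_action'' function are hypothetical.

```python
from datetime import date, timedelta

# Hypothetical thresholds; a real policy would come from business requirements.
MOVE_AFTER = timedelta(days=365)        # move to cheaper media after one year
DELETE_AFTER = timedelta(days=7 * 365)  # delete after roughly seven years


def retention_action(last_used, today):
    """Return the policy action for a record last used on the given date."""
    age = today - last_used
    if age >= DELETE_AFTER:
        return "delete"
    if age >= MOVE_AFTER:
        return "move_to_cheaper_media"
    return "keep"
```

For example, ''retention_action(date(2022, 6, 1), date(2024, 1, 1))'' falls between the two thresholds, so the record would be moved to cheaper media.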
====Data encryption tools====
[[gde.html|InfoSphere Guardium Data Encryption - GDE]] ... a tool for encrypting data and granting access to it based on policies
====Big Data====
Netezza
[[../db/td|Teradata Warehouse]] - data warehouse for structured data on the order of petabytes
Hadoop ... clustered file system - combines data storage across thousands of nodes - storage for unstructured data
[[http://dmg.org/pfa/docs/motivation/|PFA]] ... Portable Format for Analytics ... a method for using Hadoop and other tools for data analysis
Aster DB - discovery platform - a solution for semi-structured data - storage, transformation of unstructured data into semi-structured data
Hortonworks - loading unstructured data into the data warehouse using Aster