Introduction:

IBM Watson Explorer Analytical Components(AC) which is part of the IBM Watson Explorer Advanced Edition gives organizations the ability to access and analyze 

unstructured information that can be found both inside and outside the organization.Using sophisticated natural language algorithms, this solution can deliver

new business insight from virtually any type of content in any format including both text and data. It extracts key informaiton, then sorts filters and categorizes this

information to present the relevant content in context to the business user.

Cotent Analytics is tool for reporting statistics and for obtaining actionable insights. Actionable insights is a key concept that refers to insight into data that leads

to action. Cotent Analytics brings the power of business intelligence to all of your enterprise information,not just your structured information. The result helps you

achieve the most value from all your data, regardless of its structure.

Solution:

The biggest benefit that a content analytics solution provides is the ability to use a computer to analyze massive amounts of unstructured count in such a way

as to discover the "why" to business scenarios. Traditional business intelligence ,analyzing structured data such as volume of calls, average length of call,

amount of sales increase or decrease,is very good at describing "what" is happening. What content analytics can do is analyze the additional unstructured or

textual data associated with those events and help you find out "why".

Content Analytics delivers new business understanding and visibility from the content and context of textual information. For example, you can identify patterns,view trends and deviations over time,and reveal unusual correlations or anomalies.

Key Concepts:

Unstructured and structured content:is information that is generally recorded in a natural language as free text. The text contains all of the complexities and ambiguities of
the language that is being used. It is easily understood by a human reader but difficult to process by a computer program.

Text analytics: is a general term that refers to the automated techniques of converting textual data into structured data. A program that reads text and extracts person names is considered a text analytic.

Data mining: is the process of identifying patterns in your data that might be used to answer a business problem, question, or concern. Data mining is a natural part of discovery.
Keywords: As the term implies, keywords are usually words and phrases that are extracted from textual content. However, they can also be obtained from structured fields such as date or numeric fields.
14 IBM Watson Content Analytics: Discovering Actionable Insight from Your Content of discovery

Collections:A single content analytics or enterprise search collection represents the entire group of documents that are available to an application for search and analysis.

Facets:represent the different aspects or dimensions of your document corpus.They are a crucial mechanism for navigating and analyzing your content with the content analytics miner.

Frequency:Frequency counts in Content Analytics represent the total number of documents that contribute to a particular keyword.

Correlation:is a measure of how strongly a facet value is related (correlated) to the current query or selection criteria. In a facet pair, it indicates how two facets are correlated to each other. It is used to better gauge the relevance of a particular keyword as it compares to other data in your document corpus.

Deviation:Deviation measures the average change in a facet over time. It is a weighted,moving average.

最新文章

  1. C# 编程实现非自相交多边形质心
  2. Spring(4)
  3. h4和h5的区别
  4. 百度推出的echarts,制表折线图柱状图饼图等的超级工具(转)
  5. 事件委托&jQuery on
  6. [转]ASP.NET MVC Select List Example
  7. 关于文件的复制(用InputStream和OutputStream)
  8. 娓娓道来c指针 (0)c语言的梦魇:c指针
  9. IOS学习之路十九(JSON与Arrays 或者 Dictionaries相互转换)
  10. 判断mine类型
  11. Redis Windows 64位下安装Redis详细教程
  12. vue二、脚手架搭建
  13. Spark入门——什么是Hadoop,为什么是Spark?
  14. C# 分部方法partial
  15. 65. sqlserver执行存储过程实例
  16. bashttpd使用手册
  17. SQL2008如何清空压缩数据库日志
  18. pin
  19. AngularJS(一):概述
  20. vs和github同步开发步骤

热门文章

  1. js中toggle()及toggleClass()的使用详解
  2. python serialread
  3. java zip4j 内存文件和磁盘文件 压缩和加密
  4. JS简单实现自定义右键菜单
  5. Assert与内存泄漏
  6. nodeJS之fs文件系统
  7. Redis数据类型之Hash(二)
  8. LCA——求解最近公共祖先
  9. cocoapod升级
  10. 从netty源码里拿到的关于http错误码,自己学习下