分享

Adventures with Open Source BI

 集微笔记 2013-09-12
By Slawomir Chodnicki, on July 5th, 2012
rolling_plugins

Dear Kettle enthusiasts,

this is just a short blog post to point out that there are some new development resources available: Pentaho has added several sample plugins for PDI to the public CI-Server. If you ever wondered how to create your own plugins for PDI, check out the samples! . . . → Read More: Exploring the sample plugins for PDI

By Slawomir Chodnicki, on June 7th, 2012
concept_fifo

Most database systems provide a command-line based bulk loading utility that operates on flat files. These db-specific utilities are usually the fastest way to get data into a db system. This post shows a generic technique for using bulk loading utilities as part of a PDI ETL process. The download package has a working sample showing the . . . → Read More: Implementing Universal Bulk-Loading in PDI

By Slawomir Chodnicki, on December 6th, 2011
edi

In this post I would like to announce the release of the Edi2Xml PDI plugin. It will convert a string field with edifact data to an XML text, with one tag per segment, containing the fields and values as child tags. This makes the edifact data more accessible, as the XML can be queried using the . . . → Read More: Releasing Edi2Xml plugin for PDI

By Slawomir Chodnicki, on October 14th, 2011
cookbook_2

“PDI 4 Cookbook”, published June 2011, is a wonderful collection of tips, tricks, techniques and best practices regarding Kettle 4.x. It contains over 70 individual recipes that show how to solve common (and sometimes extraordinary) data processing tasks. This book is not about architecture diagrams, technology buzzwords and the philosophy behind Enterprise APIs. It is a . . . → Read More: Book Review: PDI 4 Cookbook by Mar?a Carina and Adri?n Sergio

By Slawomir Chodnicki, on October 12th, 2011
clustered_group_and_sort

This article introduces clustering concepts supported by Kettle a.k.a. PDI. If you need to replicate data to several physical databases, or would like to learn about scale-out options for record processing, this article may be for you. As usual, the downloads section has the demo transformations for this article. . . . → Read More: Clustering in Kettle

By Slawomir Chodnicki, on September 9th, 2011
article_pic

This article introduces partitioning concepts supported by Kettle a.k.a. PDI. If you need to partition records over several tables, or would like to learn about increasing the parallelism of your transformations, this article may be for you. . . . → Read More: Partitioning in Kettle

By Slawomir Chodnicki, on June 28th, 2011
featured

This post shows the steps to get a customized CDE dashboard up and running on Pentaho BI Server. Starting from a mock-up sketch of a dashboard, this post shows the implementation steps all the way to the finished result, explaining CDE concepts as they appear along the way. CDE uses web technologies i.e. HTML/CSS/JS and SVG. . . . → Read More: Creating Dashboards with CDE

By Slawomir Chodnicki, on May 28th, 2011
pillars

Pentaho Report Designer exposes the jFreeChart objects of charts to a post-processing script. This opens up some interesting possibilities. This post shows some techniques to dynamically change the appearance of a chart. All samples are available in the download package. Let’s play!

Using Colors straight from the Data

In some circumstances it’s useful to get colors straight from . . . → Read More: Dynamic coloring of PRD Charts

By Slawomir Chodnicki, on May 27th, 2011
order_color_coding

Maintaining consistent color coding for metrics on PRD charts can be a bit challenging, as chart colors are defined statically with no reference to a specific metric they belong to. This post shows how to maintain consistent relationships between colors and metrics in PRD charts using two different approaches. Get the sample reports from the downloads . . . → Read More: Color Coding and PRD Charts

By Slawomir Chodnicki, on May 14th, 2011
finite_automaton

Regular expressions are a very useful tool for a variety of string related tasks. In Kettle they are frequently used for extraction and manipulation tasks, as well as for specifying groups of file names. This post gives an introduction to regular expressions in general as well as some applications within Kettle a.k.a. PDI. Since the built-in . . . → Read More: An Introduction to Regular Expressions

    本站是提供个人知识管理的网络存储空间,所有内容均由用户发布,不代表本站观点。请注意甄别内容中的联系方式、诱导购买等信息,谨防诈骗。如发现有害或侵权内容,请点击一键举报。
    转藏 分享 献花(0

    0条评论

    发表

    请遵守用户 评论公约

    类似文章 更多