Lets learn at Informatica Power Center in less than 30mins.
We will first discuss the basic overall architecture of the product, and then will have some hands on to break through.
Informatica is a tool for the ETL process and integration, which has a service-oriented architecture. SOA provides the ability to scale services and share resources across multiple machines.
Informatica has the ability to communicate with all major data sources and database vendors. It can move data to/from these data sources.
Power Center Domain is the fundamental and an important administrative unit. It is a collection of nodes and services. A node is the logical representation of a machine in a domain. Services are the processes that help in the whole ETL process, e.g. authentication, authorization, repository management, integration services etc.
Services for the domain include:
- Service Manager: It manages all domain operations. It runs the application services and performs domain functions (e.g. authentication, authorization, and logging) on each node in the domain. The Service Manager uses TCP/IP to communicate with the application services.
· Application services
· Repository Service: It manages the repository. Repository is the data about all the data and and the ETL involved on the data. It retrieves, inserts, and updates metadata in the repository database tables.
· Integration Service: The Integration Service runs sessions and workflows.
A session is a set of instructions that tells the Integration Service how to move data from sources to targets
A workflow is a set of instructions that tells the Integration Service how to execute tasks, such as sessions, email notifications, and shell commands.
The Integration Service reads workflow information from the repository. The Integration Service connects to the repository through the Repository Service to fetch metadata from the repository.
· SAP BW Service: listens for RFC requests from SAP BW and processes it.
· Web Services Hub: Runs within Service Manager and receives requests from web service clients and processes.
There are 2 types of nodes:
- Gateway Nodes: A gateway node can run application services, and it can serve as a master gateway node. The master gateway node is the entry point to the domain.
- Worker Nodes: A worker node is any node not configured to serve as a gateway. The Service Manager performs limited domain operations on a worker node.
Some other terms:
Mapping: is to depict the flow of data between sources and targets.
Transformations: We add transformations to a mapping that depict how the Integration Service extracts and transforms data before it loads a target.
A transformation is a part of a mapping that generates or modifies data. Every mapping includes a Source Qualifier transformation, representing all data read from a source and temporarily stored by the Integration Service. In addition, you can add transformations that calculate a sum, look up a value, or generate a unique ID before the source data reaches the target
Source Qualifier: The source qualifier represents the rows that the Integration Service reads from the source when it runs a session.
Session: A session is a set of instructions that tells the Integration Service how to move data from sources to targets. You create a session for each mapping that you want the Integration Service to run
Workflow: A workflow is a set of instructions that tells the Integration Service how to execute tasks, such as sessions, email notifications, and shell commands.
Above is some insight into the working of the Informatica. For more in depth architecture of the Informatica, you can visit:
http://informatica.techtiks.com/informatica_architecture.html
Informatica tool has 3 main components:
- Client tools: These tools are used for the development process. It has the following supporting tools:
· Designer: To define mapping and transformation.
· Workflow Manager: To define the session and the workflow.
· Workflow Monitor: To monitor execution of sessions.
· Repository Manager: To manage repository.
2. Server tools: Can be considered as the staging area, where data from sources are fetched, transformed and then loaded into the target.
3. Repository: Repository is a kind of data inventory where all the data related to mappings, sources, targets etc is kept.
Now once we are familiar with the overall description of the product, we are now ready to have some practical hands-on. Follow the following steps to perform your ETL. This is a highly generic steps, and depending on your requirements u need to perform your transformations.
1. Create a folder to organize your work.
2. Logon to the Repository in the Designer Window.
3. Create the Source Repository in the Repository Database.
4. Create the Target Repository in the Repository Database. If Target does not exist, then create and execute the Target creation commands.
5. Now perform the source to target mapping, with appropriate transformations.
6. Now create a session for the mapping and thereafter a workflow for its execution. This will be done in the Workflow Manager.
7. Now run the workflow and watch in Workflow Monitor. It will flash ‘Success’ once the operation completed successfully.
The time is running out. So, will post the pictorial form of the steps in next issue.
Thanks, and comments are solicited.
7 comments:
Thank you.
you blog is good
informatica mdm Training
thank you for information,well blog sites in excellent concept for informatica
Keep yourself upgraded with the latest software techniques running in the market, through most latest & upgraded software skills.
This is an awesome post.Really very informative and creative contents. These concept is a good way to enhance the knowledge.I like it and help me to development very well.Thank you for this brief explanation and very nice information.
Informatica Training in Chennai
Well Said, you have furnished the right information that will be useful to anyone at all time. Thanks for sharing your Ideas.
aws Training in indira nagar | Aws course in indira Nagar
selenium Training in indira nagar | Best selenium course in indira Nagar | selenium course in indira Nagar
python Training in indira nagar | Best python training in indira Nagar
datascience Training in indira nagar | Data science course in indira Nagar
devops Training in indira nagar | Best devops course in indira Nagar
I appreciate that you produced this wonderful article to help us get more knowledge about this topic. I know, it is not an easy task to write such a big article in one day, I've tried that and I've failed. But, here you are, trying the big task and finishing it off and getting good comments and ratings. That is one hell of a job done!
python course in pune
python course in chennai
python Training in Bangalore
I think Informatica is the best and most useful visualization tool to look and find more and more solutions to some extremely complicated IT problems.Its really the next big thing.
Informatica Read Soap API
Post a Comment