Techlogbooks

Posts

Showing posts with the label OEDQ

EDQ Interview Questions & Answers-1

1. What are the types of external souces from which you can import data into EDQ? EDQ can import from different types of sources like text(.txt, .dsv etc), excel (.xls, csv), and all types of databases like Oracle, DB2, Postgresql, Mysql, Microsoft Sql Server, Sybase etc.. 2. What are the objects you create in EDQ to import files or from database? First of all we need to create a Data store pointing to file or database and then create and run the staged data to import data. In case of file you can either give the local path or if its server give the server credentials and path of the file to select the file. 3. What is the Staged data? Staged data is where you store the intermediate or final results within your EDQ space, it’s like a EDQ table which stores the Processed data from the processes 4. What is the different between Staged data and Reference data? Staged data is...

EDQ Interview Questions & Answers – 2

What is the main purpose of Lookup and Return? Lookup and return is one of the main processor used in the EDQ for the data enrichment. This processors takes one or more attributes as input and returns one or more attributes as output as per the reference data definition. If you multiple files/sources to read the data, how are going bring all data together in one stream? First of all create snapshot of all the files and add a reader processor for each file and then by using the Merge processor you can bring all the files together. P.S : All the files has to be in the same format to bring together in merge process/ you can selectively choose few columns from each file in Merge processor How will you identify and eliminate duplicates in EDQ ? In order to just identify duplicates we can use Duplicate check processor by passing one or more attributes on which duplicates needs to be identified. In order to eliminate/merge these duplicate, we can use Gr...

EDQ Interview Questions & Answers-3

Which processor you use to exclude the duplicate records ? Firstly we need to identify the duplicates by using the “Duplicate check” processor providing the attributes on which you want list duplicates. Take only the output records of this processor from “Non-Duplicated” port, thereby eliminating duplicates from the data stream. Which Processor is used to eliminate Duplicates ? In order to eliminate duplicates, we can use “Group and Merge” processor, which in turn has 3 sub-processors i.e. Input, Group and Merge. Add Attributes to Input Sub-processor to be considered in this data stream. Add the Attribute(s) on which to eliminate the duplicate to the “Group” sub processor. In the Merge Sub-process, select the relevant Merge function, by default its “Most Common Value” Consider the Merged output results for the De-duplicated records. What is the difference between “Lookup and Return” and “Lookup Check” Processors ? Lookup and Return, does the look up on t...