Refer to the following Table for Questions 6-7
CREATE TABLE TEST_MSR_SOURCE (
rpt_grp_cd varchar(60) ,
lctn_typ_cd varchar(10) ,
clctn_prd_txt varchar(8) ,
msr_cd varchar(20),
clcltn_date varchar(10),
grp_rate_nmrtr varchar(3),
grp_rate_dnmntr varchar(5) ,
file_name varchar(50),
creat_ts varchar(50),
creat_user_id varchar(30),
submsn_cmplt_cd varchar(1))
CREATE TABLE TEST_MSR_TARGET (
TEST_MSR_TARGET_ID int4 NOT NULL
rpt_grp_cd varchar(60),
lctn_typ_cd varchar(10),
clctn_prd_txt varchar(8),
msr_cd varchar(20),
clcltn_date date,
grp_rate_nmrtr int4,
grp_rate_dnmntr numeric(5),
file_name varchar(50),
finl_sw varchar(1),
creat_ts timestamp NOT NULL,
creat_user_id varchar(30) NOT NULL,
submsn_cmplt_cd varchar(1))
Import .csv file into TEST_MSR_SOURCE using Python Script (name of the csv file is python_test.csv)
Write a Python script that takes all data from TEST_MSR_SOURCE and inserts the data into TEST_MSR_TARGET
Database type = Redshift
Number of rows = 10
Sequential key used for TEST_MSR_TARGET_ID
Data transformations from varchar to (date, int, numeric, timestamp)
Would your script change if the number of rows = 1,000,000? Why or why not?