Code for converting multiple XML files to JSON format

0 votes
I want to convert a folder containing over 3,300 .xml files into a .json file. How would I do this? The format is fixed. I would like to do this in Python or Java, whichever is easiest, and I would like to see a coded example if possible. If there is an open-source industry tool that does this, please point me to it; all I can find are single-file converters, which won't work for me. Can I do this in Talend? If so, how? In total I have 11 zip files and approximately 210 gigabytes of data.
Dec 18, 2020 in Talend by Aaron
• 160 points
122 views

1 answer to this question.

+1 vote
Best answer

Hey, @AAron,

I would suggest going through this guide, since the conversion involves multiple steps: https://www.girikon.com/blog/etl-xml-and-json-files-processing-with-talend-open-studio/
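If you would rather script it in Python, here is a minimal sketch using only the standard library. It is not the Talend approach, and the XML-to-JSON mapping is an assumption on my part: attributes become `@name` keys, repeated child tags become lists, and text mixed with children goes under `#text`. The function names `element_to_dict` and `convert_folder` and the output keys `source_file`, `root` and `data` are made up for illustration. It assumes the .xml files have already been unzipped into one folder and that each individual file fits in memory.

```python
import json
import xml.etree.ElementTree as ET
from pathlib import Path


def element_to_dict(elem):
    """Convert one XML element to a dict or string (an assumed mapping, not a standard)."""
    node = {}
    # Attributes are stored under "@name" keys.
    for name, value in elem.attrib.items():
        node["@" + name] = value
    # Child elements are converted recursively; repeated tags are collected into a list.
    for child in elem:
        value = element_to_dict(child)
        if child.tag in node:
            if not isinstance(node[child.tag], list):
                node[child.tag] = [node[child.tag]]
            node[child.tag].append(value)
        else:
            node[child.tag] = value
    text = (elem.text or "").strip()
    if text:
        if not node:
            # Element with text only: return the bare string.
            return text
        node["#text"] = text
    return node


def convert_folder(xml_dir, out_path):
    """Write every *.xml file in xml_dir into one JSON array, one entry per file."""
    count = 0
    with open(out_path, "w", encoding="utf-8") as out:
        out.write("[\n")
        for xml_file in sorted(Path(xml_dir).glob("*.xml")):
            root = ET.parse(str(xml_file)).getroot()
            record = {
                "source_file": xml_file.name,
                "root": root.tag,
                "data": element_to_dict(root),
            }
            # Entries are written one at a time so the combined output
            # never has to be held in memory all at once.
            if count:
                out.write(",\n")
            json.dump(record, out, ensure_ascii=False)
            count += 1
        out.write("\n]\n")
    return count
```

You would call it as `convert_folder("xml_folder", "combined.json")`, where both paths are placeholders for your own. It returns the number of files it converted.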

answered Dec 18, 2020 by Gitika
• 65,870 points

selected Dec 20, 2020 by Aaron
Awesome - thank you!
I do have another question: if I want to take the .xml and turn it into a specifically formatted file for an RDF graph database, can I specify or correct the JSON schema so that it reflects the new and improved schema?
Is there a rule of thumb for how many files I should iterate through at one time, or do I need to experiment? I'm also curious how long 3,300 files might take to process.
