Academy
- Join the Academy Benefit from guided learning opportunities →
Community
Documentation
- Reference Documentation Comprehensive specifications of Dataiku →
Knowledge
- Knowledge Base Articles and tutorials on Dataiku features →
Developer
- Developer Guide Tutorials and articles for developers and coder users →
For You

Sign up to take part

Registered users can ask their own questions, contribute to discussions, and be part of the Community!

Learn more

Community
»
Discussions
»
Using Dataiku
»

Options

Subscribe to RSS Feed
Mark Topic as New
Mark Topic as Read
Float this Topic for Current User
Bookmark
Subscribe
Mute
Printer Friendly Page

Unable to write spark df to csv with column headers and multiple partitions?

parul

Level 1

‎10-02-2019 04:39 AM

Mark as New
Bookmark
Subscribe
Mute
Subscribe to RSS Feed
Permalink
Print
Report Inappropriate Content

Unable to write spark df to csv with column headers and multiple partitions?

Writing spark df into csv along with headers repartitions the df into 1 by default. So it takes a lot of time while writing considering the dataset is large because only 1 partition is active. How do I write spark dataframe to csv in hdfs with column headers and multiple partitions, so that it runs faster?

0 Kudos