Spark sqlContext.read.json - java.io.IOException: No input paths specified in job
Fixing Spark SQL's 'No input paths specified' error when reading JSON files by using the correct file:// or hdfs:// URI prefix.
Robin Moffatt is a Principal DevEx Engineer and seasoned conference speaker with 15+ years of experience presenting at top events like QCon, Devoxx, Kafka Summit, and Strata. He shares insights on developer experience, distributed systems, and cloud technologies through his blog, YouTube, and public talks.
617 articles from this blog
Fixing Spark SQL's 'No input paths specified' error when reading JSON files by using the correct file:// or hdfs:// URI prefix.
Explores using SparkSQL and Pandas to import data into Hive and Big Data Discovery, bypassing manual Hive table creation.
Fixing SSH connection errors in Proxmox 4 LXC containers by correcting the network CIDR suffix from /32 to /24.
Guide to resetting a Hue user's password and activating the account in a Cloudera/Hadoop environment using Django shell commands.
Troubleshooting a 'No current connection' error in Apache Drill caused by a conflicting Jackson JAR file, with a solution provided.
A technical guide on resolving ClassNotFoundException errors when using the MongoDB-Hadoop connector with Apache Hive, focusing on required JAR files.
Fixing a CDH installation failure on LXC/Proxmox caused by an erroneous SwapFree value in /proc/meminfo when swap is disabled.
Guide to using Jupyter Notebooks with Oracle Big Data Discovery 1.2 for advanced data science and Python/Spark integration.
Guide to setting up Big Data Discovery Shell and Jupyter Notebooks on Oracle's Big Data Lite VM for advanced data science work.
A guide to recovering a bricked Ubiquiti EdgeRouter Lite using a Mac, serial connection, and TFTP to fix corrupt filesystem errors.
A technical guide on installing Docker on Proxmox and running a Resilio Sync (BitTorrent Sync) container for decentralized file synchronization.
A technical guide on converting and importing existing VMWare and VirtualBox virtual machines to run on the Proxmox virtualization platform.
A technical guide detailing the installation of Proxmox VE 4 on a server and the setup of ext4 filesystems on multiple hard drives.
A tech enthusiast details setting up a custom Proxmox home server for hosting VMs and containers for various data and analytics tools.
Exploration of OBIEE 12c features, focusing on RESTful APIs, web services, cache behavior, and data set service.
How to enable and list detailed logging sources in OBIEE 12c Presentation Services for troubleshooting and diagnostics.
Explains OBIEE 12c's Extended Subject Areas (XSA) and Data Set Service for user-uploaded data analysis.
Explains changes to the BI Server cache behavior in OBIEE 12c, focusing on the new OBIS_REFRESH_CACHE logic.
Explores undocumented REST web services in OBIEE 12c for downloading RPD files, detailing commands and diagnostic methods.
A technical guide on dynamically naming OBIEE 12c service instance exports using WLST scripts with timestamps and hostnames.