How do I connect to a Hive metastore programmatically in SparkSQL?

Question

I'm using HiveContext with SparkSQL and I'm trying to connect to a remote Hive metastore. The only way I know to set the Hive metastore is by including hive-site.xml on the classpath (or copying it to /etc/spark/conf/). Is there a way to set this parameter programmatically in Java code without including hive-site.xml? If so, which Spark configuration should I use?

ravikiran · Answer 1 · Sep 5, 2019

In Spark 2.0+ it should look something like the code below.

Don't forget to replace the "hive.metastore.uris" value with your own. This assumes that you already have a Hive metastore service running (the metastore service, not a Hive server).

    import org.apache.spark.sql.SparkSession

    val spark = SparkSession
      .builder()
      .appName("interfacing spark sql to hive metastore without configuration file")
      .config("hive.metastore.uris", "thrift://localhost:9083") // replace with your Hive metastore service's thrift URL
      .enableHiveSupport() // don't forget to enable Hive support
      .getOrCreate()

    import spark.implicits._
    import spark.sql
    // create an arbitrary frame
    val frame = Seq(("one", 1), ("two", 2), ("three", 3)).toDF("word", "count")
    // see the frame created
    frame.show()
    /**
     * +-----+-----+
     * | word|count|
     * +-----+-----+
     * |  one|    1|
     * |  two|    2|
     * |three|    3|
     * +-----+-----+
     */
    // write the frame as a table registered in the metastore
    frame.write.mode("overwrite").saveAsTable("t4")
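
To check that the table really went through the remote metastore, you could read it back from a separate application. The following is only a minimal sketch: the object name `VerifyMetastoreTable` is made up for illustration, the thrift URI is the same placeholder as above, and it assumes the warehouse location where `t4` was written is reachable from wherever this runs.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical standalone check, not part of the original answer.
object VerifyMetastoreTable {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession
      .builder()
      .appName("verify table in remote hive metastore")
      // assumption: same placeholder metastore URI as above; replace with yours
      .config("hive.metastore.uris", "thrift://localhost:9083")
      .enableHiveSupport()
      .getOrCreate()

    // list the tables the metastore knows about in the current database
    spark.sql("SHOW TABLES").show()

    // read the table back by name; the lookup is resolved through the metastore
    spark.table("t4").show()

    spark.stop()
  }
}
```

If `t4` appears in the `SHOW TABLES` output and `spark.table("t4")` returns the three rows, the session is talking to the metastore without any hive-site.xml on the classpath.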