I am looking at kudu's documentation.
Below is a partial description of kudu-spark.
https://kudu.apache.org/docs/developing.html#_avoid_multiple_kudu_clients_per_cluster
Avoid multiple Kudu clients per cluster.
One common Kudu-Spark coding error is instantiating extra
KuduClientobjects. In kudu-spark, aKuduClientis owned by theKuduContext. Spark application code should not create anotherKuduClientconnecting to the same cluster. Instead, application code should use theKuduContextto access aKuduClientusingKuduContext#syncClient.To diagnose multiple
KuduClientinstances in a Spark job, look for signs in the logs of the master being overloaded by manyGetTableLocationsorGetTabletLocationsrequests coming from different clients, usually around the same time. This symptom is especially likely in Spark Streaming code, where creating aKuduClientper task will result in periodic waves of master requests from new clients.
Does this mean that I can only run one kudu-spark task at a time?
If I have a spark-streaming program that is always writing data to the kudu, How can I connect to kudu with other spark programs?