Ignite for Spark on YARN Deployment

percent620
My issue is described in detail at this link: http://apache-ignite-users.70518.x6.nabble.com/Ignite-for-Spark-on-YARN-Deployment-tt6910.html


My questions:
1) Which ports should be opened on the driver machine or the YARN machines? (These issues happened in our production environment.)

2) In my local environment (a 3-node Spark cluster [1 master, 2 workers] and 1 YARN cluster) there are no such issues: I manually disabled my firewall, and that fixed the problem.

I then ran into another issue, described below:
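
Since disabling the firewall fixed the local setup, a quick way to narrow down the production problem is to probe the relevant ports from the driver machine. The following is only a minimal sketch: `PortProbe` and `isReachable` are hypothetical names, `ignite-node-host` is a placeholder, and the port numbers are the start of each default range quoted in the reply further down this thread.

```scala
import java.io.IOException
import java.net.{InetSocketAddress, Socket}

object PortProbe {
  // Returns true if a TCP connection to host:port succeeds within timeoutMs.
  def isReachable(host: String, port: Int, timeoutMs: Int = 1000): Boolean = {
    val socket = new Socket()
    try {
      socket.connect(new InetSocketAddress(host, port), timeoutMs)
      true
    } catch {
      // Covers refused connections, timeouts and unknown hosts.
      case _: IOException => false
    } finally {
      socket.close()
    }
  }

  def main(args: Array[String]): Unit = {
    // Placeholder host; pass the real node address as the first argument.
    val host = args.headOption.getOrElse("ignite-node-host")
    // First port of each default range: REST, communication, discovery, JMX.
    for (port <- Seq(11211, 47100, 47500, 49112))
      println(s"$host:$port reachable = ${isReachable(host, port)}")
  }
}
```

A `false` result only means that the connection could not be made. It does not distinguish a port blocked by the firewall from a port on which no Ignite node is listening.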
========================
1)
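import org.apache.ignite.configuration.IgniteConfiguration
import org.apache.ignite.spark.IgniteContext

// sc is an existing SparkContext; open the shared cache as an RDD.
val igniteContext = new IgniteContext[Integer, Integer](sc, () => new IgniteConfiguration())
val sharedRDD = igniteContext.fromCache("sharedBaselineCacheRDD")
// 100000 (i, i) pairs spread over 10 partitions.
val initialRDD = sc.parallelize(1 to 100000, 10).map(i => (new Integer(i), new Integer(i)))
println("initialRDD.count => " + initialRDD.count() + "\tpartitions => " + initialRDD.partitions.size)

// Write the pairs into the cache, overwriting existing keys.
sharedRDD.savePairs(initialRDD, true)
println("=====> total count " + sharedRDD.count + "\tpartitions => " + sharedRDD.partitions.size)
// Count the values greater than 50000.
println("=====>" + sharedRDD.filter(_._2 > 50000).count)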


2) Submit the jar to the YARN cluster with the following command:
/u01/spark-1.6.0-hive/bin/spark-submit --driver-memory 4G --class com.ValidSparkCache --master yarn --executor-cores 5 --executor-memory 1000m --num-executors 10 --conf spark.rdd.compress=false --conf spark.shuffle.compress=false --conf spark.broadcast.compress=false /u01/xxx/sparkignite.jar -quesize 10 -batchSize 5000 -writethread 30 -runningSeconds 20

3) The final result should be 50000, but the actual result is 40000, 3000 (random).

The final result is 40000, NOT 50000. This also happens with the embedded Spark integration.
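
The expected value of 50000 can be confirmed without Spark or Ignite by applying the same predicate to the same pairs in plain Scala. This is only a sanity-check sketch; `ExpectedCount` is a hypothetical name, not something from the job above.

```scala
object ExpectedCount {
  // Same key/value pairs as the job: (1,1), (2,2), ..., (100000,100000).
  val pairs: Seq[(Int, Int)] = (1 to 100000).map(i => (i, i))

  // Same predicate as sharedRDD.filter(_._2 > 50000).
  val expected: Int = pairs.count(_._2 > 50000)

  def main(args: Array[String]): Unit =
    println(s"expected count = $expected") // prints "expected count = 50000"
}
```

Because the filter itself yields 50000 on the full data set, a smaller number from the job suggests that some pairs are missing from, or not visible in, the cache at the time of the count.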

Re: Ignite for Spark on YARN Deployment

Nikolai Tikhonov
Hi,


> 1) Which ports should be opened on the driver machine or the YARN machines?
> (These issues happened in our production environment.)


By default, Ignite uses the following ports: 11211 (REST), 47100..47200
(TCP communication), 47500..47600 (discovery), and 49112 (JMX). You can
change these values (see TcpCommunicationSpi#setLocalPort and
TcpDiscoverySpi#setLocalPort).
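
As an illustration of those two setters, the sketch below pins the communication and discovery ports to a narrow range so that fewer firewall rules are needed. It assumes the Ignite core classes are on the classpath; the `PinnedPorts` object, the `igniteConfig` helper and the range of 10 are illustrative choices, not values from this thread.

```scala
import org.apache.ignite.configuration.IgniteConfiguration
import org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi
import org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi

object PinnedPorts {
  // Hypothetical helper: builds a configuration with explicit port settings.
  def igniteConfig(): IgniteConfiguration = {
    // Node-to-node communication: start at the default port, small range.
    val commSpi = new TcpCommunicationSpi()
    commSpi.setLocalPort(47100)
    commSpi.setLocalPortRange(10) // assumed value; size it to the nodes per host

    // Discovery: start at the default port, small range.
    val discoSpi = new TcpDiscoverySpi()
    discoSpi.setLocalPort(47500)
    discoSpi.setLocalPortRange(10) // assumed value

    val cfg = new IgniteConfiguration()
    cfg.setCommunicationSpi(commSpi)
    cfg.setDiscoverySpi(discoSpi)
    cfg
  }
}
```

With the Spark integration, such a configuration could be passed as the closure argument, for example `new IgniteContext[Integer, Integer](sc, () => PinnedPorts.igniteConfig())`. The firewall then only needs to allow the chosen ranges, plus the REST and JMX ports if those are used.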

Regarding the other issue, could you describe your test environment and the
Spark configuration you are using, and say which setups give correct and
which give incorrect results? Standalone or embedded mode? An Ignite cluster
on YARN or a local cluster?

Re: Ignite for Spark on YARN Deployment

Valentin Kulichenko
I copy-pasted the response to the user list. Let's continue there; this
discussion does not belong on the dev list.

http://apache-ignite-users.70518.x6.nabble.com/Ignite-for-Spark-on-YARN-Deployment-td6910.html

-Val

On Mon, Aug 15, 2016 at 3:19 AM, Nikolai Tikhonov <[hidden email]>
wrote:

> Hi,
>
>
> > 1) Which ports should be opened on the driver machine or the YARN machines?
> > (These issues happened in our production environment.)
>
>
> By default, Ignite uses the following ports: 11211 (REST), 47100..47200
> (TCP communication), 47500..47600 (discovery), and 49112 (JMX). You can
> change these values (see TcpCommunicationSpi#setLocalPort and
> TcpDiscoverySpi#setLocalPort).
>
> Regarding the other issue, could you describe your test environment and the
> Spark configuration you are using, and say which setups give correct and
> which give incorrect results? Standalone or embedded mode? An Ignite cluster
> on YARN or a local cluster?
>