Question about partitioned cluster schema

I have one PU. When the PU is deployed, about 10000 objects are loaded from a database. I'm using a partitioned cluster schema with 2 cluster instances. Partition 1 has 10000 objects and partition 2 has 10000 objects too. When my application does a read multiple operation, it returns each object twice, I think once per partition. I need the results without duplicates. How can I solve this? Do I need to point to only one instance?



  pu.xml

asked 2008-06-24 17:55:06 -0500

updated 2013-08-08 09:52:00 -0500

1 Answer

Your cluster external data source settings need to be configured.
You should run in shared data source mode to make sure each partition loads the correct data. Make sure you have a routing field declared.
If you are using 6.0, make sure your routing field and id decorations use the same field. This limitation has been lifted in 6.5.
I recommend using the OpenSpaces Hibernate data source that comes with 6.5 and implementing the initial load method so that each partition queries only its relevant data set from the database. Without this optimization each partition loads all the data and then keeps only the relevant part, which can take a long time and consume memory.
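
To illustrate why a routing field gives each partition its own share of the data, here is a minimal sketch in plain Java. It assumes the routing scheme is "hash code of the routing value modulo the number of partitions"; the class and method names (PartitionRouting, partitionFor, idsForPartition) are hypothetical and are not part of the GigaSpaces API. With 10000 ids and 2 partitions, every id lands in exactly one partition, so a read across both partitions returns no duplicates.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not GigaSpaces code.
public class PartitionRouting {

    // Assumed scheme: non-negative hash of the routing value, modulo partition count.
    static int partitionFor(Object routingValue, int partitionCount) {
        // Mask the sign bit so a negative hashCode cannot yield a negative index.
        return (routingValue.hashCode() & Integer.MAX_VALUE) % partitionCount;
    }

    // Keeps only the ids that belong to the given 0-based partition,
    // which is what a per-partition initial load would query for.
    static List<Integer> idsForPartition(List<Integer> allIds, int partition, int partitionCount) {
        List<Integer> result = new ArrayList<Integer>();
        for (Integer id : allIds) {
            if (partitionFor(id, partitionCount) == partition) {
                result.add(id);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Integer> allIds = new ArrayList<Integer>();
        for (int i = 1; i <= 10000; i++) {
            allIds.add(i);
        }
        int partitionCount = 2;
        for (int p = 0; p < partitionCount; p++) {
            List<Integer> mine = idsForPartition(allIds, p, partitionCount);
            System.out.println("partition " + p + " loads " + mine.size() + " objects");
        }
    }
}
```

If each partition instead loads all 10000 objects without filtering on the routing value, both partitions hold a copy of every object, which matches the duplicates described in the question.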

Please note that with the ALL IN CACHE cache policy all the data is loaded, while with LRU the number of objects loaded is half of the cache size.

Take a look here:


  pu.xml

answered 2008-06-24 18:32:49 -0500

Hi Shay.

How can I configure the shared data source mode? My routing field and space id are now the same. What are the criteria for choosing the routing field? I'm working with GigaSpaces 6.5 and Java 1.5.0. How can I use the OpenSpaces Hibernate data source, and what is the difference from the GigaSpaces 6.0 version?

Jorge


To configure the external data source when spaces are running in cluster mode, use the following properties:

cluster-config.cache-loader.external-data-source=true
cluster-config.cache-loader.central-data-source=true

Take a look at org.openspaces.persistency.hibernate.DefaultHibernateExternalDataSource and org.openspaces.persistency.hibernate.StatelessHibernateExternalDataSource.

These provide much better data loading than the old HibernateExternalDataSource and include a few bug fixes.

My cluster mode configuration is fine, but my question is about the SHARED MODE CONFIGURATION.

Thanks


The shared data source mode should be set to true.

Can you post the pu.xml line that sets the shared data source mode to true?

Thanks


