Big Data with HierarchicalRegression

alphamaximus · August 10, 2018, 8:14pm

I managed to get around it by using Minibatch.update_shared_f:

def choice_iterator(size, nrows, seed):
    state = np.random.RandomState()
    state.seed(seed)

    while True:
        yield sorted(np.random.randint(0, nrows+1, size=size))

def update_shared_f_in_memory(obj, size, seed):

    nrows     = len(obj) if isinstance(obj, np.ndarray) else len(obj.index)
    generator = choice_iterator(size=size, nrows=nrows, seed=seed)

    def f():
        where = next(generator)
        return obj[where] if isinstance(obj, np.ndarray) else obj.iloc[where]

    return f

Topic		Replies	Views
ADVI Minibatch slows down with increasing size of data Questions	3	988	April 19, 2019
Minibatch for a large dataset ADVI Questions	2	1209	September 7, 2018
Simple Hierarchical Model with Huge Data version agnostic	6	165	July 23, 2024
Hierarchical Model - Slow Sampling Questions	4	1168	March 26, 2020
Hierarchical gaussian mixture model VI minibatch Questions	0	479	July 14, 2020

Big Data with HierarchicalRegression

Related topics