contains the methotology for creating the input pipeline More...

Functions
def	nabu.processing.input_pipeline.get_filenames (dataconfs)
	create a list of filenames to put into the queue More...

def	nabu.processing.input_pipeline.input_pipeline (data_queue, batch_size, numbuckets, dataconfs, variable_batch_size=False, allow_smaller_final_batch=False, name=None)
	create the input pipeline More...

def	nabu.processing.input_pipeline.bucket_boundaries (histogram, numbuckets)
	detemine the bucket boundaries to uniformally devide the number of elements in the buckets More...

Detailed Description

contains the methotology for creating the input pipeline

Function Documentation

def nabu.processing.input_pipeline.bucket_boundaries	(	histogram,
		numbuckets
	)

detemine the bucket boundaries to uniformally devide the number of elements in the buckets

this is a greedy algorithm and does not guarantee an optimal solution

def nabu.processing.input_pipeline.get_filenames ( dataconfs )

create a list of filenames to put into the queue

Parameters

dataconfs the database configurations as a list of lists

Returns

def nabu.processing.input_pipeline.input_pipeline	(	data_queue,
		batch_size,
		numbuckets,
		dataconfs,
		variable_batch_size = `False`,
		allow_smaller_final_batch = `False`,
		name = `None`
	)

create the input pipeline

Parameters

data_queue	the data queue where the filenemas are queued
batch_size	the desired batch size
numbuckets	the number of data buckets
dataconfs	the databes configuration sections that should be read as a list of lists
variable_batch_size	bool, change batch size from bucket to bucket, for buckets with higher seq_length a smaller batch size is used
allow_smaller_final_batch	if set to True a smaller final batch is
allowed
name	name of the pipeline

Returns