- various bugfixes from GitHub issues
- k-means with some frozen centroids
- GPU: better tiling for large flat datasets
- AVX used by default for vector ops
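
To illustrate what k-means with some frozen centroids can mean, here is a minimal pure-Python sketch of Lloyd iterations in which selected centroids take part in the assignment step but are never updated. It is an illustration under assumptions, not the library's implementation: the function name `kmeans_frozen` and its parameters (`points`, `centroids`, `frozen`, `n_iter`) are hypothetical.

```python
def kmeans_frozen(points, centroids, frozen, n_iter=10):
    """Lloyd iterations where centroids whose index is in `frozen` never move.

    Hypothetical helper for illustration only.
    """
    centroids = [list(c) for c in centroids]
    for _ in range(n_iter):
        # Assignment step: every centroid, frozen or not, competes for points.
        sums = [[0.0] * len(c) for c in centroids]
        counts = [0] * len(centroids)
        for p in points:
            best = min(
                range(len(centroids)),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])),
            )
            counts[best] += 1
            for d, a in enumerate(p):
                sums[best][d] += a
        # Update step: only non-frozen, non-empty clusters are recomputed.
        for j in range(len(centroids)):
            if j in frozen or counts[j] == 0:
                continue
            centroids[j] = [s / counts[j] for s in sums[j]]
    return centroids


if __name__ == "__main__":
    pts = [(0, 0), (0, 2), (10, 10), (10, 12)]
    # Centroid 0 is frozen at (1, 1); centroid 1 is free to move.
    print(kmeans_frozen(pts, [(1, 1), (8, 8)], frozen={0}))
```

In this sketch a frozen centroid still attracts points, so it shapes the remaining clusters without being moved itself. Whether the actual feature behaves this way is an assumption.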