blockselect | Initial commit | 2017-02-22 23:26:44 +01:00
nvidia | added benchmarking scripts | 2017-02-24 08:32:08 -08:00
warpselect | Initial commit | 2017-02-22 23:26:44 +01:00
BlockSelectFloat.cu | Initial commit | 2017-02-22 23:26:44 +01:00
BlockSelectHalf.cu | Initial commit | 2017-02-22 23:26:44 +01:00
BlockSelectKernel.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Comparators.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
ConversionOperators.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
CopyUtils.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceDefs.cuh | add support for Nvidia GPUs with CUDA compute capability 6.1 (#110) | 2017-05-31 15:52:39 +02:00
DeviceMemory.cpp | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceMemory.h | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceTensor-inl.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceTensor.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceUtils.cpp | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceUtils.h | Initial commit | 2017-02-22 23:26:44 +01:00
DeviceVector.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Float16.cu | added benchmarking scripts | 2017-02-24 08:37:57 -08:00
Float16.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
HostTensor-inl.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
HostTensor.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Limits.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
LoadStoreOperators.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
MathOperators.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
MatrixMult.cu | Initial commit | 2017-02-22 23:26:44 +01:00
MatrixMult.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
MergeNetworkBlock.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
MergeNetworkWarp.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
NoTypeTensor.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Pair.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
PtxUtils.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
ReductionOperators.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Reductions.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Select.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
StackDeviceMemory.cpp | Initial commit | 2017-02-22 23:26:44 +01:00
StackDeviceMemory.h | Initial commit | 2017-02-22 23:26:44 +01:00
StaticUtils.h | correct utils::nextHighestPowerOf2 | 2017-05-06 20:36:16 +02:00
Tensor-inl.cuh | sync with FB version. Added: | 2017-03-20 10:48:35 -07:00
Tensor.cuh | sync with FB version. Added: | 2017-03-20 10:48:35 -07:00
ThrustAllocator.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
Timer.cpp | Initial commit | 2017-02-22 23:26:44 +01:00
Timer.h | Initial commit | 2017-02-22 23:26:44 +01:00
Transpose.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
WarpSelectFloat.cu | Initial commit | 2017-02-22 23:26:44 +01:00
WarpSelectHalf.cu | Initial commit | 2017-02-22 23:26:44 +01:00
WarpSelectKernel.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
WarpShuffles.cuh | Initial commit | 2017-02-22 23:26:44 +01:00
WorkerThread.cpp | Initial commit | 2017-02-22 23:26:44 +01:00
WorkerThread.h | Initial commit | 2017-02-22 23:26:44 +01:00