A. R. Shajii
|
38e08b409a
|
Doc updates (#385)
* Documentation updates
* Documentation updates
* Update README.md
* Fix tuple indexing error messages
* Update roadmap, differences
* Update README, FAQ
* Trim newline
* Update README.md
* Update README.md
* Update README.md
* Update roadmap.md
* Update cpp.md
* Update README.md
* Update roadmap.md
* Update README.md
* Fix test
* clang-format
* Fix exporting function named "main"
* Update export test
* Fix paths
* Rename extra/python -> jit
* Update license change date
* Minor docs updates
* Re-add __init__.py
* Update header
* Update gitignore
* Update README.md
---------
Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
Co-authored-by: Ibrahim Numanagić <inumanag@users.noreply.github.com>
|
2023-05-23 17:59:26 -04:00 |
A. R. Shajii
|
bac6ae58dd
|
Generator argument optimization (and more) (#175)
* Fix ABI incompatibilities
* Fix codon-jit on macOS
* Fix scoping bugs
* Fix .codon detection
* Handle static arguments in magic methods; Update simd; Fix misc. bugs
* Avoid partial calls with generators
* clang-format
* Add generator-argument optimization
* Fix typo
* Fix omp test
* Make sure sum() does not call __iadd__
* Clarify difference in docs
* Fix any/all generator pass
* Fix InstantiateExpr simplification; Support .py as module extension
* clang-format
* Bump version
Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
|
2023-01-17 10:21:59 -05:00 |
A. R. Shajii
|
ebd344f894
|
GPU and other updates (#52)
* Add nvptx pass
* Fix spaces
* Don't change name
* Add runtime support
* Add init call
* Add more runtime functions
* Add launch function
* Add intrinsics
* Fix codegen
* Run GPU pass between general opt passes
* Set data layout
* Create context
* Link libdevice
* Add function remapping
* Fix linkage
* Fix libdevice link
* Fix linking
* Fix personality
* Fix linking
* Fix linking
* Fix linking
* Add internalize pass
* Add more math conversions
* Add more re-mappings
* Fix conversions
* Fix __str__
* Add decorator attribute for any decorator
* Update kernel decorator
* Fix kernel decorator
* Fix kernel decorator
* Fix kernel decorator
* Fix kernel decorator
* Remove old decorator
* Fix pointer calc
* Fix fill-in codegen
* Fix linkage
* Add comment
* Update list conversion
* Add more conversions
* Add dict and set conversions
* Add float32 type to IR/LLVM
* Add float32
* Add float32 stdlib
* Keep required global values in PTX module
* Fix PTX module pruning
* Fix malloc
* Set will-return
* Fix name cleanup
* Fix access
* Fix name cleanup
* Fix function renaming
* Update dimension API
* Fix args
* Clean up API
* Move GPU transformations to end of opt pipeline
* Fix alloc replacements
* Fix naming
* Target PTX 4.2
* Fix global renaming
* Fix early return in static blocks; Add __realized__ function
* Format
* Add __llvm_name__ for functions
* Add vector type to IR
* SIMD support [wip]
* Update kernel naming
* Fix early returns; Fix SIMD calls
* Fix kernel naming
* Fix IR matcher
* Remove module print
* Update realloc
* Add overloads for 32-bit float math ops
* Add gpu.Pointer type for working with raw pointers
* Add float32 conversion
* Add to_gpu and from_gpu
* clang-format
* Add f32 reduction support to OpenMP
* Fix automatic GPU class conversions
* Fix conversion functions
* Fix conversions
* Rename self
* Fix tuple conversion
* Fix conversions
* Fix conversions
* Update PTX filename
* Fix filename
* Add raw function
* Add GPU docs
* Allow nested object conversions
* Add tests (WIP)
* Update SIMD
* Add staticrange and statictuple loop support
* SIMD updates
* Add new Vec constructors
* Fix UInt conversion
* Fix size-0 allocs
* Add more tests
* Add matmul test
* Rename gpu test file
* Add more tests
* Add alloc cache
* Fix object_to_gpu
* Fix frees
* Fix str conversion
* Fix set conversion
* Fix conversions
* Fix class conversion
* Fix str conversion
* Fix byte conversion
* Fix list conversion
* Fix pointer conversions
* Fix conversions
* Fix conversions
* Update tests
* Fix conversions
* Fix tuple conversion
* Fix tuple conversion
* Fix auto conversions
* Fix conversion
* Fix magics
* Update tests
* Support GPU in JIT mode
* Fix GPU+JIT
* Fix kernel filename in JIT mode
* Add __static_print__; Add earlyDefines; Various domination bugfixes; SimplifyContext RAII base handling
* Fix global static handling
* Fix float32 tests
* Fix gpu module
* Support OpenMP "collapse" option
* Add more collapse tests
* Capture generics and statics
* TraitVar handling
* Python exceptions / isinstance [wip; no_ci]
* clang-format
* Add list comparison operators
* Support empty raise in IR
* Add dict 'or' operator
* Fix repr
* Add copy module
* Fix spacing
* Use sm_30
* Python exceptions
* TypeTrait support; Fix defaultDict
* Fix earlyDefines
* Add defaultdict
* clang-format
* Fix invalid canonicalizations
* Fix empty raise
* Fix copyright
* Add Python numerics option
* Support py-numerics in math module
* Update docs
* Add static Python division / modulus
* Add static py numerics tests
* Fix staticrange/tuple; Add KwTuple.__getitem__
* clang-format
* Add gpu parameter to par
* Fix globals
* Don't init loop vars on loop collapse
* Add par-gpu tests
* Update gpu docs
* Fix isinstance check
* Remove invalid test
* Add -libdevice to set custom path [skip ci]
* Add release notes; bump version [skip ci]
* Add libdevice docs [skip ci]
Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
|
2022-09-15 15:40:00 -04:00 |
A. R. Shajii
|
d5ce1f8ff9
|
Update docs (#28)
* Update docs
* Update docs
* Update docs
* GitBook: [#4] Add hint
* Update primer
* Re-organize docs
* Fix table
* Fix link
* GitBook: [#5] No subject
* GitBook: [#6] No subject
* Cleanup and doc fix
* Add IR docs
* Add ir docs
* Fix spelling error
* More IR docs
* Update README.md
* Update README.md
* Fix warning
* Update intro
* Update README.md
* Update docs
* Fix table
* Don't build docs
* Update docs
* Add Jupyter docs
* Fix snippet
* Update README.md
* Fix images
* Fix code block
* Update docs, update cmake
* Break up tutorial
* Update pipeline.svg
* Update docs for new version
* Add differences with Python docs
|
2022-07-26 16:08:42 -04:00 |