A. R. Shajii
|
ebd344f894
|
GPU and other updates (#52)
* Add nvptx pass
* Fix spaces
* Don't change name
* Add runtime support
* Add init call
* Add more runtime functions
* Add launch function
* Add intrinsics
* Fix codegen
* Run GPU pass between general opt passes
* Set data layout
* Create context
* Link libdevice
* Add function remapping
* Fix linkage
* Fix libdevice link
* Fix linking
* Fix personality
* Fix linking
* Fix linking
* Fix linking
* Add internalize pass
* Add more math conversions
* Add more re-mappings
* Fix conversions
* Fix __str__
* Add decorator attribute for any decorator
* Update kernel decorator
* Fix kernel decorator
* Fix kernel decorator
* Fix kernel decorator
* Fix kernel decorator
* Remove old decorator
* Fix pointer calc
* Fix fill-in codegen
* Fix linkage
* Add comment
* Update list conversion
* Add more conversions
* Add dict and set conversions
* Add float32 type to IR/LLVM
* Add float32
* Add float32 stdlib
* Keep required global values in PTX module
* Fix PTX module pruning
* Fix malloc
* Set will-return
* Fix name cleanup
* Fix access
* Fix name cleanup
* Fix function renaming
* Update dimension API
* Fix args
* Clean up API
* Move GPU transformations to end of opt pipeline
* Fix alloc replacements
* Fix naming
* Target PTX 4.2
* Fix global renaming
* Fix early return in static blocks; Add __realized__ function
* Format
* Add __llvm_name__ for functions
* Add vector type to IR
* SIMD support [wip]
* Update kernel naming
* Fix early returns; Fix SIMD calls
* Fix kernel naming
* Fix IR matcher
* Remove module print
* Update realloc
* Add overloads for 32-bit float math ops
* Add gpu.Pointer type for working with raw pointers
* Add float32 conversion
* Add to_gpu and from_gpu
* clang-format
* Add f32 reduction support to OpenMP
* Fix automatic GPU class conversions
* Fix conversion functions
* Fix conversions
* Rename self
* Fix tuple conversion
* Fix conversions
* Fix conversions
* Update PTX filename
* Fix filename
* Add raw function
* Add GPU docs
* Allow nested object conversions
* Add tests (WIP)
* Update SIMD
* Add staticrange and statictuple loop support
* SIMD updates
* Add new Vec constructors
* Fix UInt conversion
* Fix size-0 allocs
* Add more tests
* Add matmul test
* Rename gpu test file
* Add more tests
* Add alloc cache
* Fix object_to_gpu
* Fix frees
* Fix str conversion
* Fix set conversion
* Fix conversions
* Fix class conversion
* Fix str conversion
* Fix byte conversion
* Fix list conversion
* Fix pointer conversions
* Fix conversions
* Fix conversions
* Update tests
* Fix conversions
* Fix tuple conversion
* Fix tuple conversion
* Fix auto conversions
* Fix conversion
* Fix magics
* Update tests
* Support GPU in JIT mode
* Fix GPU+JIT
* Fix kernel filename in JIT mode
* Add __static_print__; Add earlyDefines; Various domination bugfixes; SimplifyContext RAII base handling
* Fix global static handling
* Fix float32 tests
* FIx gpu module
* Support OpenMP "collapse" option
* Add more collapse tests
* Capture generics and statics
* TraitVar handling
* Python exceptions / isinstance [wip; no_ci]
* clang-format
* Add list comparison operators
* Support empty raise in IR
* Add dict 'or' operator
* Fix repr
* Add copy module
* Fix spacing
* Use sm_30
* Python exceptions
* TypeTrait support; Fix defaultDict
* Fix earlyDefines
* Add defaultdict
* clang-format
* Fix invalid canonicalizations
* Fix empty raise
* Fix copyright
* Add Python numerics option
* Support py-numerics in math module
* Update docs
* Add static Python division / modulus
* Add static py numerics tests
* Fix staticrange/tuple; Add KwTuple.__getitem__
* clang-format
* Add gpu parameter to par
* Fix globals
* Don't init loop vars on loop collapse
* Add par-gpu tests
* Update gpu docs
* Fix isinstance check
* Remove invalid test
* Add -libdevice to set custom path [skip ci]
* Add release notes; bump version [skip ci]
* Add libdevice docs [skip ci]
Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
|
2022-09-15 15:40:00 -04:00 |
Ibrahim Numanagić
|
cb945f569c
|
Typechecker refactoring (#20)
* Initial refactor commit
* Support external vars
* Simplify refactor; Python scoping [wip]
* Python scoping [wip]
* Python scoping [fix loops; wip]
* Fix lambdas
* Python scoping [test fixes; wip]
* Fix scoping [wip]
* Fix basic tests [no-ci]
* Fix tests
* CallExpr refactoring [wip]
* CallExpr refactoring [wip]
* Remove activeUnbounds tracking
* Add core.codon
* Move Function and other core types to core.codon; Revamp Function and Callable types
* Refactor IntExpr, FloatExpr and CallExpr
* Refactor ClassStmt
* Refactor context, IdExpr and DotExpr
* Refactor DotExpr and AssignStmt
* Refactor ImportStmt
* Refactor FunctionStmt
* Refactor
* Remove UpdateStmt
* Refactor AssignReplacementVisitor
* Make SimplifyVisitor in-place
* Fix new scoping
* Fix import type alias handling
* Add docstrings; Complete Simplify refactoring
* Fixes for seqtest
* Refactor typecheck [wip]
* Refactor typecheck [wip]
* Refactor typecheck/access; Remove void anduse NoneType; Fix #18
* Refactor typecheck/assign
* clang-format and cmake-format
* Fix none types in IR
* Multi-error support in simplify
* Fix IR tests for new void
* Simplify ClassStmt
* Refactor cond.cpp
* Refactor error.cpp
* Refactor function.cpp and simplify unbounds
* Refactor op.cpp
* Refactor call.cpp [wip] [no-ci]
* seqassertn updates [noci]
* Refactor call.cpp
* Refactor call.cpp
* Refactor call.cpp
* Refactor typecheck
* clang-tidy updates [noci]
* Refactor infer.cpp [wip]
* Refactor infer.cpp
* Refactor wrapExpr
* Remove visitedAsts
* Remove old base logic
* Refactor typecheck ctx
* Fix JIT bug
* Fix JIT tests
* Scoping fixes [wip] [noci]
* Fix ImperativeForFlow var store
* Add newlines [noci]
* Dump IR module with log flag
* Fix scoping bugs; Add &, ^ and | static operations; Address stylistic review issues
* Fix side effect analysis for for-loops
* Add support for class variables and ClassVar
* Refactor special dot-member cases
* Add codon app tests
* Fix class variables; clang-tidy
* Fix __argv__
* Add datetime constants and update tests
* Fix #25; Add Py_None, Py_True and Py_False; External var support [wip]
* External var support [wip]
* Dump LLVM IR when debug flags are active
* clang-format
* Fix arg var construction
* Extern var fixes
* Undo extern var changes related to stdout etc.
* Fix tuple magics
* Fix extern vars and tuple magics
* Fix duplicate var name error
* Fix extern vars
* Fix #16
* Fix side-effect analysis for try-catch
* Move test C var to test executable
* Add staticmethod
* Fix var status for try-catch
* Fix tests
* Fix shell var name
* Fix test
* Fix app test
* Fix scoping issue (remove dominated identifier from stack)
* Fix no-pie issue
* Use PIC when building library object
* Don't use -no-pie when building library [noci]
* Use -relocation-model=pic in test
* Fix lib build on Linux
* Fix lib build
* Update exceptions to use subclasses vs. header
* Fix __repr__
* Fix tests
* Fix exceptions test
* Don't build docs
Co-authored-by: A. R. Shajii <ars@ars.me>
|
2022-07-26 16:06:00 -04:00 |
Ishak Numanagić
|
f1658ec62e
|
stdlib/internal/box.codon
|
2022-02-28 20:57:12 +01:00 |
A. R. Shajii
|
166e1ad455
|
JIT (#6)
* Add JIT engine
* More engine updates
* Fix takeModule()
* Parser JIT support [wip]
* Fix class definition
* Parser JIT support [wip]
* Parser JIT support [wip]
* Fix LLVM context conflict
* Parser JIT support [wip]
* Fix JIT engine
* Parser JIT support [wip]
* Doc fix
* JIT fix
* Fix JIT exceptions
* Refactor
* Add JIT status codes
* Add compiler class
* Better logging
* Better logging
* Update .gitignore
* Add Jupyter Xeus support
* Update JIT
* Remove print
* Update errors
* Fix assert
* Fix asserts
* Update docgen signature
* Update file system interface
* Update plugin errors
* Use numeric_limits
* JIT refactor [wip]
* Set JIT flag on LLVMVisitor
* Update module opt
* JIT imports
* First Jupyter integration
* Update JIT API to return outputs as string
* Capture runtime exception output
* Fix libbacktrace build
* Initial Jupyter support
* Format
* Fix print
* Support run-mode backtraces
* Fix multithreaded backtrace
* Update backtraces
* Upgrade OpenMP
* Add libunwind
* Fix build
* Fix build
* Fix build
* Fix build
* Fix OpenMP & tests
* Use libbacktrace instead of libunwind
* Add debug listener
* Remove unused include
* Remove unused class
* Fix backtraces
* Update backtrace config
* Fix debug info generation
* Refactor backtraces
* Fix ASAN flag
* Fix JIT
* Fix JIT backtraces
* Fix JIT backtrace
* Fix Jupyter, fix xeus build flags
* Fix JIT output capture
* Fix Jupyter
* Fix Jupyter Python support
* Add __repr_pretty__ support
* Update JIT output capturing
* Better backtrace method names
* Support plugins in JIT mode
Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
|
2021-11-30 11:50:28 -05:00 |
A. R. Shajii
|
132593a36e
|
Add complex type
|
2021-10-04 00:05:24 -04:00 |
A. R. Shajii
|
bf57a4bd48
|
Add "_" files back
|
2021-10-02 18:56:18 -04:00 |