Commit Graph

6 Commits (ebd344f8949858f7cbaf30956f0e57653a2b79e4)

Author SHA1 Message Date
A. R. Shajii ebd344f894
GPU and other updates (#52)
* Add nvptx pass

* Fix spaces

* Don't change name

* Add runtime support

* Add init call

* Add more runtime functions

* Add launch function

* Add intrinsics

* Fix codegen

* Run GPU pass between general opt passes

* Set data layout

* Create context

* Link libdevice

* Add function remapping

* Fix linkage

* Fix libdevice link

* Fix linking

* Fix personality

* Fix linking

* Fix linking

* Fix linking

* Add internalize pass

* Add more math conversions

* Add more re-mappings

* Fix conversions

* Fix __str__

* Add decorator attribute for any decorator

* Update kernel decorator

* Fix kernel decorator

* Fix kernel decorator

* Fix kernel decorator

* Fix kernel decorator

* Remove old decorator

* Fix pointer calc

* Fix fill-in codegen

* Fix linkage

* Add comment

* Update list conversion

* Add more conversions

* Add dict and set conversions

* Add float32 type to IR/LLVM

* Add float32

* Add float32 stdlib

* Keep required global values in PTX module

* Fix PTX module pruning

* Fix malloc

* Set will-return

* Fix name cleanup

* Fix access

* Fix name cleanup

* Fix function renaming

* Update dimension API

* Fix args

* Clean up API

* Move GPU transformations to end of opt pipeline

* Fix alloc replacements

* Fix naming

* Target PTX 4.2

* Fix global renaming

* Fix early return in static blocks; Add __realized__ function

* Format

* Add __llvm_name__ for functions

* Add vector type to IR

* SIMD support [wip]

* Update kernel naming

* Fix early returns; Fix SIMD calls

* Fix kernel naming

* Fix IR matcher

* Remove module print

* Update realloc

* Add overloads for 32-bit float math ops

* Add gpu.Pointer type for working with raw pointers

* Add float32 conversion

* Add to_gpu and from_gpu

* clang-format

* Add f32 reduction support to OpenMP

* Fix automatic GPU class conversions

* Fix conversion functions

* Fix conversions

* Rename self

* Fix tuple conversion

* Fix conversions

* Fix conversions

* Update PTX filename

* Fix filename

* Add raw function

* Add GPU docs

* Allow nested object conversions

* Add tests (WIP)

* Update SIMD

* Add staticrange and statictuple loop support

* SIMD updates

* Add new Vec constructors

* Fix UInt conversion

* Fix size-0 allocs

* Add more tests

* Add matmul test

* Rename gpu test file

* Add more tests

* Add alloc cache

* Fix object_to_gpu

* Fix frees

* Fix str conversion

* Fix set conversion

* Fix conversions

* Fix class conversion

* Fix str conversion

* Fix byte conversion

* Fix list conversion

* Fix pointer conversions

* Fix conversions

* Fix conversions

* Update tests

* Fix conversions

* Fix tuple conversion

* Fix tuple conversion

* Fix auto conversions

* Fix conversion

* Fix magics

* Update tests

* Support GPU in JIT mode

* Fix GPU+JIT

* Fix kernel filename in JIT mode

* Add __static_print__; Add earlyDefines; Various domination bugfixes; SimplifyContext RAII base handling

* Fix global static handling

* Fix float32 tests

* FIx gpu module

* Support OpenMP "collapse" option

* Add more collapse tests

* Capture generics and statics

* TraitVar handling

* Python exceptions / isinstance [wip; no_ci]

* clang-format

* Add list comparison operators

* Support empty raise in IR

* Add dict 'or' operator

* Fix repr

* Add copy module

* Fix spacing

* Use sm_30

* Python exceptions

* TypeTrait support; Fix defaultDict

* Fix earlyDefines

* Add defaultdict

* clang-format

* Fix invalid canonicalizations

* Fix empty raise

* Fix copyright

* Add Python numerics option

* Support py-numerics in math module

* Update docs

* Add static Python division / modulus

* Add static py numerics tests

* Fix staticrange/tuple; Add KwTuple.__getitem__

* clang-format

* Add gpu parameter to par

* Fix globals

* Don't init loop vars on loop collapse

* Add par-gpu tests

* Update gpu docs

* Fix isinstance check

* Remove invalid test

* Add -libdevice to set custom path [skip ci]

* Add release notes; bump version [skip ci]

* Add libdevice docs [skip ci]

Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
2022-09-15 15:40:00 -04:00
Ibrahim Numanagić cb945f569c
Typechecker refactoring (#20)
* Initial refactor commit

* Support external vars

* Simplify refactor; Python scoping [wip]

* Python scoping [wip]

* Python scoping [fix loops; wip]

* Fix lambdas

* Python scoping [test fixes; wip]

* Fix scoping [wip]

* Fix basic tests [no-ci]

* Fix tests

* CallExpr refactoring [wip]

* CallExpr refactoring [wip]

* Remove activeUnbounds tracking

* Add core.codon

* Move Function and other core types to core.codon; Revamp Function and Callable types

* Refactor IntExpr, FloatExpr and CallExpr

* Refactor ClassStmt

* Refactor context, IdExpr and DotExpr

* Refactor DotExpr and AssignStmt

* Refactor ImportStmt

* Refactor FunctionStmt

* Refactor

* Remove UpdateStmt

* Refactor AssignReplacementVisitor

* Make SimplifyVisitor in-place

* Fix new scoping

* Fix import type alias handling

* Add docstrings; Complete Simplify refactoring

* Fixes for seqtest

* Refactor typecheck [wip]

* Refactor typecheck [wip]

* Refactor typecheck/access; Remove void anduse NoneType; Fix #18

* Refactor typecheck/assign

* clang-format and cmake-format

* Fix none types in IR

* Multi-error support in simplify

* Fix IR tests for new void

* Simplify ClassStmt

* Refactor cond.cpp

* Refactor error.cpp

* Refactor function.cpp and simplify unbounds

* Refactor op.cpp

* Refactor call.cpp [wip] [no-ci]

* seqassertn updates [noci]

* Refactor call.cpp

* Refactor call.cpp

* Refactor call.cpp

* Refactor typecheck

* clang-tidy updates [noci]

* Refactor infer.cpp [wip]

* Refactor infer.cpp

* Refactor wrapExpr

* Remove visitedAsts

* Remove old base logic

* Refactor typecheck ctx

* Fix JIT bug

* Fix JIT tests

* Scoping fixes [wip] [noci]

* Fix ImperativeForFlow var store

* Add newlines [noci]

* Dump IR module with log flag

* Fix scoping bugs; Add &, ^ and | static operations; Address stylistic review issues

* Fix side effect analysis for for-loops

* Add support for class variables and ClassVar

* Refactor special dot-member cases

* Add codon app tests

* Fix class variables; clang-tidy

* Fix __argv__

* Add datetime constants and update tests

* Fix #25; Add Py_None, Py_True and Py_False; External var support [wip]

* External var support [wip]

* Dump LLVM IR when debug flags are active

* clang-format

* Fix arg var construction

* Extern var fixes

* Undo extern var changes related to stdout etc.

* Fix tuple magics

* Fix extern vars and tuple magics

* Fix duplicate var name error

* Fix extern vars

* Fix #16

* Fix side-effect analysis for try-catch

* Move test C var to test executable

* Add staticmethod

* Fix var status for try-catch

* Fix tests

* Fix shell var name

* Fix test

* Fix app test

* Fix scoping issue (remove dominated identifier from stack)

* Fix no-pie issue

* Use PIC when building library object

* Don't use -no-pie when building library [noci]

* Use -relocation-model=pic in test

* Fix lib build on Linux

* Fix lib build

* Update exceptions to use subclasses vs. header

* Fix __repr__

* Fix tests

* Fix exceptions test

* Don't build docs

Co-authored-by: A. R. Shajii <ars@ars.me>
2022-07-26 16:06:00 -04:00
Ishak Numanagić f1658ec62e stdlib/internal/box.codon 2022-02-28 20:57:12 +01:00
A. R. Shajii 166e1ad455
JIT (#6)
* Add JIT engine

* More engine updates

* Fix takeModule()

* Parser JIT support [wip]

* Fix class definition

* Parser JIT support [wip]

* Parser JIT support [wip]

* Fix LLVM context conflict

* Parser JIT support [wip]

* Fix JIT engine

* Parser JIT support [wip]

* Doc fix

* JIT fix

* Fix JIT exceptions

* Refactor

* Add JIT status codes

* Add compiler class

* Better logging

* Better logging

* Update .gitignore

* Add Jupyter Xeus support

* Update JIT

* Remove print

* Update errors

* Fix assert

* Fix asserts

* Update docgen signature

* Update file system interface

* Update plugin errors

* Use numeric_limits

* JIT refactor [wip]

* Set JIT flag on LLVMVisitor

* Update module opt

* JIT imports

* First Jupyter integration

* Update JIT API to return outputs as string

* Capture runtime exception output

* Fix libbacktrace build

* Initial Jupyter support

* Format

* Fix print

* Support run-mode backtraces

* Fix multithreaded backtrace

* Update backtraces

* Upgrade OpenMP

* Add libunwind

* Fix build

* Fix build

* Fix build

* Fix build

* Fix OpenMP & tests

* Use libbacktrace instead of libunwind

* Add debug listener

* Remove unused include

* Remove unused class

* Fix backtraces

* Update backtrace config

* Fix debug info generation

* Refactor backtraces

* Fix ASAN flag

* Fix JIT

* Fix JIT backtraces

* Fix JIT backtrace

* Fix Jupyter, fix xeus build flags

* Fix JIT output capture

* Fix Jupyter

* Fix Jupyter Python support

* Add __repr_pretty__ support

* Update JIT output capturing

* Better backtrace method names

* Support plugins in JIT mode

Co-authored-by: Ibrahim Numanagić <ibrahimpasa@gmail.com>
2021-11-30 11:50:28 -05:00
A. R. Shajii 132593a36e Add complex type 2021-10-04 00:05:24 -04:00
A. R. Shajii bf57a4bd48 Add "_" files back 2021-10-02 18:56:18 -04:00