Age | Commit message (Collapse) | Author | |
---|---|---|---|
2020-12-12 | Mark loads and stores as alignedbuiltin_fixes_try2 | Andrzej Janik | |
2020-12-12 | Fix bugs | Andrzej Janik | |
2020-12-11 | Fix SPIR-V code generation for PTX special registers (#21) | Andrzej Janik | |
We currently directly map PTX special registers: %ntid, %tid, etc. to SPIR-V builtins with type OpTypeVector %uint 4. This is wrong and leads to a silent corruption, which fails e.g. Depth of Field in GeekBench | |||
2020-12-09 | Refactor how vectors are handled (#20) | vosen | |
Current code has a problem with handling vector members: "b.x" in "mov.u32 a, b.x". This functionality has been kinda tacked-on and has annoying issues: * vector members support is only limited to being source of movs (so "add.u32 a.x, b.x, c.y" will not work) * the width of "b" in "b.x" is not known, which led to some "interesting" workarounds * passes can either convert all member accesses to other member accesses or to temporaries. No way to convert some member accesses to temporaries (which we need for an important fix) This commit solves all this | |||
2020-11-29 | Merge pull request #15 from nilsmartel/patch-2 | vosen | |
Fix small typo | |||
2020-11-29 | Merge pull request #14 from ritschwumm/patch-1 | vosen | |
fix typo in readme | |||
2020-11-27 | Fix small typo | Nils Martel | |
2020-11-27 | fix typo in readme | ritschwumm | |
2020-11-24 | Update wording, add license | Andrzej Janik | |
2020-11-23 | Update README with links to GeekBench resultsv1 | Andrzej Janik | |
2020-11-23 | Append short project name to the device if there's not enough space for long ↵ | Andrzej Janik | |
name | |||
2020-11-23 | Change wording slightly | Andrzej Janik | |
2020-11-23 | Add graph with Geekbench results | Andrzej Janik | |
2020-11-23 | Add README and rebuild .spv library | Andrzej Janik | |
2020-11-23 | Remove temporary file | Andrzej Janik | |
2020-11-23 | Rename everything | Andrzej Janik | |
2020-11-23 | Throw away useless stuff | Andrzej Janik | |
2020-11-22 | Fix typo in selp | Andrzej Janik | |
2020-11-22 | Add 8bit memset | Andrzej Janik | |
2020-11-21 | Fix linking with shl/shr, add memset on host and support __assertfail | Andrzej Janik | |
2020-11-21 | Fix problems with linking | Andrzej Janik | |
2020-11-20 | Fix buggy handling of u8 shared memory | Andrzej Janik | |
2020-11-19 | Implement stateless-to-stateful optimization | Andrzej Janik | |
2020-11-14 | Support more property queries | Andrzej Janik | |
2020-11-12 | Add back erroneously removed functionality | Andrzej Janik | |
2020-11-12 | Refactor host code to use one big lock | Andrzej Janik | |
2020-11-07 | Append project URL to device name and add few missing CUDA v1 functions | Andrzej Janik | |
2020-11-07 | Fix ftz behavior slightly | Andrzej Janik | |
2020-11-06 | Implement instructions bfe, rem, xor | Andrzej Janik | |
2020-11-05 | Implement instructions clz, brev, popc | Andrzej Janik | |
2020-11-05 | Fix same width float-to-float conversions | Andrzej Janik | |
2020-11-05 | Fix issues with .param/.local and implement sin, cos, ex2, lg2 | Andrzej Janik | |
2020-11-01 | Implement neg instruction | Andrzej Janik | |
2020-11-01 | Implement div, sqrt, rsqrt and more of setp | Andrzej Janik | |
2020-10-31 | Implement atomic instructions | Andrzej Janik | |
2020-10-26 | Add support for fma instruction | Andrzej Janik | |
2020-10-26 | Implement selp instruction | Andrzej Janik | |
2020-10-26 | Add support for and instruction | Andrzej Janik | |
2020-10-26 | Support float constants | Andrzej Janik | |
2020-10-25 | Implement ftz handling through Intel extension | Andrzej Janik | |
2020-10-25 | Implement ftz handling through Khronos extensions | Andrzej Janik | |
2020-10-25 | Implement rcp instruction | Andrzej Janik | |
2020-10-25 | Add test for indirect shared mem use | Andrzej Janik | |
2020-10-25 | Add dynamic shared mem support | Andrzej Janik | |
2020-10-18 | Fix small regression | Andrzej Janik | |
2020-10-18 | Implement pass to handle .extern .shared and add parsing code for it | Andrzej Janik | |
2020-10-04 | Add support for top-level global variables, improve array support | Andrzej Janik | |
2020-10-02 | Add sub, min, max | Andrzej Janik | |
2020-10-01 | Implement or | Andrzej Janik | |
2020-10-01 | Implement shr | Andrzej Janik | |