aboutsummaryrefslogtreecommitdiffhomepage
AgeCommit message (Collapse)Author
2020-12-12Mark loads and stores as alignedbuiltin_fixes_try2Andrzej Janik
2020-12-12Fix bugsAndrzej Janik
2020-12-11Fix SPIR-V code generation for PTX special registers (#21)Andrzej Janik
We currently directly map PTX special registers: %ntid, %tid, etc. to SPIR-V builtins with type OpTypeVector %uint 4. This is wrong and leads to a silent corruption, which fails e.g. Depth of Field in GeekBench
2020-12-09Refactor how vectors are handled (#20)vosen
Current code has a problem with handling vector members: "b.x" in "mov.u32 a, b.x". This functionality has been kinda tacked-on and has annoying issues: * vector members support is only limited to being source of movs (so "add.u32 a.x, b.x, c.y" will not work) * the width of "b" in "b.x" is not known, which led to some "interesting" workarounds * passes can either convert all member accesses to other member accesses or to temporaries. No way to convert some member accesses to temporaries (which we need for an important fix) This commit solves all this
2020-11-29Merge pull request #15 from nilsmartel/patch-2vosen
Fix small typo
2020-11-29Merge pull request #14 from ritschwumm/patch-1vosen
fix typo in readme
2020-11-27Fix small typoNils Martel
2020-11-27fix typo in readmeritschwumm
2020-11-24Update wording, add licenseAndrzej Janik
2020-11-23Update README with links to GeekBench resultsv1Andrzej Janik
2020-11-23Append short project name to the device if there's not enough space for long ↵Andrzej Janik
name
2020-11-23Change wording slightlyAndrzej Janik
2020-11-23Add graph with Geekbench resultsAndrzej Janik
2020-11-23Add README and rebuild .spv libraryAndrzej Janik
2020-11-23Remove temporary fileAndrzej Janik
2020-11-23Rename everythingAndrzej Janik
2020-11-23Throw away useless stuffAndrzej Janik
2020-11-22Fix typo in selpAndrzej Janik
2020-11-22Add 8bit memsetAndrzej Janik
2020-11-21Fix linking with shl/shr, add memset on host and support __assertfailAndrzej Janik
2020-11-21Fix problems with linkingAndrzej Janik
2020-11-20Fix buggy handling of u8 shared memoryAndrzej Janik
2020-11-19Implement stateless-to-stateful optimizationAndrzej Janik
2020-11-14Support more property queriesAndrzej Janik
2020-11-12Add back erroneously removed functionalityAndrzej Janik
2020-11-12Refactor host code to use one big lockAndrzej Janik
2020-11-07Append project URL to device name and add few missing CUDA v1 functionsAndrzej Janik
2020-11-07Fix ftz behavior slightlyAndrzej Janik
2020-11-06Implement instructions bfe, rem, xorAndrzej Janik
2020-11-05Implement instructions clz, brev, popcAndrzej Janik
2020-11-05Fix same width float-to-float conversionsAndrzej Janik
2020-11-05Fix issues with .param/.local and implement sin, cos, ex2, lg2Andrzej Janik
2020-11-01Implement neg instructionAndrzej Janik
2020-11-01Implement div, sqrt, rsqrt and more of setpAndrzej Janik
2020-10-31Implement atomic instructionsAndrzej Janik
2020-10-26Add support for fma instructionAndrzej Janik
2020-10-26Implement selp instructionAndrzej Janik
2020-10-26Add support for and instructionAndrzej Janik
2020-10-26Support float constantsAndrzej Janik
2020-10-25Implement ftz handling through Intel extensionAndrzej Janik
2020-10-25Implement ftz handling through Khronos extensionsAndrzej Janik
2020-10-25Implement rcp instructionAndrzej Janik
2020-10-25Add test for indirect shared mem useAndrzej Janik
2020-10-25Add dynamic shared mem supportAndrzej Janik
2020-10-18Fix small regressionAndrzej Janik
2020-10-18Implement pass to handle .extern .shared and add parsing code for itAndrzej Janik
2020-10-04Add support for top-level global variables, improve array supportAndrzej Janik
2020-10-02Add sub, min, maxAndrzej Janik
2020-10-01Implement orAndrzej Janik
2020-10-01Implement shrAndrzej Janik