index
:
ZLUDA
better_build
better_inject
blender_42
bpermute
build_fix
build_fixes
build_fixes2
builtin_fixes_try2
carry_fix
cgbn_readme
codegen_variables
compat
cuda-12-4
detours
dump_replay
error_report
feature_matching
fix_build
fix_builtin_wip
geekbench
geekbnch
gh-pages
improve_ci
improve_ci2
kernel_dump
linux_linker_hack
llama2
llvm
malloc_private
master
meshroom
misc_fixes
new_dev
ockl_fix
oidn
older_cuda
parser_rewrite
parser_rewrite_try1
refactor_vectors
regen_tests
remove_sema
repass
repass2
sad_inst
stateful_try1
stateful_try2
sust_suld
troubleshooting_docs
update-readme
update_docs
wave32_report_fix
win_fix
CUDA on AMD GPUs
vosen
about
summary
refs
log
tree
commit
diff
homepage
log msg
author
committer
range
Age
Commit message (
Collapse
)
Author
2020-11-23
Support -nolocalra hack
geekbench
Andrzej Janik
2020-11-23
Append short project name to the device if there's not enough space for long ↵
Andrzej Janik
name
2020-11-23
Change wording slightly
Andrzej Janik
2020-11-23
Add graph with Geekbench results
Andrzej Janik
2020-11-23
Add README and rebuild .spv library
Andrzej Janik
2020-11-23
Remove temporary file
Andrzej Janik
2020-11-23
Rename everything
Andrzej Janik
2020-11-23
Throw away useless stuff
Andrzej Janik
2020-11-22
Fix typo in selp
Andrzej Janik
2020-11-22
Add 8bit memset
Andrzej Janik
2020-11-21
Fix linking with shl/shr, add memset on host and support __assertfail
Andrzej Janik
2020-11-21
Fix problems with linking
Andrzej Janik
2020-11-20
Fix buggy handling of u8 shared memory
Andrzej Janik
2020-11-19
Implement stateless-to-stateful optimization
Andrzej Janik
2020-11-14
Support more property queries
Andrzej Janik
2020-11-12
Add back erroneously removed functionality
Andrzej Janik
2020-11-12
Refactor host code to use one big lock
Andrzej Janik
2020-11-07
Append project URL to device name and add few missing CUDA v1 functions
Andrzej Janik
2020-11-07
Fix ftz behavior slightly
Andrzej Janik
2020-11-06
Implement instructions bfe, rem, xor
Andrzej Janik
2020-11-05
Implement instructions clz, brev, popc
Andrzej Janik
2020-11-05
Fix same width float-to-float conversions
Andrzej Janik
2020-11-05
Fix issues with .param/.local and implement sin, cos, ex2, lg2
Andrzej Janik
2020-11-01
Implement neg instruction
Andrzej Janik
2020-11-01
Implement div, sqrt, rsqrt and more of setp
Andrzej Janik
2020-10-31
Implement atomic instructions
Andrzej Janik
2020-10-26
Add support for fma instruction
Andrzej Janik
2020-10-26
Implement selp instruction
Andrzej Janik
2020-10-26
Add support for and instruction
Andrzej Janik
2020-10-26
Support float constants
Andrzej Janik
2020-10-25
Implement ftz handling through Intel extension
Andrzej Janik
2020-10-25
Implement ftz handling through Khronos extensions
Andrzej Janik
2020-10-25
Implement rcp instruction
Andrzej Janik
2020-10-25
Add test for indirect shared mem use
Andrzej Janik
2020-10-25
Add dynamic shared mem support
Andrzej Janik
2020-10-18
Fix small regression
Andrzej Janik
2020-10-18
Implement pass to handle .extern .shared and add parsing code for it
Andrzej Janik
2020-10-04
Add support for top-level global variables, improve array support
Andrzej Janik
2020-10-02
Add sub, min, max
Andrzej Janik
2020-10-01
Implement or
Andrzej Janik
2020-10-01
Implement shr
Andrzej Janik
2020-10-01
Fix remaining bugs in vector destructuring and in the process improve ↵
Andrzej Janik
implicit conversions
2020-09-30
Implement vector-destructuring mov/ld/st
Andrzej Janik
2020-09-27
Add test for vector extract
Andrzej Janik
2020-09-27
Add missing support for Milestone 1
Andrzej Janik
2020-09-24
Fix minor problems with a private CUDA function
Andrzej Janik
2020-09-24
Implement more host code, moving execution further
Andrzej Janik
2020-09-23
Add a workaround for IGC bug
Andrzej Janik
2020-09-20
Fix more bugs in implicit conversions
Andrzej Janik
2020-09-20
Better propagate information about address-mov and emit conversion for it
Andrzej Janik
[next]