- Jan 24, 2020
-
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
- Nov 08, 2019
-
-
ReinUsesLisp authored
-
- Oct 26, 2019
-
-
Fernando Sahmkow authored
-
- Oct 25, 2019
-
-
ReinUsesLisp authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
- Oct 04, 2019
-
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
- Aug 21, 2019
-
-
ReinUsesLisp authored
Implement VOTE using Nvidia's intrinsics. Documentation about these can be found here https://developer.nvidia.com/reading-between-threads-shader-intrinsics Instead of using portable ARB instructions I opted to use Nvidia intrinsics because these are the closest we have to how Tegra X1 hardware renders. To stub VOTE on non-Nvidia drivers (including nouveau) this commit simulates a GPU with a warp size of one, returning what is meaningful for the instruction being emulated: * anyThreadNV(value) -> value * allThreadsNV(value) -> value * allThreadsEqualNV(value) -> true ballotARB, also known as "uint64_t(activeThreadsNV())", emits VOTE.ANY Rd, PT, PT; on nouveau's compiler. This doesn't match exactly to Nvidia's code VOTE.ALL Rd, PT, PT; Which is emulated with activeThreadsNV() by this commit. In theory this shouldn't really matter since .ANY, .ALL and .EQ affect the predicates (set to PT on those cases) and not the registers.
-
- Jul 19, 2019
-
-
Lioncash authored
Makes it explicit that the conversions here are intentional.
-
- Jul 16, 2019
-
-
Lioncash authored
These are std::shared_ptr instances underneath the hood, which means copying them isn't as cheap as a regular pointer. Particularly so on weakly-ordered systems. This avoids atomic reference count increments and decrements where they aren't necessary for the core set of operations.
-
- Jul 09, 2019
-
-
Fernando Sahmkow authored
Also shows Nvidia's address space on comments.
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
Fernando Sahmkow authored
-
- Jun 21, 2019
-
-
ReinUsesLisp authored
-
- Jun 05, 2019
-
-
ReinUsesLisp authored
Instead of having a vector of unique_ptr stored in a vector and returning star pointers to this, use shared_ptr. While changing initialization code, move it to a separate file when possible. This is a first step to allow code analysis and node generation beyond the ShaderIR class.
-
- Apr 18, 2019
-
-
ReinUsesLisp authored
-
- Feb 26, 2019
-
-
ReinUsesLisp authored
-
- Feb 03, 2019
-
-
ReinUsesLisp authored
It's not always used as a basic block. Rename it for consistency.
-
ReinUsesLisp authored
Some games call LDG at the top of a basic block, making the tracking heuristic to fail. This commit lets the heuristic the decoded nodes as a whole instead of per basic blocks. This may lead to some false positives but allows it the heuristic to track cases it previously couldn't.
-
- Jan 28, 2019
-
-
Lioncash authored
std::moveing a local variable in a return statement has the potential to prevent copy elision from occurring, so this can just be converted into a regular return.
-
- Jan 15, 2019
-
-
ReinUsesLisp authored
-
ReinUsesLisp authored
-
ReinUsesLisp authored
-
ReinUsesLisp authored
-
ReinUsesLisp authored
-