Commits · 5c280e6ff04ae36e8cd7ba81cce4ae89e0a49b80 · many-archive / Suyu

Apr 14, 2019
- shader_ir: Implement STG, keep track of global memory usage and flush · 5c280e6f
  ReinUsesLisp authored Feb 07, 2019
  
  5c280e6f
- Merge pull request #2378 from lioncash/ro · 1f4dfb39
  bunnei authored Apr 13, 2019
```
ldr: Minor amendments to IPC-related parameters
```
  1f4dfb39
- Merge pull request #2373 from FernandoS27/z32 · c9454c84
  bunnei authored Apr 13, 2019
```
Set Pixel Format to Z32 if its R32F and depth compare enabled, and Implement format ZF32_X24S8
```
  c9454c84
- Merge pull request #2357 from zarroboogs/force-30fps-mode · 6088898b
  bunnei authored Apr 13, 2019
```
Add a toggle to force 30FPS mode
```
  6088898b
- Merge pull request #2381 from lioncash/fs · a788c861
  bunnei authored Apr 13, 2019
```
fsp_srv: Minor cleanup related changes
```
  a788c861
- Merge pull request #2386 from ReinUsesLisp/shader-manager · ee2206a1
  bunnei authored Apr 13, 2019
```
gl_shader_manager: Move code to source file and minor clean up
```
  ee2206a1
- Merge pull request #2017 from jroweboy/glwidget · 065f83c6
  bunnei authored Apr 13, 2019
```
Frontend: Migrate to QOpenGLWindow and support shared contexts
```
  065f83c6
- Merge pull request #2389 from FreddyFunk/rename-gamedir · ee3f5764
  bunnei authored Apr 13, 2019
```
ui_settings: Rename game directory variables
```
  ee3f5764
Apr 13, 2019
- Merge pull request #2391 from lioncash/scope · b42595fa
  bunnei authored Apr 12, 2019
```
common/scope_exit: Replace std::move with std::forward in ScopeExit()
```
  b42595fa
- Merge pull request #2392 from lioncash/swap · 0faf7b17
  bunnei authored Apr 12, 2019
```
common/swap: Minor cleanup and improvements to byte swapping functions
```
  0faf7b17
Apr 12, 2019

Fix Clang Format · 382722b9
FreddyFunk authored Apr 12, 2019

382722b9

common/swap: Improve codegen of the default swap fallbacks · 0d8ef2d3

Lioncash authored Apr 11, 2019

Uses arithmetic that can be identified more trivially by compilers for
optimizations. e.g. Rather than shifting the halves of the value and
then swapping and combining them, we can swap them in place.

e.g. for the original swap32 code on x86-64, clang 8.0 would generate:

    mov     ecx, edi
    rol     cx, 8
    shl     ecx, 16
    shr     edi, 16
    rol     di, 8
    movzx   eax, di
    or      eax, ecx
    ret

while GCC 8.3 would generate the ideal:

    mov     eax, edi
    bswap   eax
    ret

now both generate the same optimal output.

MSVC used to generate the following with the old code:

    mov     eax, ecx
    rol     cx, 8
    shr     eax, 16
    rol     ax, 8
    movzx   ecx, cx
    movzx   eax, ax
    shl     ecx, 16
    or      eax, ecx
    ret     0

Now MSVC also generates a similar, but equally optimal result as clang/GCC:

    bswap   ecx
    mov     eax, ecx
    ret     0

====

In the swap64 case, for the original code, clang 8.0 would generate:

    mov     eax, edi
    bswap   eax
    shl     rax, 32
    shr     rdi, 32
    bswap   edi
    or      rax, rdi
    ret

(almost there, but still missing the mark)

while, again, GCC 8.3 would generate the more ideal:

    mov     rax, rdi
    bswap   rax
    ret

now clang also generates the optimal sequence for this fallback as well.

This is a case where MSVC unfortunately falls short, despite the new
code, this one still generates a doozy of an output.

    mov     r8, rcx
    mov     r9, rcx
    mov     rax, 71776119061217280
    mov     rdx, r8
    and     r9, rax
    and     edx, 65280
    mov     rax, rcx
    shr     rax, 16
    or      r9, rax
    mov     rax, rcx
    shr     r9, 16
    mov     rcx, 280375465082880
    and     rax, rcx
    mov     rcx, 1095216660480
    or      r9, rax
    mov     rax, r8
    and     rax, rcx
    shr     r9, 16
    or      r9, rax
    mov     rcx, r8
    mov     rax, r8
    shr     r9, 8
    shl     rax, 16
    and     ecx, 16711680
    or      rdx, rax
    mov     eax, -16777216
    and     rax, r8
    shl     rdx, 16
    or      rdx, rcx
    shl     rdx, 16
    or      rax, rdx
    shl     rax, 8
    or      rax, r9
    ret     0

which is pretty unfortunate.

0d8ef2d3

Merge pull request #2235 from ReinUsesLisp/spirv-decompiler · ea80e2bc
bunnei authored Apr 11, 2019
```
vk_shader_decompiler: Implement a SPIR-V decompiler
```
ea80e2bc
Merge pull request #2360 from lioncash/svc-global · 83a2fb3c
bunnei authored Apr 11, 2019
```
kernel/svc: Deglobalize the supervisor call handlers
```
83a2fb3c
Merge pull request #2388 from lioncash/constexpr · e2f2155d
bunnei authored Apr 11, 2019
```
kernel: Make handle type declarations constexpr
```
e2f2155d
Merge pull request #2387 from FernandoS27/fast-copy-relax · c0b2b702
bunnei authored Apr 11, 2019
```
gl_rasterizer_cache: Relax restrictions on FastCopySurface
```
c0b2b702

common/swap: Mark byte swapping free functions with [[nodiscard]] and noexcept · 66b73fd3

Lioncash authored Apr 11, 2019

Allows the compiler to inform when the result of a swap function is
being ignored (which is 100% a bug in all usage scenarios). We also mark
them noexcept to allow other functions using them to be able to be
marked as noexcept and play nicely with things that potentially inspect
"nothrowability".

66b73fd3

common/swap: Simplify swap function ifdefs · 9cb4b7be

Lioncash authored Apr 11, 2019

Including every OS' own built-in byte swapping functions is kind of
undesirable, since it adds yet another build path to ensure compilation
succeeds on.

Given we only support clang, GCC, and MSVC for the time being, we can
utilize their built-in functions directly instead of going through the
OS's API functions.

This shrinks the overall code down to just

if (msvc)
  use msvc's functions
else if (clang or gcc)
  use clang/gcc's builtins
else
  use the slow path

9cb4b7be

common/swap: Remove 32-bit ARM path · 59895443

Lioncash authored Apr 11, 2019

We don't plan to support host 32-bit ARM execution environments, so this
is essentially dead code.

59895443

common/scope_exit: Replace std::move with std::forward in ScopeExit() · b5696410

Lioncash authored Apr 11, 2019

The template type here is actually a forwarding reference, not an rvalue
reference in this case, so it's more appropriate to use std::forward to
preserve the value category of the type being moved.

b5696410

Apr 11, 2019
- kernel: Make handle type declarations constexpr · 6300ccbc
  Lioncash authored Apr 11, 2019
```
Some objects declare their handle type as const, while others declare it
as constexpr. This makes the const ones constexpr for consistency, and
prevent unexpected compilation errors if these happen to be attempted to be
used within a constexpr context.
```
  6300ccbc
- ui_settings: Rename game directory variables · dffa1a87
  FreddyFunk authored Apr 11, 2019
  
  dffa1a87
- gl_rasterizer_cache: Relax restrictions on FastCopySurface and FastLayeredCopySurface · c9305959
  Fernando Sahmkow authored Apr 11, 2019
  
  c9305959
- Merge pull request #2278 from ReinUsesLisp/vc-texture-cache · 6951741a
  bunnei authored Apr 10, 2019
```
video_core: Implement API agnostic view based texture cache
```
  6951741a
- Merge pull request #2372 from FernandoS27/fermi-fix · 0371650b
  bunnei authored Apr 10, 2019
```
Correct Fermi Copy on Linear Textures.
```
  0371650b
Apr 10, 2019
- gl_shader_manager: Move code to source file and minor clean up · 93af6636
  ReinUsesLisp authored Apr 10, 2019
  
  93af6636
- ldr: Mark IsValidNROHash() as a const member function · dae24498
  Lioncash authored Apr 10, 2019
```
This doesn't modify instance state, so it can be made const.
```
  dae24498
- ldr: Amend parameters for LoadNro/UnloadNro LoadNrr/UnloadNrr · 0032cf38
  Lioncash authored Apr 10, 2019
```
The initial two words indicate a process ID. Also UnloadNro only
specifies one address, not two.
```
  0032cf38
- vk_shader_decompiler: Implement flow primitives · 75d23a36
  ReinUsesLisp authored Mar 14, 2019
  
  75d23a36
- vk_shader_decompiler: Implement most common texture primitives · 58ad8dfa
  ReinUsesLisp authored Mar 14, 2019
  
  58ad8dfa
- vk_shader_decompiler: Implement texture decompilation helper functions · 4667ed8e
  ReinUsesLisp authored Mar 14, 2019
  
  4667ed8e
- vk_shader_decompiler: Implement Assign and LogicalAssign · 676172e2
  ReinUsesLisp authored Mar 14, 2019
  
  676172e2
- vk_shader_decompiler: Implement non-OperationCode visits · d316d248
  ReinUsesLisp authored Mar 14, 2019
  
  d316d248
- vk_shader_decompiler: Implement OperationCode decompilation interface · b758c861
  ReinUsesLisp authored Mar 14, 2019
  
  b758c861
- vk_shader_decompiler: Implement Visit · fec4eb97
  ReinUsesLisp authored Mar 14, 2019
  
  fec4eb97
- vk_shader_decompiler: Implement labels tree and flow · ca51f998
  ReinUsesLisp authored Mar 14, 2019
  
  ca51f998
- vk_shader_decompiler: Implement declarations · 13aa664f
  ReinUsesLisp authored Mar 14, 2019
  
  13aa664f
- vk_shader_decompiler: Declare and stub interface for a SPIR-V decompiler · ad53b233
  ReinUsesLisp authored Mar 14, 2019
  
  ad53b233
- video_core: Add sirit as optional dependency with Vulkan · 970d9e57
  ReinUsesLisp authored Mar 14, 2019
```
sirit is a runtime assembler for SPIR-V
```
  970d9e57
- fsp_srv: Remove unnecessary parameter popping in IDirectory's Read() · 86768320
  Lioncash authored Apr 10, 2019
```
IDirectory's Read() function doesn't take any input parameters. It only
uses the output parameters that we already provide.
```
  86768320