Commits · 36f2e5cc0cf11b5f23c1d985f4eb65daaddedaaf · Recolic / azure-cloud-mining-script

Feb 11, 2019

OpenCL: fix the fix for the blockchain driver · ecbf8828
psychocrypt authored 6 years ago
```
 #2229 was not solving the issues

- revert #2229
- introduce the working fix
```
ecbf8828

OpenCl: fix blockchain driver support · 3f1cea0f

psychocrypt authored 6 years ago

The OpenCl version of the blockchain driver is not understanding if
apointer to a pointer points into shared memory and throw an error
during the compilation.

- revert the usage of the struct to group all shared memory arrays

3f1cea0f

Feb 10, 2019

OpenCl: optimize cn_gpu · 1ca66190
psychocrypt authored 6 years ago
```
Add seperate kernel to prepare the scratchpad memory.
```
1ca66190

CUDA: use shared mem object · b361b395

psychocrypt authored 6 years ago

Combine the shared memory for a hash within one struct.
Reduce the shared memory footprint per hash by 64 byte.

b361b395

better variable nameining · 0c26cb7e

psychocrypt authored 6 years ago

- rename variable names like `b` and `bb` to something with a little bit
of meaning.

0c26cb7e

Feb 09, 2019
- OpenCL: optimize cn_gpu · c60387b3
  psychocrypt authored 6 years ago
```
Optimize cn_gpu
```
  c60387b3
- fix whitespace indention issues · 8461e847
  psychocrypt authored 6 years ago
  
  8461e847
- fix groestl, skein and blake · 8b1a0176
  psychocrypt authored 6 years ago
```
based on the suggestion from @xmrig https://github.com/xmrig/xmrig-amd/commit/db4e169f3a78f273abf89ea8cf5bba7eccf1490b
```
  8b1a0176
Feb 07, 2019

remove cn_turtle as native POW · 1033dc28

psychocrypt authored 6 years ago

cryptonight_turtle is only cryptonight_v8 with a different scratchpad,
iteration and mask value.
We are using now the new machanism to describe such derived POWs.

1033dc28

refactor POW definition · 3426e185

psychocrypt authored 6 years ago

A POW is now defined by a function `f` and three degrees of freedom `f(iteration, scratchpad, mask)`.
`f` is the base algorithm like `cryptonight, cryptonight_gpu`
An easy to pars snytax to write the full POW definition down is: `cryptonight_gpu:0x0000c000:0x00200000:0x001fffc0`

This change make it very easy to integrate the new trend of variate the
number of iteations or the scratchpad size without modifying the full
code.

3426e185

OpenCL: fix groestl · d322ee4f

psychocrypt authored 6 years ago

@xmrig provided the information that the driver 19.2.1 for vega also
create invalid results if pragma unroll is used for the groestl algo.

d322ee4f

Feb 06, 2019

OpenCl: use user defined unroll in cn_gpu · 7008cbe1

psychocrypt authored 6 years ago

- use the user defined unroll
- auto suggestion:
  - only tune for cn_gpu if this is the main user
currency (after a fork)
  - set unroll to 1 for cn_gpu

7008cbe1

OpenCL: fix invalid work group size · ff92f4f2

psychocrypt authored 6 years ago

OpenCl kernel using a larger work group size than configured by the
user to increase the occupancy. Depending on the algorithm and device
the size is limited.

This PR fixes that the user was able to select a invalid work group size.

ff92f4f2

Feb 04, 2019

OpencL: fix cn_gpu · f14528ba

psychocrypt authored 6 years ago

If comp_mode is used the code will not compile.

- fix compile issue
- fix wrong conditions to handle `comp_mode`

f14528ba

Feb 02, 2019
- OpenCL: fix Blake hashing · e274dbcc
  psychocrypt authored 6 years ago
```
Windows driver creates wrong code if unroll is used.
```
  e274dbcc
Feb 01, 2019
- OpenCL: fix work size message · 4524b875
  psychocrypt authored 6 years ago
```
Fix message with the maximal allowed worksize if cryptonight_gpu is
used.
```
  4524b875
- OpenCL: use algorithm names instead of number · 88ea7f36
  psychocrypt authored 6 years ago
```
Use the algorithm names from `cryptonight.hpp` instead if number within the OpenCL kernel.
```
  88ea7f36
Jan 30, 2019

fix compile · 17f3aef0
psychocrypt authored 6 years ago
```
- fix broken trutle coin
- fix non cn_gpu algorithms
```
17f3aef0

Implement CN-GPU Proof-of-Work Algo · 346933d1

fireice-uk authored 6 years ago


Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>
Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com>

346933d1

Jan 25, 2019
- Add CryptoNight Turtle Support. Special thanks to @DaveLong for his hard work in getting this done. · 749751e3
  Brandon Lehmann authored 6 years ago
  
  Unverified
  
  749751e3
Dec 29, 2018

OpenCl: avoid multiple map lookups · 0643f601

psychocrypt authored 6 years ago

Avoid that we do multiple lookups to `std::map` to find the OpenCL
kernel binaries.

0643f601

improve POW algorithm selection · 758dbfb1

psychocrypt authored 6 years ago

- add helper method `GetAllAlgorithms()` to get all active POW
algorithms
- select max scratchpad memory size based on the dev pool and user
algorithms

758dbfb1

OpenCL: allow more than two algorithms · a39ee088

psychocrypt authored 6 years ago

In the current implementation the POW algorithm in dev pool section of a
currency will not be taken into account during the binary creation.
This PR changes the behavior and allow to create binaries for more than two POW algorihms.

a39ee088

Dec 06, 2018

fix bittube2 · e01eebc2

psychocrypt authored 6 years ago

Since #2080 bittube2 is broken.

- reintroduce special AES function for bittube2

e01eebc2

Dec 04, 2018
- Grammar fix · a7bdd603
  MarosM authored 6 years ago
  
  Unverified
  
  a7bdd603
Dec 03, 2018

fix default interleave value · 05b4976d

psychocrypt authored 6 years ago

The default value for interleave was wrongly set to 50.

Remove the value and take the devault from the default constructor instead of side channeling it from the json parser.

05b4976d

OpenCL: enable cn_v8 optimization for NVIDIA · ab19d370

psychocrypt authored 6 years ago

NVIDIA is using clang as device compiler so the reciprocal optimizations was disabled with #2104.

- re-enable optimized reciprocal calculation

ab19d370

Dec 02, 2018

OpenCL: auto tuning option · af87b408

psychocrypt authored 6 years ago

Add an option to brute force intensity settings and lock in at the intensity with the highest hashrate.

- update decumentation of the `interleave` option to mention the side effect with `auto-tune`
- disable `interleave` auto adjustment if `auto-tune` is enabled
- jconf: add `auto-tune` as optional option

af87b408

OpenCl: fix NVIDIA · 1b27f0f3

psychocrypt authored 6 years ago

- fix broken compile: change used `ULL` to `UL` because `UL` is defined as 64bit
- fix memory copy to shared memory via vload8 (somehow it create wrong access)

1b27f0f3

OpenCL: auto config two threads per GPU · e46226fa

psychocrypt authored 6 years ago

The auto config generates for AMD devices now by default two threads per GPU.

- remove the savety 128MiB memory now only from the max available GPU memory not from the avaialble memory for one alloc call
- extend the memory documentation in amd.txt

e46226fa

fix clamp implementation · b606304b
psychocrypt authored 6 years ago
```
Due to a wrong implementation clamp was not working.
```
b606304b

Nov 30, 2018

OpenCL: opimize reciprocal calculation · bc91088a

psychocrypt authored 6 years ago


use for non clang (Rocm) OpenCL a optimized reciprocal calculation without lookup table.

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

bc91088a

OpenCL: comp mode optimization · 307dda83

psychocrypt authored 6 years ago

Disable compatibility mode if intensity is a multiple of worksize. In that case enabled compaibility mode will only slow down the miner.

307dda83

Nov 29, 2018
- Added Cryptonight-Superfast · 053190bb
  LPHuynh authored 6 years ago
  
  053190bb
Nov 27, 2018

OpenCL: thread interleaving · d8316f7d

psychocrypt authored 6 years ago

If two threads are using the same GPU device the start time of each hash round is optimized based on the average time needed to calculate a bunch of hashes.

This way to optimize the hash rate was first introduced by @SChernykh. This implementation based on the implementation in xmrig but differen in the details.

- introduce a new config option `interleave`
- implement thread interleaving

d8316f7d

Nov 21, 2018

OpenCl: optimize strided index 1 · 39fa7c62

psychocrypt authored 6 years ago


Use `mul24` to speedup the scratchpad index calculation.

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

39fa7c62

OpenCL: add strided_index 3 · 3c9442ce

psychocrypt authored 6 years ago


Add new striding index where the memory is chunked by the size of the work group (worksize).

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

3c9442ce

OpenCL: cn1 optimization · 33e5825c
psychocrypt authored 6 years ago
```
small optimization for non cryptonight_v8 algorithms
```
33e5825c

Nov 20, 2018
- OpenCl: optimize cn-v8 div · bff5b000
  SChernykh authored 6 years ago
```
- optimize division
```
  bff5b000
- OpenCL: optimize cn-heavy div · 9813e1c0
  SChernykh authored 6 years ago
```
optimize cryptonight_heavy diff
```
  9813e1c0