Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 12 Jan 2017 23:48:10 -0800
From:      <soralx@cydem.org>
To:        <freebsd-virtualization@freebsd.org>
Subject:   Re: Issues with GTX960 on CentOS7 using bhyve PCI passthru (FreeBSD 11-RC2)
Message-ID:  <20170112234810.2cb83671@mscad14>
In-Reply-To: <201701130144.v0D1ifxJ051207@pdx.rh.CN85.dnsmgr.net>
References:  <20170111213941.0789c8ce@mscad14> <201701130144.v0D1ifxJ051207@pdx.rh.CN85.dnsmgr.net>

next in thread | previous in thread | raw e-mail | index | archive | help

Good news, everyone!

 I tried an AMD card, and it is almost working. I have a lot of logs
 and info, but I will try to restrain the length of this message.

 There was no need to do anything special to get the card to work,
 other than figuring out how to deal with Linux, setting up drivers
 and OpenCL SDK & linking libraries to the right places, compiling
 soft, etc.

 First, PCI info [0] and some dmesg bits [1]. AMD drivers:
  amdgpu-pro-16.50-362463.tar.xz
  AMD-APP-SDKInstaller-v3.0.130.136-GA-linux64.tar.bz2

 Next, the good news: Xorg starts, and display works!.. Kind of:
  -- `glxgears` window is flickery, has parts of gears missing,
     and does not look good in general;
  -- xterm window has the rectangular cursor shapes plastered
     all over, in random places;
  -- full-screen (1600x1200) `glxgears` is slower than expected,
     and the performance varies suddenly [2];
  -- VDPAU works, but I suspect it's not using the GPU [3][4];
     I haven't figure a way how to force the use of GPU. Also,
     the main window with text looks OK most of the time (when
     doing the video test and in the end, in particular), but
     show a smaller black rectangle in top left corner of the
     screen instead of the video samples;
  -- it almost feels like the DMA and framebuffers aren't always
     correctly configured, but still are transferring data [from
     somewhere to somewhere sometimes].
  
 I'm getting lots of messages like [5][6], among others, in
 various cases.

 Of some 3 OpenCL applications I tested, one appeared to complete
 successfully [7]. Running it also produces messages as in [6].
 But the numbers make sense, comparing to e.g. tests of R9 Nano
 (~/mixbench/results/OpenCL/alt_R9-Nano_d1912.5.log) and expectations
 of the GPU chip. This is exciting! Dunno if the benchmark check
 whether the computations are correct, though.

 `clinfo` result [8] is a bit mixed... e.g., "Max clock frequency:
 555Mhz" is wrong.

 Some more tests [9].
 

[0]
00:04.0 0300: 1002:67df (rev c7) (prog-if 00 [VGA controller])
        Subsystem: 1682:9480
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 29
        Region 0: Memory at 3400000000 (64-bit, prefetchable) [size=256M]
        Region 2: Memory at c0200000 (64-bit, prefetchable) [size=2M]
        Region 4: I/O ports at 2100 [size=256]
        Region 5: Memory at c0400000 (32-bit, non-prefetchable) [size=256K]
        Expansion ROM at c0020000 [disabled] [size=128K]
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1+,D2+,D3hot+,D3cold+)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [58] Express (v2) Legacy Endpoint, MSI 00
                DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <4us, L1 unlimited
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd- ExtTag+ PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 256 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend-
                LnkCap: Port #0, Speed 8GT/s, Width x16, ASPM L1, Exit Latency L0s <64ns, L1 <1us
                        ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp+
                LnkCtl: ASPM L1 Enabled; RCB 64 bytes Disabled- CommClk+
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 2.5GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
                DevCap2: Completion Timeout: Not Supported, TimeoutDis-, LTR+, OBFF Not Supported
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR+, OBFF Disabled
                LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete+, EqualizationPhase1+
                         EqualizationPhase2+, EqualizationPhase3+, LinkEqualizationRequest-
        Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
                Address: 00000000fee03000  Data: 4022
        Kernel driver in use: amdgpu
        Kernel modules: amdgpu

00:05.0 0300: 10de:0dd8 (rev a1) (prog-if 00 [VGA controller])
        Subsystem: 10de:084a
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Interrupt: pin A routed to IRQ 16
        Region 0: Memory at c2000000 (32-bit, non-prefetchable) [size=32M]
        Region 1: Memory at 3410000000 (64-bit, prefetchable) [size=128M]
        Region 3: Memory at 3418000000 (64-bit, prefetchable) [size=64M]
        Region 5: I/O ports at 2200 [size=128]
        [virtual] Expansion ROM at c0080000 [disabled] [size=512K]
        Capabilities: [60] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
                Address: 0000000000000000  Data: 0000
        Capabilities: [78] Express (v2) Endpoint, MSI 00
                DevCap: MaxPayload 128 bytes, PhantFunc 0, Latency L0s <4us, L1 <64us
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 128 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
                LnkCap: Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <256ns, L1 <4us
                        ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp-
                LnkCtl: ASPM L0s L1 Enabled; RCB 128 bytes Disabled- CommClk+
                        ExtSynch- ClockPM+ AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
                DevCap2: Completion Timeout: Not Supported, TimeoutDis+, LTR-, OBFF Not Supported
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
                LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
                         EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
        Capabilities: [b4] Vendor Specific Information: Len=14 <?>
        Kernel driver in use: nvidia
        Kernel modules: nvidiafb, nouveau, nvidia_367, nvidia_367_drm


[1]
[    0.617109] pci 0000:00:04.0: can't claim BAR 6 [mem 0xf6840000-0xf685ffff pref]: no compatible bridge window
[    0.617806] pci 0000:00:05.0: can't claim BAR 6 [mem 0xf6000000-0xf607ffff pref]: no compatible bridge window
[    0.618496] pci 0000:00:05.0: BAR 6: assigned [mem 0xc0080000-0xc00fffff pref]
[    0.619000] pci 0000:00:04.0: BAR 6: assigned [mem 0xc0020000-0xc003ffff pref]
[    0.619508] pci 0000:00:01.0: BAR 6: assigned [mem 0xc0004000-0xc00047ff pref]
[    0.620011] pci 0000:00:02.0: BAR 6: assigned [mem 0xc0004800-0xc0004fff pref]
[    0.620513] pci 0000:00:03.0: BAR 6: assigned [mem 0xc0005000-0xc00057ff pref]
[    0.621033] pci_bus 0000:00: resource 4 [io  0x0000-0x0cf7 window]
[    0.621034] pci_bus 0000:00: resource 5 [io  0x0d00-0x1fff window]
[    0.621035] pci_bus 0000:00: resource 6 [io  0x2000-0x229f window]
[    0.621036] pci_bus 0000:00: resource 7 [mem 0xc0000000-0xc40fffff window]
[    0.621037] pci_bus 0000:00: resource 8 [mem 0x3400000000-0x341c0fffff window]

[    1.692950] [drm] Initialized drm 1.1.0 20060810
[    1.701483] AVX2 version of gcm_enc/dec engaged.
[    1.702477] AES CTR mode by8 optimization enabled
[    1.705010] nvidia: module license 'NVIDIA' taints kernel.
[    1.705982] Disabling lock debugging due to kernel taint
[    1.706716] amdgpu: module verification failed: signature and/or required key missing - tainting kernel
[    1.714692] Could not create tracefs 'fence_wait_end' directory
[    1.715984] Could not create tracefs 'fence_wait_start' directory
[    1.717618] Could not create tracefs 'fence_signaled' directory
[    1.718110] Could not create tracefs 'fence_enable_signal' directory
[    1.718593] Could not create tracefs 'fence_destroy' directory
[    1.719214] Could not create tracefs 'fence_init' directory
[    1.719616] Could not create tracefs 'fence_emit' directory
[    1.720027] Could not create tracefs 'fence_annotate_wait_on' directory
[    1.724756] nvidia 0000:00:05.0: can't derive routing for PCI INT A
[    1.726058] nvidia 0000:00:05.0: PCI INT A: no GSI
[    1.727090] vgaarb: device changed decodes: PCI:0000:00:05.0,olddecodes=io+mem,decodes=none:owns=io+mem
[    1.729188] nvidia-nvlink: Nvlink Core is being initialized, major device number 247
[    1.730133] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  367.57  Mon Oct  3 20:37:01 PDT 2016
[    1.741012] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  367.57  Mon Oct  3 20:32:57 PDT 2016
[    1.743758] [drm] [nvidia-drm] [GPU ID 0x00000005] Loading driver
[    1.808253] [drm] amdgpu kernel modesetting enabled.
[    1.810043] AMD IOMMUv2 driver by Joerg Roedel <jroedel@suse.de>
[    1.810481] AMD IOMMUv2 functionality not available on this system
[    1.812928] CRAT table not found
[    1.813165] Finished initializing topology ret=0
[    1.813570] kfd kfd: Initialized module
[    1.814088] amdgpu 0000:00:04.0: can't derive routing for PCI INT A
[    1.814527] amdgpu 0000:00:04.0: PCI INT A: no GSI
[    1.815077] [drm] initializing kernel modesetting (POLARIS10 0x1002:0x67DF 0x1682:0x9480 0xC7).
[    1.815690] [drm] register mmio base: 0xC0400000
[    1.816012] [drm] register mmio size: 262144
[    1.816332] [drm] doorbell mmio base: 0xC0200000
[    1.816659] [drm] doorbell mmio size: 2097152
[    1.816985] [drm] UVD is enabled in VM mode
[    1.817279] [drm] VCE enabled in VM mode
[    1.817567] amdgpu 0000:00:04.0: Invalid ROM contents
[    1.990245] ATOM BIOS: D00901
[    1.990462] [drm] GPU posting now...
[    2.043833] tsc: Refined TSC clocksource calibration: 3489.546 MHz
[    2.044265] clocksource: tsc: mask: 0xffffffffffffffff max_cycles: 0x324cbd595a2, max_idle_ns: 440795315348 ns
[    2.121186] amdgpu 0000:00:04.0: VRAM: 8192M 0x0000000000000000 - 0x00000001FFFFFFFF (8192M used)
[    2.121808] amdgpu 0000:00:04.0: GTT: 8192M 0x0000000200000000 - 0x00000003FFFFFFFF
[    2.122342] [drm] Detected VRAM RAM=8192M, BAR=256M
[    2.122686] [drm] RAM width 256bits GDDR5
[    2.123063] [TTM] Zone  kernel: Available graphics memory: 2023196 kiB
[    2.123521] [TTM] Initializing pool allocator
[    2.123872] [TTM] Initializing DMA pool allocator
[    2.124214] [drm] amdgpu: 8192M of VRAM memory ready
[    2.124563] [drm] amdgpu: 8192M of GTT memory ready.
[    2.124914] [drm] GART: num cpu pages 2097152, num gpu pages 2097152
[    2.126509] [drm] PCIE GART of 8192M enabled (table at 0x0000000000040000).
[    2.128085] amdgpu 0000:00:04.0: amdgpu: using MSI.
[    2.136333] [drm] amdgpu: irq initialized.
[    2.136631] Can't find requested voltage id in vdd_dep_on_sclk table!
[    2.137190] amdgpu: powerplay initialized
[    2.137738] amdgpu 0000:00:04.0: fence driver on ring 0 use gpu addr 0x0000000200000008, cpu addr 0xffff880006650008
[    2.138528] amdgpu 0000:00:04.0: fence driver on ring 1 use gpu addr 0x0000000200000018, cpu addr 0xffff880006650018
[    2.139317] amdgpu 0000:00:04.0: fence driver on ring 2 use gpu addr 0x0000000200000028, cpu addr 0xffff880006650028
[    2.140119] amdgpu 0000:00:04.0: fence driver on ring 3 use gpu addr 0x0000000200000038, cpu addr 0xffff880006650038
[    2.140912] amdgpu 0000:00:04.0: fence driver on ring 4 use gpu addr 0x0000000200000048, cpu addr 0xffff880006650048
[    2.141689] amdgpu 0000:00:04.0: fence driver on ring 5 use gpu addr 0x0000000200000058, cpu addr 0xffff880006650058
[    2.142490] amdgpu 0000:00:04.0: fence driver on ring 6 use gpu addr 0x0000000200000068, cpu addr 0xffff880006650068
[    2.143283] amdgpu 0000:00:04.0: fence driver on ring 7 use gpu addr 0x0000000200000078, cpu addr 0xffff880006650078
[    2.144055] amdgpu 0000:00:04.0: fence driver on ring 8 use gpu addr 0x0000000200000088, cpu addr 0xffff880006650088
[    2.144888] amdgpu 0000:00:04.0: fence driver on ring 9 use gpu addr 0x0000000200000098, cpu addr 0xffff880006650098
[    2.145663] amdgpu 0000:00:04.0: fence driver on ring 10 use gpu addr 0x00000002000000a8, cpu addr 0xffff8800066500a8
[    2.146571] [drm] Found UVD firmware Version: 1.79 Family ID: 16
[    2.214736] amdgpu 0000:00:04.0: fence driver on ring 11 use gpu addr 0x000000000109c420, cpu addr 0xffffc90000c5a420
[    2.215510] [drm] Found VCE firmware Version: 52.4 Binary ID: 3
[    2.224329] amdgpu 0000:00:04.0: fence driver on ring 12 use gpu addr 0x00000002000000c8, cpu addr 0xffff8800066500c8
[    2.225113] amdgpu 0000:00:04.0: fence driver on ring 13 use gpu addr 0x00000002000000d8, cpu addr 0xffff8800066500d8
[    2.225904] amdgpu 0000:00:04.0: fence driver on ring 14 use gpu addr 0x00000002000000e8, cpu addr 0xffff8800066500e8
[    2.553099] 
                failed to send message 150 ret is 0 
[    2.875033] 
                failed to send pre message 136 ret is 0 
[    2.876529] [drm] DAL is enabled
[    2.876840] [drm] [FeatureOverride_overriden feature]        Overridden FEATURE_ENABLE_GPU_SCALING is enabled now
[    2.877528] [drm] [Backlight_Caps]   This method is not supported
[    2.878140] [drm] DM_PPLIB: values for Engine clock
[    2.878488] [drm] DM_PPLIB:   30000
[    2.878739] [drm] DM_PPLIB:   60800
[    2.879011] [drm] DM_PPLIB:   91000
[    2.879264] [drm] DM_PPLIB:   107700
[    2.879514] [drm] DM_PPLIB:   114500
[    2.879763] [drm] DM_PPLIB:   119100
[    2.880013] [drm] DM_PPLIB:   123600
[    2.880259] [drm] DM_PPLIB:   128800
[    2.880507] [drm] DM_PPLIB: Warning: using default validation clocks!
[    2.880961] [drm] DM_PPLIB: Validation clocks:
[    2.881273] [drm] DM_PPLIB:    engine_max_clock: 72000
[    2.881635] [drm] DM_PPLIB:    memory_max_clock: 80000
[    2.881996] [drm] DM_PPLIB:    level           : 0
[    2.882338] [drm] DM_PPLIB: reducing engine clock level from 8 to 3
[    2.882777] [drm] DM_PPLIB: values for Memory clock
[    2.883140] [drm] DM_PPLIB:   30000
[    2.883381] [drm] DM_PPLIB:   200000
[    2.883631] [drm] DM_PPLIB: Warning: using default validation clocks!
[    2.884080] [drm] DM_PPLIB: Validation clocks:
[    2.884391] [drm] DM_PPLIB:    engine_max_clock: 72000
[    2.884752] [drm] DM_PPLIB:    memory_max_clock: 80000
[    2.885115] [drm] DM_PPLIB:    level           : 0
[    2.885458] [drm] DM_PPLIB: reducing memory clock level from 2 to 2
[    2.885903] [drm] DC: create_links: connectors_num: physical:5, virtual:0
[    2.886384] [drm] [HwTrace_Hotplug]  Connector[0] description:signal 32
[    2.886846] [drm] [I2cAux_Config]    Using channel: CHANNEL_ID_DDC1 [1]
[    2.887325] [drm] [HwTrace_Hotplug]  Connector[1] description:signal 32
[    2.887781] [drm] [I2cAux_Config]    Using channel: CHANNEL_ID_DDC3 [3]
[    2.888240] [drm] [HwTrace_Hotplug]  Connector[2] description:signal 32
[    2.888700] [drm] [I2cAux_Config]    Using channel: CHANNEL_ID_DDC2 [2]
[    2.889153] [drm] [HwTrace_Hotplug]  Connector[3] description:signal 4
[    2.889607] [drm] [I2cAux_Config]    Using channel: CHANNEL_ID_DDC4 [4]
[    2.890093] [drm] [HwTrace_Hotplug]  Connector[4] description:signal 2
[    2.890554] [drm] [I2cAux_Config]    Using channel: CHANNEL_ID_DDC6 [6]
[    2.893310] [drm] [IfTrace_DC]       Display Core initialized
[    2.893687] [drm] amdgpu: freesync_module init done ffff880005005700.
[    2.894391] [drm] [HwTrace_Hotplug]  link=0, dc_sink_in=          (null) is now Disconnected
[    2.895019] [drm] DCHPD: connector_id=0: dc_sink didn't change.
[    2.895618] [drm] [HwTrace_Hotplug]  link=1, dc_sink_in=          (null) is now Disconnected
[    2.896204] [drm] DCHPD: connector_id=1: dc_sink didn't change.
[    2.896791] [drm] [HwTrace_Hotplug]  link=2, dc_sink_in=          (null) is now Disconnected
[    2.897366] [drm] DCHPD: connector_id=2: dc_sink didn't change.
[    2.897878] [drm] [HwTrace_Hotplug]  link=3, dc_sink_in=          (null) is now Disconnected
[    2.898455] [drm] DCHPD: connector_id=3: dc_sink didn't change.
[    2.911776] [drm] SADs count is: -2, don't need to read it
[    2.912168] [drm] [Conn_Detect]      [DVI-D-1] DELL 2007FP: [Block 0] 00 FF FF FF FF FF FF 00 10 AC 21 A0 4C 38 56 32 05 11 01 03 80 29 1F 78 EE EE 91 A3 54 4C 99 26 0F 50 54 A5 4B 00 81 80 A9 40 71 4F 01 01 01 01 01 01 01 01 01 01 48 3F 40 30 62 B0 32 40 40 C0 13 00 6F 13 11 00 00 1E 00 00 00 FF 00 43 39 35 33 36 37 31 55 32 56 38 4C 0A 00 00 00 FC 00 44 45 4C 4C 20 32 30 30 37 46 50 0A 20 00 00 00 FD 00 38 4C 1E 53 11 00 0A 20 20 20 20 20 20 00 9F ^
[    2.914871] [drm] [Detection_EDID Parser]    dc_link_detect: manufacturer_id = AC10, product_id = A021, serial_number = 3256384C, manufacture_week = 5, manufacture_year = 17, display_name = DELL 2007FP, speaker_flag = 0, audio_mode_count = 0
[    2.916301] [drm] [HwTrace_Hotplug]  link=4, dc_sink_in=ffff880137b03000 is now Connected
[    2.916867] [drm] DCHPD: connector_id=4: Old sink=          (null) New sink=ffff880137b03000
[    2.917458] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[    2.917923] [drm] Driver supports precise vblank timestamp query.
[    2.918351] [drm] KMS initialized.
[    2.920363] [drm] ring test on 0 succeeded in 7 usecs
[    3.262416] clocksource: Switched to clocksource tsc
[    5.469348] [drm] ring test on 1 succeeded in 89 usecs
[    5.469725] [drm] ring test on 2 succeeded in 2 usecs
[    5.470090] [drm] ring test on 3 succeeded in 2 usecs
[    5.470455] [drm] ring test on 4 succeeded in 2 usecs
[    5.470827] [drm] ring test on 5 succeeded in 2 usecs
[    5.471193] [drm] ring test on 6 succeeded in 2 usecs
[    5.471563] [drm] ring test on 7 succeeded in 2 usecs
[    5.471931] [drm] ring test on 8 succeeded in 2 usecs
[    5.472576] [drm] ring test on 9 succeeded in 109 usecs
[    5.473104] [drm] ring test on 10 succeeded in 65 usecs
[    5.499798] [drm] ring test on 11 succeeded in 1 usecs
[    5.500163] [drm] UVD initialized successfully.
[    5.599850] [drm] ring test on 12 succeeded in 11 usecs
[    5.600239] [drm] ring test on 13 succeeded in 5 usecs
[    5.600613] [drm] ring test on 14 succeeded in 4 usecs
[    5.600975] [drm] VCE initialized successfully.
[    5.992789] [drm] fb mappable at 0x340136F000
[    5.993113] [drm] vram apper at 0x3400000000
[    5.993451] [drm] size 7680000
[    5.993670] [drm] fb depth is 24
[    5.993900] [drm]    pitch is 6400
[    5.994226] fbcon: amdgpudrmfb (fb0) is primary device
[    5.994500] [drm] Atomic commit: SET crtc id 0: [ffff880137c7a000]
[    5.994502] [drm] [IfTrace_DC]       dc_commit_targets: 1 targets
[    5.994503] [drm] [IfTrace_DC]       core_target 0x69b6860: stream_count=1
[    5.994504] [drm] [IfTrace_DC]       core_stream 0x381f8400: src: 0, 0, 1600, 1200; dst: 0, 0, 1600, 1200;
[    5.994505] [drm] [IfTrace_DC]               pix_clk_khz: 162000, h_total: 2160, v_total: 1250
[    5.994505] [drm] [IfTrace_DC]               sink name: DELL 2007FP, serial: 844511308
[    5.994506] [drm] [IfTrace_DC]               link: 4
[    5.995044] [drm] [Conn_Mode]        [DVI-D-1] {1600x1200, 2160x1250@162000Khz}^
[    5.995045] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    5.998135] [drm] [IfTrace_DC]       dc_pre_update_surfaces_to_target: commit 1 surfaces to target 0x69b6860
[    5.998566] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    5.999384] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    6.000156] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    6.097314] Console: switching to colour frame buffer device 200x75
[    6.097731] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    6.098073] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    6.202405] amdgpu 0000:00:04.0: fb0: amdgpudrmfb frame buffer device
[    6.213420] [drm] ib test on ring 0 succeeded
[    6.213888] [drm] ib test on ring 1 succeeded
[    6.214246] [drm] ib test on ring 2 succeeded
[    6.214605] [drm] ib test on ring 3 succeeded
[    6.214962] [drm] ib test on ring 4 succeeded
[    6.215319] [drm] ib test on ring 5 succeeded
[    6.215679] [drm] ib test on ring 6 succeeded
[    6.216037] [drm] ib test on ring 7 succeeded
[    6.216395] [drm] ib test on ring 8 succeeded
[    6.216738] [drm] ib test on ring 9 succeeded
[    6.217081] [drm] ib test on ring 10 succeeded
[    6.218797] [drm] ib test on ring 11 succeeded
[    6.219554] [drm] ib test on ring 12 succeeded
[    6.221227] [drm] Initialized amdgpu 3.9.0 20150101 for 0000:00:04.0 on minor 1
[    6.226666] [drm] DM_NOT_IMPL: dm_bandwidth_update
[    6.227220] [drm] DM_NOT_IMPL: dm_bandwidth_update


[2]
GL_RENDERER   = AMD Radeon (TM) RX 480 Graphics
GL_VERSION    = 4.5.13462 Compatibility Profile Context 16.50.5
GL_VENDOR     = ATI Technologies Inc.
GL_EXTENSIONS = GL_AMDX_debug_output GL_AMD_blend_minmax_factor[...]
VisualID 258, 0x102
878 frames in 394.5 seconds =  2.226 FPS
14175 frames in 5.0 seconds = 2834.942 FPS
14269 frames in 5.0 seconds = 2853.633 FPS
14191 frames in 5.0 seconds = 2838.192 FPS
14231 frames in 5.0 seconds = 2846.003 FPS
14376 frames in 5.0 seconds = 2875.148 FPS
14346 frames in 5.0 seconds = 2868.773 FPS
12828 frames in 5.0 seconds = 2565.346 FPS
6007 frames in 5.0 seconds = 1201.194 FPS
6006 frames in 5.0 seconds = 1201.018 FPS
6004 frames in 5.0 seconds = 1200.700 FPS
6004 frames in 5.0 seconds = 1200.657 FPS
[...]
6004 frames in 5.0 seconds = 1200.800 FPS
6004 frames in 5.0 seconds = 1200.704 FPS
5983 frames in 5.0 seconds = 1196.414 FPS
6177 frames in 5.0 seconds = 1235.288 FPS
6177 frames in 5.0 seconds = 1235.267 FPS
6182 frames in 5.0 seconds = 1236.311 FPS
6183 frames in 5.0 seconds = 1236.526 FPS
6181 frames in 5.0 seconds = 1236.176 FPS
6181 frames in 5.0 seconds = 1236.167 FPS


[3] # vdpauinfo
display: :0   screen: 0
API version: 1
Information string: G3DVL VDPAU Driver Shared Library version 1.0

Video surface:

name   width height types
-------------------------------------------
420    16384 16384  NV12 YV12
422    16384 16384  UYVY YUYV
444    16384 16384  Y8U8V8A8 V8U8Y8A8

Decoder capabilities:

name                        level macbs width height
----------------------------------------------------
MPEG1                          --- not supported ---
MPEG2_SIMPLE                    3 65536  4096  4096
MPEG2_MAIN                      3 65536  4096  4096
H264_BASELINE                  52 65536  4096  4096
H264_MAIN                      52 65536  4096  4096
H264_HIGH                      52 65536  4096  4096
VC1_SIMPLE                      1 65536  4096  4096
VC1_MAIN                        2 65536  4096  4096
VC1_ADVANCED                    4 65536  4096  4096
MPEG4_PART2_SP                  3 65536  4096  4096
MPEG4_PART2_ASP                 5 65536  4096  4096
DIVX4_QMOBILE                  --- not supported ---
DIVX4_MOBILE                   --- not supported ---
DIVX4_HOME_THEATER             --- not supported ---
DIVX4_HD_1080P                 --- not supported ---
DIVX5_QMOBILE                  --- not supported ---
DIVX5_MOBILE                   --- not supported ---
DIVX5_HOME_THEATER             --- not supported ---
DIVX5_HD_1080P                 --- not supported ---
H264_CONSTRAINED_BASELINE      --- not supported ---
H264_EXTENDED                  --- not supported ---
H264_PROGRESSIVE_HIGH          --- not supported ---
H264_CONSTRAINED_HIGH          --- not supported ---
H264_HIGH_444_PREDICTIVE       --- not supported ---
HEVC_MAIN                      186 65536  4096  4096
HEVC_MAIN_10                   186 65536  4096  4096
HEVC_MAIN_STILL                --- not supported ---
HEVC_MAIN_12                   --- not supported ---
HEVC_MAIN_444                  --- not supported ---

Output surface:

name              width height nat types
----------------------------------------------------
B8G8R8A8         16384 16384    y  NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8
R8G8B8A8         16384 16384    y  NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8
R10G10B10A2      16384 16384    y  NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8
B10G10R10A2      16384 16384    y  NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8

Bitmap surface:

name              width height
------------------------------
B8G8R8A8         16384 16384
R8G8B8A8         16384 16384
R10G10B10A2      16384 16384
B10G10R10A2      16384 16384
A8               16384 16384

Video mixer:

feature name                    sup
------------------------------------
DEINTERLACE_TEMPORAL             y
DEINTERLACE_TEMPORAL_SPATIAL     -
INVERSE_TELECINE                 -
NOISE_REDUCTION                  y
SHARPNESS                        y
LUMA_KEY                         -
HIGH QUALITY SCALING - L1        -
HIGH QUALITY SCALING - L2        -
HIGH QUALITY SCALING - L3        -
HIGH QUALITY SCALING - L4        -
HIGH QUALITY SCALING - L5        -
HIGH QUALITY SCALING - L6        -
HIGH QUALITY SCALING - L7        -
HIGH QUALITY SCALING - L8        -
HIGH QUALITY SCALING - L9        -

parameter name                  sup      min      max
-----------------------------------------------------
VIDEO_SURFACE_WIDTH              y        48     4096
VIDEO_SURFACE_HEIGHT             y        48     4096
CHROMA_TYPE                      y
LAYERS                           y         0        4

attribute name                  sup      min      max
-----------------------------------------------------
BACKGROUND_COLOR                 y
CSC_MATRIX                       y
NOISE_REDUCTION_LEVEL            y      0.00     1.00
SHARPNESS_LEVEL                  y     -1.00     1.00
LUMA_KEY_MIN_LUMA                y
LUMA_KEY_MAX_LUMA                y


[4] # ./qvdpautest 
Warning: LLVM emitted unknown config register: 0x4
qvdpautest 0.5.1
Intel(R) Core(TM) i5-4690 CPU @ 3.50GHz
Unknown GPU

VDPAU API version : 1
VDPAU implementation : G3DVL VDPAU Driver Shared Library version 1.0

FATAL: get_bits failed : No backend implementation could be loaded.!!

MPEG DECODING (1920x1080): 118 frames/s
MPEG DECODING (1280x720): 254 frames/s
H264 DECODING (1920x1080): 277 frames/s
H264 DECODING (1280x720): 476 frames/s
VC1 DECODING (1440x1080): 60 frames/s
MPEG4 DECODING (1920x1080): 52 frames/s

MIXER WEAVE (1920x1080): 1328 frames/s
MIXER BOB (1920x1080): 2730 fields/s
MIXER TEMPORAL (1920x1080): 237 fields/s
MIXER TEMPORAL + IVTC (1920x1080): 880 fields/s
MIXER TEMPORAL + SKIP_CHROMA (1920x1080): 994 fields/s
MIXER TEMPORAL_SPATIAL (1920x1080): 1021 fields/s
MIXER TEMPORAL_SPATIAL + IVTC (1920x1080): 947 fields/s
MIXER TEMPORAL_SPATIAL + SKIP_CHROMA (1920x1080): 918 fields/s
MIXER TEMPORAL_SPATIAL (720x576 video to 1920x1080 display): 1689 fields/s

MULTITHREADED MPEG DECODING (1920x1080): 43 frames/s
MULTITHREADED MIXER TEMPORAL (1920x1080): 885 fields/s


[5]
[ 6614.076946] amdgpu 0000:00:04.0: GPU fault detected: 146 0x0750080c
[ 6614.078367] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010B8EA
[ 6614.080023] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C00800C
[ 6614.081672] amdgpu 0000:00:04.0: GPU fault detected: 146 0x0750880c
[ 6614.083202] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010B8EA
[ 6614.084816] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0608800C


[6]
[13757.971926] VM fault (0x0c, vmid 4) at page 1095939, read from 'TC4' (0x54433400) (72)
[13757.971931] amdgpu 0000:00:04.0: GPU fault detected: 146 0x0818480c
[13757.973101] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010B903
[13757.974489] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0804800C
[13757.975875] VM fault (0x0c, vmid 4) at page 1095939, read from 'TC4' (0x54433400) (72)
[13764.909712] amdgpu 0000:00:04.0: GPU fault detected: 146 0x07e0480c
[13764.911848] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010B8FC
[13764.914273] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E04800C
[13764.916700] VM fault (0x0c, vmid 7) at page 1095932, read from 'TC4' (0x54433400) (72)


[7]
root@lunixtst:~/mixbench# ./mixbench-ocl-alt 
mixbench-ocl/alternating (v0.02-25-g7587376)
Use "-h" argument to see available options
------------------------ Device specifications ------------------------
Platform:            AMD Accelerated Parallel Processing
Device:              Ellesmere/Advanced Micro Devices, Inc.
Driver version:      2236.5
Address bits:        64
GPU clock rate:      555 MHz
Total global mem:    8093 MB
Max allowed buffer:  4048 MB
OpenCL version:      OpenCL 1.2 AMD-APP (2236.5)
Total CUs:           14
-----------------------------------------------------------------------
Buffer size:            64MB
Workgroup size:         256
Workitem stride:        NDRange
Buffer allocation:      Device allocated
Loading kernel source file...
Precompilation of kernels... [>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>]
---------------------------------------------------------- CSV data ----------------------------------------------------------
Experiment ID, Single Precision ops,,,,              Double precision ops,,,,              Integer operations,,, 
Compute iters, Flops/byte, ex.time,  GFLOPS, GB/sec, Flops/byte, ex.time,  GFLOPS, GB/sec, Iops/byte, ex.time,   GIOPS, GB/sec
            0,      0.000,   14.36,    0.00, 598.08,      0.000,   19.61,    0.00, 876.04,     0.000,    9.88,    0.00, 869.55
            1,      0.129,   10.18,  105.50, 817.59,      0.065,   18.89,   56.85, 881.23,     0.129,   10.83,   99.13, 768.29
            2,      0.267,   10.33,  207.89, 779.57,      0.133,   17.37,  123.64, 927.27,     0.267,   11.78,  182.27, 683.51
            3,      0.414,   10.16,  317.15, 766.46,      0.207,   19.52,  165.02, 797.57,     0.414,   13.67,  235.65, 569.49
            4,      0.571,   10.71,  400.96, 701.68,      0.286,   23.51,  182.71, 639.50,     0.571,   13.32,  322.40, 564.20
            5,      0.741,   10.64,  504.52, 681.10,      0.370,   26.50,  202.55, 546.90,     0.741,   13.96,  384.45, 519.01
            6,      0.923,   10.07,  639.89, 693.22,      0.462,   28.05,  229.66, 497.59,     0.923,   13.72,  469.41, 508.53
            7,      1.120,    9.80,  767.00, 684.82,      0.560,   30.77,  244.27, 436.19,     1.120,   16.14,  465.59, 415.71
            8,      1.333,   10.49,  818.70, 614.03,      0.667,   33.39,  257.24, 385.87,     1.333,   15.47,  555.22, 416.41
            9,      1.565,    9.69,  997.21, 637.11,      0.783,   36.19,  266.99, 341.16,     1.565,   16.76,  576.74, 368.47
           10,      1.818,    9.60, 1117.98, 614.89,      0.909,   38.92,  275.89, 303.48,     1.818,   16.16,  664.30, 365.36
           11,      2.095,    9.49, 1245.15, 594.27,      1.048,   41.58,  284.05, 271.14,     2.095,   16.84,  701.37, 334.74
           12,      2.400,    8.96, 1438.38, 599.33,      1.200,   44.18,  291.62, 243.02,     2.400,   17.44,  738.98, 307.91
           13,      2.737,    8.88, 1571.18, 574.09,      1.368,   51.88,  269.06, 196.62,     2.737,   18.10,  771.01, 281.72
           14,      3.111,    9.15, 1643.19, 528.17,      1.556,   54.66,  275.01, 176.79,     3.111,   19.47,  772.22, 248.21
           15,      3.529,    8.67, 1857.80, 526.38,      1.765,   52.21,  308.49, 174.81,     3.529,   20.13,  800.24, 226.74
           16,      4.000,    7.80, 2201.42, 550.35,      2.000,   54.71,  314.03, 157.02,     4.000,   20.58,  834.89, 208.72
           17,      4.533,    7.96, 2292.38, 505.67,      2.267,   56.13,  325.19, 143.46,     4.533,   20.47,  891.76, 196.71
           18,      5.143,    8.07, 2394.13, 465.53,      2.571,   60.17,  321.19, 124.91,     5.143,   22.17,  871.61, 169.48
           19,      5.846,    8.25, 2473.12, 423.03,      2.923,   62.81,  324.81, 111.12,     5.846,   22.82,  894.11, 152.94
           20,      6.667,    8.14, 2636.84, 395.53,      3.333,   65.43,  328.20,  98.46,     6.667,   22.59,  950.54, 142.58
           21,      7.636,    8.03, 2806.95, 367.58,      3.818,   68.10,  331.12,  86.72,     7.636,   23.24,  970.12, 127.04
           22,      8.800,    8.23, 2870.13, 326.15,      4.400,   70.75,  333.88,  75.88,     8.800,   24.72,  955.73, 108.61
           23,     10.222,    7.81, 3160.13, 309.14,      5.111,   73.54,  335.82,  65.70,    10.222,   24.52, 1007.29,  98.54
           24,     12.000,    7.77, 3316.47, 276.37,      6.000,   77.60,  332.10,  55.35,    12.000,   24.97, 1031.90,  85.99
           25,     14.286,    7.35, 3651.59, 255.61,      7.143,   80.30,  334.30,  46.80,    14.286,   25.61, 1048.37,  73.39
           26,     17.333,    8.23, 3391.18, 195.65,      8.667,   83.22,  335.45,  38.71,    17.333,   27.26, 1024.18,  59.09
           27,     21.600,    7.34, 3951.54, 182.94,     10.800,   85.91,  337.47,  31.25,    21.600,   27.10, 1069.93,  49.53
           28,     28.000,    7.01, 4287.91, 153.14,     14.000,   86.43,  347.85,  24.85,    28.000,   28.36, 1060.17,  37.86
           29,     38.667,    7.13, 4366.03, 112.91,     19.333,   88.14,  353.30,  18.27,    38.667,   29.04, 1072.42,  27.73
           30,     60.000,    6.78, 4754.44,  79.24,     30.000,   91.74,  351.14,  11.70,    60.000,   28.78, 1119.19,  18.65
           31,    124.000,    6.64, 5016.70,  40.46,     62.000,   94.45,  352.43,   5.68,   124.000,   30.30, 1098.50,   8.86
           32,        inf,    6.41, 5362.94,   0.00,        inf,   95.21,  360.90,   0.00,       inf,   29.98, 1146.16,   0.00
------------------------------------------------------------------------------------------------------------------------------

Others tried:
root@lunixtst:~/shoc# ./shoc/src/opencl/level1/md5hash/MD5Hash
bash: ./shoc/src/opencl/level1/md5hash/MD5Hash: No such file or directory
root@lunixtst:~/shoc# ./shoc/src/opencl/level1/md5hash/MD5Hash^C
root@lunixtst:~/shoc# ./src/opencl/level1/md5hash/MD5Hash
Chose device: name='Ellesmere' index=0 id=0x1005150

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.

ERROR: mismatch in key index found.
test    atts    units   median  mean    stddev  min     max     trial0  trial1  trial2  trial3  trial4  trial5  trial6        trial7   trial8  trial9
MD5Hash 7x10    GHash/s N/A     N/A     N/A     N/A     N/A     N/A     N/A     N/A     N/A     N/A     N/A     N/A           N/A      N/A     N/A

Note: Any results marked with (*) had missing values.
      This can occur on systems with a mixture of
      device types or architectural capabilities.

# ethminer -G -M #(after starting and stopping X)
[OPENCL]:Found suitable OpenCL device [Ellesmere] with 8533725184 bytes of GPU memory
  ◇  23:39:16.061|ethminer  #00004000…
Benchmarking on platform: { "platform": "AMD Accelerated Parallel Processing", "device": "Ellesmere", "version": "OpenCL 1.2 AMD-APP (2236.5)" }
Preparing DAG...
  ℹ  23:39:16.062|gpuminer0  workLoop 0 #00000000… #00000000…
  ℹ  23:39:16.063|gpuminer0  Initialising miner...
Warming up...
Trial 1... [OPENCL]:Using platform: AMD Accelerated Parallel Processing
[OPENCL]:Using device: Ellesmere(OpenCL 1.2 AMD-APP (2236.5))
[OPENCL]:Printing program log
[OPENCL]:
[OPENCL]:Creating one big buffer for the DAG
[OPENCL]:Loading single big chunk kernels
[OPENCL]:Mapping one big chunk.
0
Trial 2... [OPENCL]:Creating buffer for header.
[OPENCL]:Creating mining buffer 0
[OPENCL]:Creating mining buffer 1
[   46.474707] amdgpu 0000:00:04.0: GPU fault detected: 147 0x00004802
[   46.475150] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
[   46.475672] amdgpu 0000:00:04.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C048002
0
Trial 3... 0
Trial 4... 0
Trial 5... 0
[***HANG HERE, HAVE TO REBOOT HOST TO RESET CARD***]


[8] # clinfo 
Number of platforms                               1
  Platform Name                                   AMD Accelerated Parallel Processing
  Platform Vendor                                 Advanced Micro Devices, Inc.
  Platform Version                                OpenCL 2.0 AMD-APP (2236.5)
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_icd cl_amd_event_callback cl_amd_offline_devices 
  Platform Extensions function suffix             AMD

  Platform Name                                   AMD Accelerated Parallel Processing
Number of devices                                 2
  Device Name                                     Ellesmere
  Device Vendor                                   Advanced Micro Devices, Inc.
  Device Vendor ID                                0x1002
  Device Version                                  OpenCL 1.2 AMD-APP (2236.5)
  Driver Version                                  2236.5
  Device OpenCL C Version                         OpenCL C 1.2 
  Device Type                                     GPU
  Device Profile                                  FULL_PROFILE
  Device Board Name (AMD)                         AMD Radeon (TM) RX 480 Graphics
  Device Topology (AMD)                           PCI-E, 00:04.0
  Max compute units                               14
  SIMD per compute unit (AMD)                     4
  SIMD width (AMD)                                16
  SIMD instruction width (AMD)                    1
  Max clock frequency                             555MHz
  Graphics IP (AMD)                               8.0
  Device Partition                                (core)
    Max number of sub-devices                     14
    Supported partition types                     none specified
  Max work item dimensions                        3
  Max work item sizes                             256x256x256
  Max work group size                             256
  Preferred work group size multiple              64
  Wavefront width (AMD)                           64
  Preferred / native vector sizes                 
    char                                                 4 / 4       
    short                                                2 / 2       
    int                                                  1 / 1       
    long                                                 1 / 1       
    half                                                 1 / 1        (cl_khr_fp16)
    float                                                1 / 1       
    double                                               1 / 1        (cl_khr_fp64)
  Half-precision Floating-point support           (cl_khr_fp16)
    Denormals                                     No
    Infinity and NANs                             No
    Round to nearest                              No
    Round to zero                                 No
    Round to infinity                             No
    IEEE754-2008 fused multiply-add               No
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  No
  Single-precision Floating-point support         (core)
    Denormals                                     No
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  Yes
  Double-precision Floating-point support         (cl_khr_fp64)
    Denormals                                     Yes
    Infinity and NANs                             Yes
    Round to nearest                              Yes
    Round to zero                                 Yes
    Round to infinity                             Yes
    IEEE754-2008 fused multiply-add               Yes
    Support is emulated in software               No
    Correctly-rounded divide and sqrt operations  No
  Address bits                                    64, Little-Endian
  Global memory size                              8495808512 (7.912GiB)
  Global free memory (AMD)                        8278188 (7.895GiB)
  Global memory channels (AMD)                    8
  Global memory banks per channel (AMD)           16
  Global memory bank width (AMD)                  256 bytes
  Error Correction support                        No
  Max memory allocation                           4244635648 (3.953GiB)
  Unified memory for Host and Device              No
  Minimum alignment for any data type             128 bytes
  Alignment of base address                       2048 bits (256 bytes)
  Global Memory cache type                        Read/Write
  Global Memory cache size                        16384
  Global Memory cache line                        64 bytes
  Image support                                   Yes
    Max number of samplers per kernel             16
    Max size for 1D images from buffer            134217728 pixels
    Max 1D or 2D image array size                 2048 images
    Base address alignment for 2D image buffers   256 bytes
    Pitch alignment for 2D image buffers          256 bytes
    Max 2D image size                             16384x16384 pixels
    Max 3D image size                             2048x2048x2048 pixels
    Max number of read image args                 128
    Max number of write image args                8
  Local memory type                               Local
  Local memory size                               32768 (32KiB)
  Local memory syze per CU (AMD)                  65536 (64KiB)
  Local memory banks (AMD)                        32
  Max constant buffer size                        65536 (64KiB)
  Max number of constant args                     8
  Max size of kernel argument                     1024
  Queue properties                                
    Out-of-order execution                        No
    Profiling                                     Yes
  Prefer user sync for interop                    Yes
  Profiling timer resolution                      1ns
  Profiling timer offset since Epoch (AMD)        1484275845220815516ns (Thu Jan 12 18:50:45 2017)
  Execution capabilities                          
    Run OpenCL kernels                            Yes
    Run native kernels                            No
    Thread trace supported (AMD)                  Yes
    SPIR versions                                 1.2
  printf() buffer size                            1048576 (1024KiB)
  Built-in kernels                                
  Device Available                                Yes
  Compiler Available                              Yes
  Linker Available                                Yes
  Device Extensions                               cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event
[...skip Intel CPU...]
NULL platform behavior
  clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...)  No platform
  clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...)   No platform
  clCreateContext(NULL, ...) [default]            No platform
  clCreateContext(NULL, ...) [other]              Success [AMD]
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU)  No platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU)  No platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR)  No platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM)  No platform
  clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL)  No platform

# /opt/AMDAPPSDK-3.0/bin/x86_64/clinfo 
Number of platforms:                             1
  Platform Profile:                              FULL_PROFILE
  Platform Version:                              OpenCL 2.0 AMD-APP (2236.5)
  Platform Name:                                 AMD Accelerated Parallel Processing
  Platform Vendor:                               Advanced Micro Devices, Inc.
  Platform Extensions:                           cl_khr_icd cl_amd_event_callback cl_amd_offline_devices 


  Platform Name:                                 AMD Accelerated Parallel Processing
Number of devices:                               2
  Device Type:                                   CL_DEVICE_TYPE_GPU
  Vendor ID:                                     1002h
  Board name:                                    AMD Radeon (TM) RX 480 Graphics
  Device Topology:                               PCI[ B#0, D#4, F#0 ]
  Max compute units:                             14
  Max work items dimensions:                     3
    Max work items[0]:                           256
    Max work items[1]:                           256
    Max work items[2]:                           256
  Max work group size:                           256
  Preferred vector width char:                   4
  Preferred vector width short:                  2
  Preferred vector width int:                    1
  Preferred vector width long:                   1
  Preferred vector width float:                  1
  Preferred vector width double:                 1
  Native vector width char:                      4
  Native vector width short:                     2
  Native vector width int:                       1
  Native vector width long:                      1
  Native vector width float:                     1
  Native vector width double:                    1
  Max clock frequency:                           555Mhz
  Address bits:                                  64
  Max memory allocation:                         4244635648
  Image support:                                 Yes
  Max number of images read arguments:           128
  Max number of images write arguments:          8
  Max image 2D width:                            16384
  Max image 2D height:                           16384
  Max image 3D width:                            2048
  Max image 3D height:                           2048
  Max image 3D depth:                            2048
  Max samplers within kernel:                    16
  Max size of kernel argument:                   1024
  Alignment (bits) of base address:              2048
  Minimum alignment (bytes) for any datatype:    128
  Single precision floating point capability
    Denorms:                                     No
    Quiet NaNs:                                  Yes
    Round to nearest even:                       Yes
    Round to zero:                               Yes
    Round to +ve and infinity:                   Yes
    IEEE754-2008 fused multiply-add:             Yes
  Cache type:                                    Read/Write
  Cache line size:                               64
  Cache size:                                    16384
  Global memory size:                            8495808512
  Constant buffer size:                          65536
  Max number of constant args:                   8
  Local memory type:                             Scratchpad
  Local memory size:                             32768
  Max pipe arguments:                            0
  Max pipe active reservations:                  0
  Max pipe packet size:                          0
  Max global variable size:                      0
  Max global variable preferred total size:      0
  Max read/write image args:                     0
  Max on device events:                          0
  Queue on device max size:                      0
  Max on device queues:                          0
  Queue on device preferred size:                0
  SVM capabilities:                              
    Coarse grain buffer:                         No
    Fine grain buffer:                           No
    Fine grain system:                           No
    Atomics:                                     No
  Preferred platform atomic alignment:           0
  Preferred global atomic alignment:             0
  Preferred local atomic alignment:              0
  Kernel Preferred work group size multiple:     64
  Error correction support:                      0
  Unified memory for Host and Device:            0
  Profiling timer resolution:                    1
  Device endianess:                              Little
  Available:                                     Yes
  Compiler available:                            Yes
  Execution capabilities:                                
    Execute OpenCL kernels:                      Yes
    Execute native function:                     No
  Queue on Host properties:                              
    Out-of-Order:                                No
    Profiling :                                  Yes
  Queue on Device properties:                            
    Out-of-Order:                                No
    Profiling :                                  No
  Platform ID:                                   0x7faab8eecdb8
  Name:                                          Ellesmere
  Vendor:                                        Advanced Micro Devices, Inc.
  Device OpenCL C version:                       OpenCL C 1.2 
  Driver version:                                2236.5
  Profile:                                       FULL_PROFILE
  Version:                                       OpenCL 1.2 AMD-APP (2236.5)
  Extensions:                                    cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event 


[9]
# /opt/amdgpu-pro/bin/kmstest 
trying to open device 'i915'...failed
trying to open device 'amdgpu'...done
main: All ok!

# /opt/amdgpu-pro/bin/modetest 
trying to open device 'i915'...failed
trying to open device 'amdgpu'...done
Encoders:
id      crtc    type    possible crtcs  possible clones 
45      0       TMDS    0x0000003f      0x00000000
47      0       TMDS    0x0000003f      0x00000000
49      0       TMDS    0x0000003f      0x00000000
51      0       TMDS    0x0000003f      0x00000000
53      34      TMDS    0x0000003f      0x00000000
[...and so on...]

# /opt/amdgpu-pro/bin/proptest 
[works fine, output similar to subset of `modetest`]

# /opt/amdgpu-pro/bin/vbltest 
trying to open device 'i915'...failed
trying to open device 'amdgpu'...done
starting count: 276079
freq: 60.87Hz
freq: 60.02Hz
freq: 60.02Hz
freq: 60.02Hz
freq: 60.02Hz
freq: 60.02Hz
freq: 60.02Hz
^C

# /opt/amdgpu-pro/bin/amdgpu_test 
DRM Driver: Name: [amdgpu] : Date [20150101] : Description [AMD GPU]
     CUnit - A unit testing framework for C - Version 2.1-3
     http://cunit.sourceforge.net/
Suite: Basic Tests
  Test: Query Info Test ...passed
  Test: Memory alloc Test ...passed
  Test: Userptr Test ...passed
  Test: Command submission Test (GFX) ...
[HANGS HERE]


OK, I failed to restrain the length of this message.


-- 
[SorAlx]  ridin' VN2000 Classic LT



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20170112234810.2cb83671>