Date: Thu, 12 Jan 2017 23:48:10 -0800 From: <soralx@cydem.org> To: <freebsd-virtualization@freebsd.org> Subject: Re: Issues with GTX960 on CentOS7 using bhyve PCI passthru (FreeBSD 11-RC2) Message-ID: <20170112234810.2cb83671@mscad14> In-Reply-To: <201701130144.v0D1ifxJ051207@pdx.rh.CN85.dnsmgr.net> References: <20170111213941.0789c8ce@mscad14> <201701130144.v0D1ifxJ051207@pdx.rh.CN85.dnsmgr.net>
next in thread | previous in thread | raw e-mail | index | archive | help
Good news, everyone! I tried an AMD card, and it is almost working. I have a lot of logs and info, but I will try to restrain the length of this message. There was no need to do anything special to get the card to work, other than figuring out how to deal with Linux, setting up drivers and OpenCL SDK & linking libraries to the right places, compiling soft, etc. First, PCI info [0] and some dmesg bits [1]. AMD drivers: amdgpu-pro-16.50-362463.tar.xz AMD-APP-SDKInstaller-v3.0.130.136-GA-linux64.tar.bz2 Next, the good news: Xorg starts, and display works!.. Kind of: -- `glxgears` window is flickery, has parts of gears missing, and does not look good in general; -- xterm window has the rectangular cursor shapes plastered all over, in random places; -- full-screen (1600x1200) `glxgears` is slower than expected, and the performance varies suddenly [2]; -- VDPAU works, but I suspect it's not using the GPU [3][4]; I haven't figure a way how to force the use of GPU. Also, the main window with text looks OK most of the time (when doing the video test and in the end, in particular), but show a smaller black rectangle in top left corner of the screen instead of the video samples; -- it almost feels like the DMA and framebuffers aren't always correctly configured, but still are transferring data [from somewhere to somewhere sometimes]. I'm getting lots of messages like [5][6], among others, in various cases. Of some 3 OpenCL applications I tested, one appeared to complete successfully [7]. Running it also produces messages as in [6]. But the numbers make sense, comparing to e.g. tests of R9 Nano (~/mixbench/results/OpenCL/alt_R9-Nano_d1912.5.log) and expectations of the GPU chip. This is exciting! Dunno if the benchmark check whether the computations are correct, though. `clinfo` result [8] is a bit mixed... e.g., "Max clock frequency: 555Mhz" is wrong. Some more tests [9]. [0] 00:04.0 0300: 1002:67df (rev c7) (prog-if 00 [VGA controller]) Subsystem: 1682:9480 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 29 Region 0: Memory at 3400000000 (64-bit, prefetchable) [size=256M] Region 2: Memory at c0200000 (64-bit, prefetchable) [size=2M] Region 4: I/O ports at 2100 [size=256] Region 5: Memory at c0400000 (32-bit, non-prefetchable) [size=256K] Expansion ROM at c0020000 [disabled] [size=128K] Capabilities: [48] Vendor Specific Information: Len=08 <?> Capabilities: [50] Power Management version 3 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1+,D2+,D3hot+,D3cold+) Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME- Capabilities: [58] Express (v2) Legacy Endpoint, MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <4us, L1 unlimited ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset- DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported- RlxdOrd- ExtTag+ PhantFunc- AuxPwr- NoSnoop+ MaxPayload 256 bytes, MaxReadReq 512 bytes DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend- LnkCap: Port #0, Speed 8GT/s, Width x16, ASPM L1, Exit Latency L0s <64ns, L1 <1us ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp+ LnkCtl: ASPM L1 Enabled; RCB 64 bytes Disabled- CommClk+ ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- DevCap2: Completion Timeout: Not Supported, TimeoutDis-, LTR+, OBFF Not Supported DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR+, OBFF Disabled LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete+, EqualizationPhase1+ EqualizationPhase2+, EqualizationPhase3+, LinkEqualizationRequest- Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+ Address: 00000000fee03000 Data: 4022 Kernel driver in use: amdgpu Kernel modules: amdgpu 00:05.0 0300: 10de:0dd8 (rev a1) (prog-if 00 [VGA controller]) Subsystem: 10de:084a Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 16 Region 0: Memory at c2000000 (32-bit, non-prefetchable) [size=32M] Region 1: Memory at 3410000000 (64-bit, prefetchable) [size=128M] Region 3: Memory at 3418000000 (64-bit, prefetchable) [size=64M] Region 5: I/O ports at 2200 [size=128] [virtual] Expansion ROM at c0080000 [disabled] [size=512K] Capabilities: [60] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME- Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+ Address: 0000000000000000 Data: 0000 Capabilities: [78] Express (v2) Endpoint, MSI 00 DevCap: MaxPayload 128 bytes, PhantFunc 0, Latency L0s <4us, L1 <64us ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset- DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported- RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+ MaxPayload 128 bytes, MaxReadReq 512 bytes DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend- LnkCap: Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <256ns, L1 <4us ClockPM+ Surprise- LLActRep- BwNot- ASPMOptComp- LnkCtl: ASPM L0s L1 Enabled; RCB 128 bytes Disabled- CommClk+ ExtSynch- ClockPM+ AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- DevCap2: Completion Timeout: Not Supported, TimeoutDis+, LTR-, OBFF Not Supported DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1- EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest- Capabilities: [b4] Vendor Specific Information: Len=14 <?> Kernel driver in use: nvidia Kernel modules: nvidiafb, nouveau, nvidia_367, nvidia_367_drm [1] [ 0.617109] pci 0000:00:04.0: can't claim BAR 6 [mem 0xf6840000-0xf685ffff pref]: no compatible bridge window [ 0.617806] pci 0000:00:05.0: can't claim BAR 6 [mem 0xf6000000-0xf607ffff pref]: no compatible bridge window [ 0.618496] pci 0000:00:05.0: BAR 6: assigned [mem 0xc0080000-0xc00fffff pref] [ 0.619000] pci 0000:00:04.0: BAR 6: assigned [mem 0xc0020000-0xc003ffff pref] [ 0.619508] pci 0000:00:01.0: BAR 6: assigned [mem 0xc0004000-0xc00047ff pref] [ 0.620011] pci 0000:00:02.0: BAR 6: assigned [mem 0xc0004800-0xc0004fff pref] [ 0.620513] pci 0000:00:03.0: BAR 6: assigned [mem 0xc0005000-0xc00057ff pref] [ 0.621033] pci_bus 0000:00: resource 4 [io 0x0000-0x0cf7 window] [ 0.621034] pci_bus 0000:00: resource 5 [io 0x0d00-0x1fff window] [ 0.621035] pci_bus 0000:00: resource 6 [io 0x2000-0x229f window] [ 0.621036] pci_bus 0000:00: resource 7 [mem 0xc0000000-0xc40fffff window] [ 0.621037] pci_bus 0000:00: resource 8 [mem 0x3400000000-0x341c0fffff window] [ 1.692950] [drm] Initialized drm 1.1.0 20060810 [ 1.701483] AVX2 version of gcm_enc/dec engaged. [ 1.702477] AES CTR mode by8 optimization enabled [ 1.705010] nvidia: module license 'NVIDIA' taints kernel. [ 1.705982] Disabling lock debugging due to kernel taint [ 1.706716] amdgpu: module verification failed: signature and/or required key missing - tainting kernel [ 1.714692] Could not create tracefs 'fence_wait_end' directory [ 1.715984] Could not create tracefs 'fence_wait_start' directory [ 1.717618] Could not create tracefs 'fence_signaled' directory [ 1.718110] Could not create tracefs 'fence_enable_signal' directory [ 1.718593] Could not create tracefs 'fence_destroy' directory [ 1.719214] Could not create tracefs 'fence_init' directory [ 1.719616] Could not create tracefs 'fence_emit' directory [ 1.720027] Could not create tracefs 'fence_annotate_wait_on' directory [ 1.724756] nvidia 0000:00:05.0: can't derive routing for PCI INT A [ 1.726058] nvidia 0000:00:05.0: PCI INT A: no GSI [ 1.727090] vgaarb: device changed decodes: PCI:0000:00:05.0,olddecodes=io+mem,decodes=none:owns=io+mem [ 1.729188] nvidia-nvlink: Nvlink Core is being initialized, major device number 247 [ 1.730133] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 367.57 Mon Oct 3 20:37:01 PDT 2016 [ 1.741012] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 367.57 Mon Oct 3 20:32:57 PDT 2016 [ 1.743758] [drm] [nvidia-drm] [GPU ID 0x00000005] Loading driver [ 1.808253] [drm] amdgpu kernel modesetting enabled. [ 1.810043] AMD IOMMUv2 driver by Joerg Roedel <jroedel@suse.de> [ 1.810481] AMD IOMMUv2 functionality not available on this system [ 1.812928] CRAT table not found [ 1.813165] Finished initializing topology ret=0 [ 1.813570] kfd kfd: Initialized module [ 1.814088] amdgpu 0000:00:04.0: can't derive routing for PCI INT A [ 1.814527] amdgpu 0000:00:04.0: PCI INT A: no GSI [ 1.815077] [drm] initializing kernel modesetting (POLARIS10 0x1002:0x67DF 0x1682:0x9480 0xC7). [ 1.815690] [drm] register mmio base: 0xC0400000 [ 1.816012] [drm] register mmio size: 262144 [ 1.816332] [drm] doorbell mmio base: 0xC0200000 [ 1.816659] [drm] doorbell mmio size: 2097152 [ 1.816985] [drm] UVD is enabled in VM mode [ 1.817279] [drm] VCE enabled in VM mode [ 1.817567] amdgpu 0000:00:04.0: Invalid ROM contents [ 1.990245] ATOM BIOS: D00901 [ 1.990462] [drm] GPU posting now... [ 2.043833] tsc: Refined TSC clocksource calibration: 3489.546 MHz [ 2.044265] clocksource: tsc: mask: 0xffffffffffffffff max_cycles: 0x324cbd595a2, max_idle_ns: 440795315348 ns [ 2.121186] amdgpu 0000:00:04.0: VRAM: 8192M 0x0000000000000000 - 0x00000001FFFFFFFF (8192M used) [ 2.121808] amdgpu 0000:00:04.0: GTT: 8192M 0x0000000200000000 - 0x00000003FFFFFFFF [ 2.122342] [drm] Detected VRAM RAM=8192M, BAR=256M [ 2.122686] [drm] RAM width 256bits GDDR5 [ 2.123063] [TTM] Zone kernel: Available graphics memory: 2023196 kiB [ 2.123521] [TTM] Initializing pool allocator [ 2.123872] [TTM] Initializing DMA pool allocator [ 2.124214] [drm] amdgpu: 8192M of VRAM memory ready [ 2.124563] [drm] amdgpu: 8192M of GTT memory ready. [ 2.124914] [drm] GART: num cpu pages 2097152, num gpu pages 2097152 [ 2.126509] [drm] PCIE GART of 8192M enabled (table at 0x0000000000040000). [ 2.128085] amdgpu 0000:00:04.0: amdgpu: using MSI. [ 2.136333] [drm] amdgpu: irq initialized. [ 2.136631] Can't find requested voltage id in vdd_dep_on_sclk table! [ 2.137190] amdgpu: powerplay initialized [ 2.137738] amdgpu 0000:00:04.0: fence driver on ring 0 use gpu addr 0x0000000200000008, cpu addr 0xffff880006650008 [ 2.138528] amdgpu 0000:00:04.0: fence driver on ring 1 use gpu addr 0x0000000200000018, cpu addr 0xffff880006650018 [ 2.139317] amdgpu 0000:00:04.0: fence driver on ring 2 use gpu addr 0x0000000200000028, cpu addr 0xffff880006650028 [ 2.140119] amdgpu 0000:00:04.0: fence driver on ring 3 use gpu addr 0x0000000200000038, cpu addr 0xffff880006650038 [ 2.140912] amdgpu 0000:00:04.0: fence driver on ring 4 use gpu addr 0x0000000200000048, cpu addr 0xffff880006650048 [ 2.141689] amdgpu 0000:00:04.0: fence driver on ring 5 use gpu addr 0x0000000200000058, cpu addr 0xffff880006650058 [ 2.142490] amdgpu 0000:00:04.0: fence driver on ring 6 use gpu addr 0x0000000200000068, cpu addr 0xffff880006650068 [ 2.143283] amdgpu 0000:00:04.0: fence driver on ring 7 use gpu addr 0x0000000200000078, cpu addr 0xffff880006650078 [ 2.144055] amdgpu 0000:00:04.0: fence driver on ring 8 use gpu addr 0x0000000200000088, cpu addr 0xffff880006650088 [ 2.144888] amdgpu 0000:00:04.0: fence driver on ring 9 use gpu addr 0x0000000200000098, cpu addr 0xffff880006650098 [ 2.145663] amdgpu 0000:00:04.0: fence driver on ring 10 use gpu addr 0x00000002000000a8, cpu addr 0xffff8800066500a8 [ 2.146571] [drm] Found UVD firmware Version: 1.79 Family ID: 16 [ 2.214736] amdgpu 0000:00:04.0: fence driver on ring 11 use gpu addr 0x000000000109c420, cpu addr 0xffffc90000c5a420 [ 2.215510] [drm] Found VCE firmware Version: 52.4 Binary ID: 3 [ 2.224329] amdgpu 0000:00:04.0: fence driver on ring 12 use gpu addr 0x00000002000000c8, cpu addr 0xffff8800066500c8 [ 2.225113] amdgpu 0000:00:04.0: fence driver on ring 13 use gpu addr 0x00000002000000d8, cpu addr 0xffff8800066500d8 [ 2.225904] amdgpu 0000:00:04.0: fence driver on ring 14 use gpu addr 0x00000002000000e8, cpu addr 0xffff8800066500e8 [ 2.553099] failed to send message 150 ret is 0 [ 2.875033] failed to send pre message 136 ret is 0 [ 2.876529] [drm] DAL is enabled [ 2.876840] [drm] [FeatureOverride_overriden feature] Overridden FEATURE_ENABLE_GPU_SCALING is enabled now [ 2.877528] [drm] [Backlight_Caps] This method is not supported [ 2.878140] [drm] DM_PPLIB: values for Engine clock [ 2.878488] [drm] DM_PPLIB: 30000 [ 2.878739] [drm] DM_PPLIB: 60800 [ 2.879011] [drm] DM_PPLIB: 91000 [ 2.879264] [drm] DM_PPLIB: 107700 [ 2.879514] [drm] DM_PPLIB: 114500 [ 2.879763] [drm] DM_PPLIB: 119100 [ 2.880013] [drm] DM_PPLIB: 123600 [ 2.880259] [drm] DM_PPLIB: 128800 [ 2.880507] [drm] DM_PPLIB: Warning: using default validation clocks! [ 2.880961] [drm] DM_PPLIB: Validation clocks: [ 2.881273] [drm] DM_PPLIB: engine_max_clock: 72000 [ 2.881635] [drm] DM_PPLIB: memory_max_clock: 80000 [ 2.881996] [drm] DM_PPLIB: level : 0 [ 2.882338] [drm] DM_PPLIB: reducing engine clock level from 8 to 3 [ 2.882777] [drm] DM_PPLIB: values for Memory clock [ 2.883140] [drm] DM_PPLIB: 30000 [ 2.883381] [drm] DM_PPLIB: 200000 [ 2.883631] [drm] DM_PPLIB: Warning: using default validation clocks! [ 2.884080] [drm] DM_PPLIB: Validation clocks: [ 2.884391] [drm] DM_PPLIB: engine_max_clock: 72000 [ 2.884752] [drm] DM_PPLIB: memory_max_clock: 80000 [ 2.885115] [drm] DM_PPLIB: level : 0 [ 2.885458] [drm] DM_PPLIB: reducing memory clock level from 2 to 2 [ 2.885903] [drm] DC: create_links: connectors_num: physical:5, virtual:0 [ 2.886384] [drm] [HwTrace_Hotplug] Connector[0] description:signal 32 [ 2.886846] [drm] [I2cAux_Config] Using channel: CHANNEL_ID_DDC1 [1] [ 2.887325] [drm] [HwTrace_Hotplug] Connector[1] description:signal 32 [ 2.887781] [drm] [I2cAux_Config] Using channel: CHANNEL_ID_DDC3 [3] [ 2.888240] [drm] [HwTrace_Hotplug] Connector[2] description:signal 32 [ 2.888700] [drm] [I2cAux_Config] Using channel: CHANNEL_ID_DDC2 [2] [ 2.889153] [drm] [HwTrace_Hotplug] Connector[3] description:signal 4 [ 2.889607] [drm] [I2cAux_Config] Using channel: CHANNEL_ID_DDC4 [4] [ 2.890093] [drm] [HwTrace_Hotplug] Connector[4] description:signal 2 [ 2.890554] [drm] [I2cAux_Config] Using channel: CHANNEL_ID_DDC6 [6] [ 2.893310] [drm] [IfTrace_DC] Display Core initialized [ 2.893687] [drm] amdgpu: freesync_module init done ffff880005005700. [ 2.894391] [drm] [HwTrace_Hotplug] link=0, dc_sink_in= (null) is now Disconnected [ 2.895019] [drm] DCHPD: connector_id=0: dc_sink didn't change. [ 2.895618] [drm] [HwTrace_Hotplug] link=1, dc_sink_in= (null) is now Disconnected [ 2.896204] [drm] DCHPD: connector_id=1: dc_sink didn't change. [ 2.896791] [drm] [HwTrace_Hotplug] link=2, dc_sink_in= (null) is now Disconnected [ 2.897366] [drm] DCHPD: connector_id=2: dc_sink didn't change. [ 2.897878] [drm] [HwTrace_Hotplug] link=3, dc_sink_in= (null) is now Disconnected [ 2.898455] [drm] DCHPD: connector_id=3: dc_sink didn't change. [ 2.911776] [drm] SADs count is: -2, don't need to read it [ 2.912168] [drm] [Conn_Detect] [DVI-D-1] DELL 2007FP: [Block 0] 00 FF FF FF FF FF FF 00 10 AC 21 A0 4C 38 56 32 05 11 01 03 80 29 1F 78 EE EE 91 A3 54 4C 99 26 0F 50 54 A5 4B 00 81 80 A9 40 71 4F 01 01 01 01 01 01 01 01 01 01 48 3F 40 30 62 B0 32 40 40 C0 13 00 6F 13 11 00 00 1E 00 00 00 FF 00 43 39 35 33 36 37 31 55 32 56 38 4C 0A 00 00 00 FC 00 44 45 4C 4C 20 32 30 30 37 46 50 0A 20 00 00 00 FD 00 38 4C 1E 53 11 00 0A 20 20 20 20 20 20 00 9F ^ [ 2.914871] [drm] [Detection_EDID Parser] dc_link_detect: manufacturer_id = AC10, product_id = A021, serial_number = 3256384C, manufacture_week = 5, manufacture_year = 17, display_name = DELL 2007FP, speaker_flag = 0, audio_mode_count = 0 [ 2.916301] [drm] [HwTrace_Hotplug] link=4, dc_sink_in=ffff880137b03000 is now Connected [ 2.916867] [drm] DCHPD: connector_id=4: Old sink= (null) New sink=ffff880137b03000 [ 2.917458] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013). [ 2.917923] [drm] Driver supports precise vblank timestamp query. [ 2.918351] [drm] KMS initialized. [ 2.920363] [drm] ring test on 0 succeeded in 7 usecs [ 3.262416] clocksource: Switched to clocksource tsc [ 5.469348] [drm] ring test on 1 succeeded in 89 usecs [ 5.469725] [drm] ring test on 2 succeeded in 2 usecs [ 5.470090] [drm] ring test on 3 succeeded in 2 usecs [ 5.470455] [drm] ring test on 4 succeeded in 2 usecs [ 5.470827] [drm] ring test on 5 succeeded in 2 usecs [ 5.471193] [drm] ring test on 6 succeeded in 2 usecs [ 5.471563] [drm] ring test on 7 succeeded in 2 usecs [ 5.471931] [drm] ring test on 8 succeeded in 2 usecs [ 5.472576] [drm] ring test on 9 succeeded in 109 usecs [ 5.473104] [drm] ring test on 10 succeeded in 65 usecs [ 5.499798] [drm] ring test on 11 succeeded in 1 usecs [ 5.500163] [drm] UVD initialized successfully. [ 5.599850] [drm] ring test on 12 succeeded in 11 usecs [ 5.600239] [drm] ring test on 13 succeeded in 5 usecs [ 5.600613] [drm] ring test on 14 succeeded in 4 usecs [ 5.600975] [drm] VCE initialized successfully. [ 5.992789] [drm] fb mappable at 0x340136F000 [ 5.993113] [drm] vram apper at 0x3400000000 [ 5.993451] [drm] size 7680000 [ 5.993670] [drm] fb depth is 24 [ 5.993900] [drm] pitch is 6400 [ 5.994226] fbcon: amdgpudrmfb (fb0) is primary device [ 5.994500] [drm] Atomic commit: SET crtc id 0: [ffff880137c7a000] [ 5.994502] [drm] [IfTrace_DC] dc_commit_targets: 1 targets [ 5.994503] [drm] [IfTrace_DC] core_target 0x69b6860: stream_count=1 [ 5.994504] [drm] [IfTrace_DC] core_stream 0x381f8400: src: 0, 0, 1600, 1200; dst: 0, 0, 1600, 1200; [ 5.994505] [drm] [IfTrace_DC] pix_clk_khz: 162000, h_total: 2160, v_total: 1250 [ 5.994505] [drm] [IfTrace_DC] sink name: DELL 2007FP, serial: 844511308 [ 5.994506] [drm] [IfTrace_DC] link: 4 [ 5.995044] [drm] [Conn_Mode] [DVI-D-1] {1600x1200, 2160x1250@162000Khz}^ [ 5.995045] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 5.998135] [drm] [IfTrace_DC] dc_pre_update_surfaces_to_target: commit 1 surfaces to target 0x69b6860 [ 5.998566] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 5.999384] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 6.000156] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 6.097314] Console: switching to colour frame buffer device 200x75 [ 6.097731] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 6.098073] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 6.202405] amdgpu 0000:00:04.0: fb0: amdgpudrmfb frame buffer device [ 6.213420] [drm] ib test on ring 0 succeeded [ 6.213888] [drm] ib test on ring 1 succeeded [ 6.214246] [drm] ib test on ring 2 succeeded [ 6.214605] [drm] ib test on ring 3 succeeded [ 6.214962] [drm] ib test on ring 4 succeeded [ 6.215319] [drm] ib test on ring 5 succeeded [ 6.215679] [drm] ib test on ring 6 succeeded [ 6.216037] [drm] ib test on ring 7 succeeded [ 6.216395] [drm] ib test on ring 8 succeeded [ 6.216738] [drm] ib test on ring 9 succeeded [ 6.217081] [drm] ib test on ring 10 succeeded [ 6.218797] [drm] ib test on ring 11 succeeded [ 6.219554] [drm] ib test on ring 12 succeeded [ 6.221227] [drm] Initialized amdgpu 3.9.0 20150101 for 0000:00:04.0 on minor 1 [ 6.226666] [drm] DM_NOT_IMPL: dm_bandwidth_update [ 6.227220] [drm] DM_NOT_IMPL: dm_bandwidth_update [2] GL_RENDERER = AMD Radeon (TM) RX 480 Graphics GL_VERSION = 4.5.13462 Compatibility Profile Context 16.50.5 GL_VENDOR = ATI Technologies Inc. GL_EXTENSIONS = GL_AMDX_debug_output GL_AMD_blend_minmax_factor[...] VisualID 258, 0x102 878 frames in 394.5 seconds = 2.226 FPS 14175 frames in 5.0 seconds = 2834.942 FPS 14269 frames in 5.0 seconds = 2853.633 FPS 14191 frames in 5.0 seconds = 2838.192 FPS 14231 frames in 5.0 seconds = 2846.003 FPS 14376 frames in 5.0 seconds = 2875.148 FPS 14346 frames in 5.0 seconds = 2868.773 FPS 12828 frames in 5.0 seconds = 2565.346 FPS 6007 frames in 5.0 seconds = 1201.194 FPS 6006 frames in 5.0 seconds = 1201.018 FPS 6004 frames in 5.0 seconds = 1200.700 FPS 6004 frames in 5.0 seconds = 1200.657 FPS [...] 6004 frames in 5.0 seconds = 1200.800 FPS 6004 frames in 5.0 seconds = 1200.704 FPS 5983 frames in 5.0 seconds = 1196.414 FPS 6177 frames in 5.0 seconds = 1235.288 FPS 6177 frames in 5.0 seconds = 1235.267 FPS 6182 frames in 5.0 seconds = 1236.311 FPS 6183 frames in 5.0 seconds = 1236.526 FPS 6181 frames in 5.0 seconds = 1236.176 FPS 6181 frames in 5.0 seconds = 1236.167 FPS [3] # vdpauinfo display: :0 screen: 0 API version: 1 Information string: G3DVL VDPAU Driver Shared Library version 1.0 Video surface: name width height types ------------------------------------------- 420 16384 16384 NV12 YV12 422 16384 16384 UYVY YUYV 444 16384 16384 Y8U8V8A8 V8U8Y8A8 Decoder capabilities: name level macbs width height ---------------------------------------------------- MPEG1 --- not supported --- MPEG2_SIMPLE 3 65536 4096 4096 MPEG2_MAIN 3 65536 4096 4096 H264_BASELINE 52 65536 4096 4096 H264_MAIN 52 65536 4096 4096 H264_HIGH 52 65536 4096 4096 VC1_SIMPLE 1 65536 4096 4096 VC1_MAIN 2 65536 4096 4096 VC1_ADVANCED 4 65536 4096 4096 MPEG4_PART2_SP 3 65536 4096 4096 MPEG4_PART2_ASP 5 65536 4096 4096 DIVX4_QMOBILE --- not supported --- DIVX4_MOBILE --- not supported --- DIVX4_HOME_THEATER --- not supported --- DIVX4_HD_1080P --- not supported --- DIVX5_QMOBILE --- not supported --- DIVX5_MOBILE --- not supported --- DIVX5_HOME_THEATER --- not supported --- DIVX5_HD_1080P --- not supported --- H264_CONSTRAINED_BASELINE --- not supported --- H264_EXTENDED --- not supported --- H264_PROGRESSIVE_HIGH --- not supported --- H264_CONSTRAINED_HIGH --- not supported --- H264_HIGH_444_PREDICTIVE --- not supported --- HEVC_MAIN 186 65536 4096 4096 HEVC_MAIN_10 186 65536 4096 4096 HEVC_MAIN_STILL --- not supported --- HEVC_MAIN_12 --- not supported --- HEVC_MAIN_444 --- not supported --- Output surface: name width height nat types ---------------------------------------------------- B8G8R8A8 16384 16384 y NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8 R8G8B8A8 16384 16384 y NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8 R10G10B10A2 16384 16384 y NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8 B10G10R10A2 16384 16384 y NV12 YV12 UYVY YUYV Y8U8V8A8 V8U8Y8A8 A8I8 I8A8 Bitmap surface: name width height ------------------------------ B8G8R8A8 16384 16384 R8G8B8A8 16384 16384 R10G10B10A2 16384 16384 B10G10R10A2 16384 16384 A8 16384 16384 Video mixer: feature name sup ------------------------------------ DEINTERLACE_TEMPORAL y DEINTERLACE_TEMPORAL_SPATIAL - INVERSE_TELECINE - NOISE_REDUCTION y SHARPNESS y LUMA_KEY - HIGH QUALITY SCALING - L1 - HIGH QUALITY SCALING - L2 - HIGH QUALITY SCALING - L3 - HIGH QUALITY SCALING - L4 - HIGH QUALITY SCALING - L5 - HIGH QUALITY SCALING - L6 - HIGH QUALITY SCALING - L7 - HIGH QUALITY SCALING - L8 - HIGH QUALITY SCALING - L9 - parameter name sup min max ----------------------------------------------------- VIDEO_SURFACE_WIDTH y 48 4096 VIDEO_SURFACE_HEIGHT y 48 4096 CHROMA_TYPE y LAYERS y 0 4 attribute name sup min max ----------------------------------------------------- BACKGROUND_COLOR y CSC_MATRIX y NOISE_REDUCTION_LEVEL y 0.00 1.00 SHARPNESS_LEVEL y -1.00 1.00 LUMA_KEY_MIN_LUMA y LUMA_KEY_MAX_LUMA y [4] # ./qvdpautest Warning: LLVM emitted unknown config register: 0x4 qvdpautest 0.5.1 Intel(R) Core(TM) i5-4690 CPU @ 3.50GHz Unknown GPU VDPAU API version : 1 VDPAU implementation : G3DVL VDPAU Driver Shared Library version 1.0 FATAL: get_bits failed : No backend implementation could be loaded.!! MPEG DECODING (1920x1080): 118 frames/s MPEG DECODING (1280x720): 254 frames/s H264 DECODING (1920x1080): 277 frames/s H264 DECODING (1280x720): 476 frames/s VC1 DECODING (1440x1080): 60 frames/s MPEG4 DECODING (1920x1080): 52 frames/s MIXER WEAVE (1920x1080): 1328 frames/s MIXER BOB (1920x1080): 2730 fields/s MIXER TEMPORAL (1920x1080): 237 fields/s MIXER TEMPORAL + IVTC (1920x1080): 880 fields/s MIXER TEMPORAL + SKIP_CHROMA (1920x1080): 994 fields/s MIXER TEMPORAL_SPATIAL (1920x1080): 1021 fields/s MIXER TEMPORAL_SPATIAL + IVTC (1920x1080): 947 fields/s MIXER TEMPORAL_SPATIAL + SKIP_CHROMA (1920x1080): 918 fields/s MIXER TEMPORAL_SPATIAL (720x576 video to 1920x1080 display): 1689 fields/s MULTITHREADED MPEG DECODING (1920x1080): 43 frames/s MULTITHREADED MIXER TEMPORAL (1920x1080): 885 fields/s [5] [ 6614.076946] amdgpu 0000:00:04.0: GPU fault detected: 146 0x0750080c [ 6614.078367] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010B8EA [ 6614.080023] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C00800C [ 6614.081672] amdgpu 0000:00:04.0: GPU fault detected: 146 0x0750880c [ 6614.083202] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010B8EA [ 6614.084816] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0608800C [6] [13757.971926] VM fault (0x0c, vmid 4) at page 1095939, read from 'TC4' (0x54433400) (72) [13757.971931] amdgpu 0000:00:04.0: GPU fault detected: 146 0x0818480c [13757.973101] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010B903 [13757.974489] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0804800C [13757.975875] VM fault (0x0c, vmid 4) at page 1095939, read from 'TC4' (0x54433400) (72) [13764.909712] amdgpu 0000:00:04.0: GPU fault detected: 146 0x07e0480c [13764.911848] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0010B8FC [13764.914273] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E04800C [13764.916700] VM fault (0x0c, vmid 7) at page 1095932, read from 'TC4' (0x54433400) (72) [7] root@lunixtst:~/mixbench# ./mixbench-ocl-alt mixbench-ocl/alternating (v0.02-25-g7587376) Use "-h" argument to see available options ------------------------ Device specifications ------------------------ Platform: AMD Accelerated Parallel Processing Device: Ellesmere/Advanced Micro Devices, Inc. Driver version: 2236.5 Address bits: 64 GPU clock rate: 555 MHz Total global mem: 8093 MB Max allowed buffer: 4048 MB OpenCL version: OpenCL 1.2 AMD-APP (2236.5) Total CUs: 14 ----------------------------------------------------------------------- Buffer size: 64MB Workgroup size: 256 Workitem stride: NDRange Buffer allocation: Device allocated Loading kernel source file... Precompilation of kernels... [>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>] ---------------------------------------------------------- CSV data ---------------------------------------------------------- Experiment ID, Single Precision ops,,,, Double precision ops,,,, Integer operations,,, Compute iters, Flops/byte, ex.time, GFLOPS, GB/sec, Flops/byte, ex.time, GFLOPS, GB/sec, Iops/byte, ex.time, GIOPS, GB/sec 0, 0.000, 14.36, 0.00, 598.08, 0.000, 19.61, 0.00, 876.04, 0.000, 9.88, 0.00, 869.55 1, 0.129, 10.18, 105.50, 817.59, 0.065, 18.89, 56.85, 881.23, 0.129, 10.83, 99.13, 768.29 2, 0.267, 10.33, 207.89, 779.57, 0.133, 17.37, 123.64, 927.27, 0.267, 11.78, 182.27, 683.51 3, 0.414, 10.16, 317.15, 766.46, 0.207, 19.52, 165.02, 797.57, 0.414, 13.67, 235.65, 569.49 4, 0.571, 10.71, 400.96, 701.68, 0.286, 23.51, 182.71, 639.50, 0.571, 13.32, 322.40, 564.20 5, 0.741, 10.64, 504.52, 681.10, 0.370, 26.50, 202.55, 546.90, 0.741, 13.96, 384.45, 519.01 6, 0.923, 10.07, 639.89, 693.22, 0.462, 28.05, 229.66, 497.59, 0.923, 13.72, 469.41, 508.53 7, 1.120, 9.80, 767.00, 684.82, 0.560, 30.77, 244.27, 436.19, 1.120, 16.14, 465.59, 415.71 8, 1.333, 10.49, 818.70, 614.03, 0.667, 33.39, 257.24, 385.87, 1.333, 15.47, 555.22, 416.41 9, 1.565, 9.69, 997.21, 637.11, 0.783, 36.19, 266.99, 341.16, 1.565, 16.76, 576.74, 368.47 10, 1.818, 9.60, 1117.98, 614.89, 0.909, 38.92, 275.89, 303.48, 1.818, 16.16, 664.30, 365.36 11, 2.095, 9.49, 1245.15, 594.27, 1.048, 41.58, 284.05, 271.14, 2.095, 16.84, 701.37, 334.74 12, 2.400, 8.96, 1438.38, 599.33, 1.200, 44.18, 291.62, 243.02, 2.400, 17.44, 738.98, 307.91 13, 2.737, 8.88, 1571.18, 574.09, 1.368, 51.88, 269.06, 196.62, 2.737, 18.10, 771.01, 281.72 14, 3.111, 9.15, 1643.19, 528.17, 1.556, 54.66, 275.01, 176.79, 3.111, 19.47, 772.22, 248.21 15, 3.529, 8.67, 1857.80, 526.38, 1.765, 52.21, 308.49, 174.81, 3.529, 20.13, 800.24, 226.74 16, 4.000, 7.80, 2201.42, 550.35, 2.000, 54.71, 314.03, 157.02, 4.000, 20.58, 834.89, 208.72 17, 4.533, 7.96, 2292.38, 505.67, 2.267, 56.13, 325.19, 143.46, 4.533, 20.47, 891.76, 196.71 18, 5.143, 8.07, 2394.13, 465.53, 2.571, 60.17, 321.19, 124.91, 5.143, 22.17, 871.61, 169.48 19, 5.846, 8.25, 2473.12, 423.03, 2.923, 62.81, 324.81, 111.12, 5.846, 22.82, 894.11, 152.94 20, 6.667, 8.14, 2636.84, 395.53, 3.333, 65.43, 328.20, 98.46, 6.667, 22.59, 950.54, 142.58 21, 7.636, 8.03, 2806.95, 367.58, 3.818, 68.10, 331.12, 86.72, 7.636, 23.24, 970.12, 127.04 22, 8.800, 8.23, 2870.13, 326.15, 4.400, 70.75, 333.88, 75.88, 8.800, 24.72, 955.73, 108.61 23, 10.222, 7.81, 3160.13, 309.14, 5.111, 73.54, 335.82, 65.70, 10.222, 24.52, 1007.29, 98.54 24, 12.000, 7.77, 3316.47, 276.37, 6.000, 77.60, 332.10, 55.35, 12.000, 24.97, 1031.90, 85.99 25, 14.286, 7.35, 3651.59, 255.61, 7.143, 80.30, 334.30, 46.80, 14.286, 25.61, 1048.37, 73.39 26, 17.333, 8.23, 3391.18, 195.65, 8.667, 83.22, 335.45, 38.71, 17.333, 27.26, 1024.18, 59.09 27, 21.600, 7.34, 3951.54, 182.94, 10.800, 85.91, 337.47, 31.25, 21.600, 27.10, 1069.93, 49.53 28, 28.000, 7.01, 4287.91, 153.14, 14.000, 86.43, 347.85, 24.85, 28.000, 28.36, 1060.17, 37.86 29, 38.667, 7.13, 4366.03, 112.91, 19.333, 88.14, 353.30, 18.27, 38.667, 29.04, 1072.42, 27.73 30, 60.000, 6.78, 4754.44, 79.24, 30.000, 91.74, 351.14, 11.70, 60.000, 28.78, 1119.19, 18.65 31, 124.000, 6.64, 5016.70, 40.46, 62.000, 94.45, 352.43, 5.68, 124.000, 30.30, 1098.50, 8.86 32, inf, 6.41, 5362.94, 0.00, inf, 95.21, 360.90, 0.00, inf, 29.98, 1146.16, 0.00 ------------------------------------------------------------------------------------------------------------------------------ Others tried: root@lunixtst:~/shoc# ./shoc/src/opencl/level1/md5hash/MD5Hash bash: ./shoc/src/opencl/level1/md5hash/MD5Hash: No such file or directory root@lunixtst:~/shoc# ./shoc/src/opencl/level1/md5hash/MD5Hash^C root@lunixtst:~/shoc# ./src/opencl/level1/md5hash/MD5Hash Chose device: name='Ellesmere' index=0 id=0x1005150 ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. ERROR: mismatch in key index found. test atts units median mean stddev min max trial0 trial1 trial2 trial3 trial4 trial5 trial6 trial7 trial8 trial9 MD5Hash 7x10 GHash/s N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A Note: Any results marked with (*) had missing values. This can occur on systems with a mixture of device types or architectural capabilities. # ethminer -G -M #(after starting and stopping X) [OPENCL]:Found suitable OpenCL device [Ellesmere] with 8533725184 bytes of GPU memory ◇ 23:39:16.061|ethminer #00004000… Benchmarking on platform: { "platform": "AMD Accelerated Parallel Processing", "device": "Ellesmere", "version": "OpenCL 1.2 AMD-APP (2236.5)" } Preparing DAG... ℹ 23:39:16.062|gpuminer0 workLoop 0 #00000000… #00000000… ℹ 23:39:16.063|gpuminer0 Initialising miner... Warming up... Trial 1... [OPENCL]:Using platform: AMD Accelerated Parallel Processing [OPENCL]:Using device: Ellesmere(OpenCL 1.2 AMD-APP (2236.5)) [OPENCL]:Printing program log [OPENCL]: [OPENCL]:Creating one big buffer for the DAG [OPENCL]:Loading single big chunk kernels [OPENCL]:Mapping one big chunk. 0 Trial 2... [OPENCL]:Creating buffer for header. [OPENCL]:Creating mining buffer 0 [OPENCL]:Creating mining buffer 1 [ 46.474707] amdgpu 0000:00:04.0: GPU fault detected: 147 0x00004802 [ 46.475150] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 46.475672] amdgpu 0000:00:04.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C048002 0 Trial 3... 0 Trial 4... 0 Trial 5... 0 [***HANG HERE, HAVE TO REBOOT HOST TO RESET CARD***] [8] # clinfo Number of platforms 1 Platform Name AMD Accelerated Parallel Processing Platform Vendor Advanced Micro Devices, Inc. Platform Version OpenCL 2.0 AMD-APP (2236.5) Platform Profile FULL_PROFILE Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices Platform Extensions function suffix AMD Platform Name AMD Accelerated Parallel Processing Number of devices 2 Device Name Ellesmere Device Vendor Advanced Micro Devices, Inc. Device Vendor ID 0x1002 Device Version OpenCL 1.2 AMD-APP (2236.5) Driver Version 2236.5 Device OpenCL C Version OpenCL C 1.2 Device Type GPU Device Profile FULL_PROFILE Device Board Name (AMD) AMD Radeon (TM) RX 480 Graphics Device Topology (AMD) PCI-E, 00:04.0 Max compute units 14 SIMD per compute unit (AMD) 4 SIMD width (AMD) 16 SIMD instruction width (AMD) 1 Max clock frequency 555MHz Graphics IP (AMD) 8.0 Device Partition (core) Max number of sub-devices 14 Supported partition types none specified Max work item dimensions 3 Max work item sizes 256x256x256 Max work group size 256 Preferred work group size multiple 64 Wavefront width (AMD) 64 Preferred / native vector sizes char 4 / 4 short 2 / 2 int 1 / 1 long 1 / 1 half 1 / 1 (cl_khr_fp16) float 1 / 1 double 1 / 1 (cl_khr_fp64) Half-precision Floating-point support (cl_khr_fp16) Denormals No Infinity and NANs No Round to nearest No Round to zero No Round to infinity No IEEE754-2008 fused multiply-add No Support is emulated in software No Correctly-rounded divide and sqrt operations No Single-precision Floating-point support (core) Denormals No Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations Yes Double-precision Floating-point support (cl_khr_fp64) Denormals Yes Infinity and NANs Yes Round to nearest Yes Round to zero Yes Round to infinity Yes IEEE754-2008 fused multiply-add Yes Support is emulated in software No Correctly-rounded divide and sqrt operations No Address bits 64, Little-Endian Global memory size 8495808512 (7.912GiB) Global free memory (AMD) 8278188 (7.895GiB) Global memory channels (AMD) 8 Global memory banks per channel (AMD) 16 Global memory bank width (AMD) 256 bytes Error Correction support No Max memory allocation 4244635648 (3.953GiB) Unified memory for Host and Device No Minimum alignment for any data type 128 bytes Alignment of base address 2048 bits (256 bytes) Global Memory cache type Read/Write Global Memory cache size 16384 Global Memory cache line 64 bytes Image support Yes Max number of samplers per kernel 16 Max size for 1D images from buffer 134217728 pixels Max 1D or 2D image array size 2048 images Base address alignment for 2D image buffers 256 bytes Pitch alignment for 2D image buffers 256 bytes Max 2D image size 16384x16384 pixels Max 3D image size 2048x2048x2048 pixels Max number of read image args 128 Max number of write image args 8 Local memory type Local Local memory size 32768 (32KiB) Local memory syze per CU (AMD) 65536 (64KiB) Local memory banks (AMD) 32 Max constant buffer size 65536 (64KiB) Max number of constant args 8 Max size of kernel argument 1024 Queue properties Out-of-order execution No Profiling Yes Prefer user sync for interop Yes Profiling timer resolution 1ns Profiling timer offset since Epoch (AMD) 1484275845220815516ns (Thu Jan 12 18:50:45 2017) Execution capabilities Run OpenCL kernels Yes Run native kernels No Thread trace supported (AMD) Yes SPIR versions 1.2 printf() buffer size 1048576 (1024KiB) Built-in kernels Device Available Yes Compiler Available Yes Linker Available Yes Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event [...skip Intel CPU...] NULL platform behavior clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform clCreateContext(NULL, ...) [default] No platform clCreateContext(NULL, ...) [other] Success [AMD] clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No platform clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform # /opt/AMDAPPSDK-3.0/bin/x86_64/clinfo Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 2.0 AMD-APP (2236.5) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices Platform Name: AMD Accelerated Parallel Processing Number of devices: 2 Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: AMD Radeon (TM) RX 480 Graphics Device Topology: PCI[ B#0, D#4, F#0 ] Max compute units: 14 Max work items dimensions: 3 Max work items[0]: 256 Max work items[1]: 256 Max work items[2]: 256 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 555Mhz Address bits: 64 Max memory allocation: 4244635648 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 8 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 8495808512 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 0 Max pipe active reservations: 0 Max pipe packet size: 0 Max global variable size: 0 Max global variable preferred total size: 0 Max read/write image args: 0 Max on device events: 0 Queue on device max size: 0 Max on device queues: 0 Queue on device preferred size: 0 SVM capabilities: Coarse grain buffer: No Fine grain buffer: No Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: No Profiling : No Platform ID: 0x7faab8eecdb8 Name: Ellesmere Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 1.2 Driver version: 2236.5 Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (2236.5) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event [9] # /opt/amdgpu-pro/bin/kmstest trying to open device 'i915'...failed trying to open device 'amdgpu'...done main: All ok! # /opt/amdgpu-pro/bin/modetest trying to open device 'i915'...failed trying to open device 'amdgpu'...done Encoders: id crtc type possible crtcs possible clones 45 0 TMDS 0x0000003f 0x00000000 47 0 TMDS 0x0000003f 0x00000000 49 0 TMDS 0x0000003f 0x00000000 51 0 TMDS 0x0000003f 0x00000000 53 34 TMDS 0x0000003f 0x00000000 [...and so on...] # /opt/amdgpu-pro/bin/proptest [works fine, output similar to subset of `modetest`] # /opt/amdgpu-pro/bin/vbltest trying to open device 'i915'...failed trying to open device 'amdgpu'...done starting count: 276079 freq: 60.87Hz freq: 60.02Hz freq: 60.02Hz freq: 60.02Hz freq: 60.02Hz freq: 60.02Hz freq: 60.02Hz ^C # /opt/amdgpu-pro/bin/amdgpu_test DRM Driver: Name: [amdgpu] : Date [20150101] : Description [AMD GPU] CUnit - A unit testing framework for C - Version 2.1-3 http://cunit.sourceforge.net/ Suite: Basic Tests Test: Query Info Test ...passed Test: Memory alloc Test ...passed Test: Userptr Test ...passed Test: Command submission Test (GFX) ... [HANGS HERE] OK, I failed to restrain the length of this message. -- [SorAlx] ridin' VN2000 Classic LT
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20170112234810.2cb83671>