Displaying 10 results from an estimated 10 matches for "gflops".
Did you mean:
flops
2010 Nov 03
1
[LLVMdev] LLVM x86 Code Generator discards Instruction-level Parallelism
Dear LLVMdev,
I've noticed an unusual behavior of the LLVM x86 code generator (with default options)
that results in nearly a 4x slow-down in floating-point throughput for my microbenchmark.
I've written a compute-intensive microbenchmark to approach theoretical peak
throughput of the target processor by issuing a large number of independent
floating-point multiplies. The distance
2016 May 02
2
[GSoC 2016] Attaining 90% of the turbo boost peak with a C version of Matrix-Matrix Multiplication
Hi Tobias,
according to [1], we can expect 90% of the turbo boost peak of the
processor with a C version of Matrix-Matrix Multiplication that is
similar to the one presented in [1]. In case of Intel Core i7-3820
SandyBridge, the theoretical maximal performance of the machine is
28.8 gflops and hence the expected number is 25,92 gflops.
However, in case of, for example, n = m = 1056 and k = 1024 a code
based on BLIS framework takes 0.088919 seconds and hence 25,68 gflops.
I’m not sure whether a C implementation, which similar to one the
presented in [1], can outperform a code based o...
2012 Oct 31
3
lustre client on arm debian
Hi,
has anyone tried to compile the lustre patchless client on a debian
linux for arm architecture? Could be possible to do?
Thanks in advance.
2007 May 02
0
FLAC on GPGPU
...block.
In addition to this kind of parallelism, grids of blocks (which do
not share memory, unlike threads within the same block) can be used
to process several audio frames at once. This is somewhat tricky,
given the explicitly stream-oriented API, and also some CUDA
peculiarities.
With the 330 GFLOPS from the current cards... I'd expect quite a
significant acceleration.
Does anyone find this interesting?
Josh: do you think this would be worth including in the FLAC codebase
when implemented?
-- boris
2013 Nov 19
7
Quadrified GTX 480 VT-d passthrough. CUDA 5.5 in Linux partial success
Hi everyone,
after following in the footsteps of the following discussion
(http://lists.xenproject.org/archives/html/xen-users/2013-09/msg00106.html)
I had been able to turn my GTX 480 into a Quadro 6000. When I VT-d
passthrough it to a Debian jessie VM it shows up fine and CUDA 5.5
seems to function properly up to a point:
lspci -v:
00:04.0 VGA compatible controller: NVIDIA Corporation GF100GL
2013 Nov 18
0
Quadrified GTX 480 VT-d passthrough. CUDA 5.5 in Linux partial success!
Hi everyone,
after following in the footsteps of the following discussion
(http://lists.xenproject.org/archives/html/xen-users/2013-09/msg00106.html)
I had been able to turn my GTX 480 into a Quadro 6000. When I VT-d
passthrough it to a Debian jessie VM it shows up fine and CUDA 5.5
seems to function properly up to a point:
lspci -v:
00:04.0 VGA compatible controller: NVIDIA Corporation GF100GL
2015 Jan 07
1
Design changes are done in Fedora
...tinuum is a smartphone the other end of, exactly? The one in my pocket has multiple general-purpose GHz-class CPU cores, a few specialized coprocessors, several hundred megs of RAM, dozens of gigs of fast local storage, and several high-tech radios. Its raw processing power is on the order of 100 GFLOPS.
This is the low end of?what?the Top 500 List from 1998?
?except that my phone achieves that parity on a few watts, and doesn?t require a staff of acolytes to tend to its needs. This is a device that would make Captain Kirk jealous, but it?s just one of a billion. Booooring.
We are *so* spoil...
2015 Jan 07
2
Design changes are done in Fedora
On Tue, Jan 6, 2015 at 5:07 PM, Warren Young <wyml at etr-usa.com> wrote:
>
>>> There are more JavaScript interpreters in the world than Dalvik, ART,[2] and Java ? VMs combined. Perhaps we should rewrite everything in JavaScript instead?
>>
>> I'm counting the running/useful instances of actual program code,
>
> I rather doubt you?ve done anything like
2010 Dec 22
16
stupid ZFS question - floating point operations
I have a coworker, who''s primary expertise is in another flavor of Unix.
This coworker lists floating point operations as one of ZFS detriments.
I''s not really sure what he means specifically, or where he got this
reference from.
In an effort to refute what I believe is an error or misunderstanding on
his part, I have spent time on Yahoo, Google, the ZFS section of
2009 Jan 06
11
zfs list improvements?
To improve the performance of scripts that manipulate zfs snapshots and the zfs snapshot service in perticular there needs to be a way to list all the snapshots for a given object and only the snapshots for that object.
There are two RFEs filed that cover this:
http://bugs.opensolaris.org/view_bug.do?bug_id=6352014 :
''zfs list'' should have an option to only present direct