search for: l1p

Displaying 6 results from an estimated 6 matches for "l1p".

Did you mean: l1
2018 Nov 07
3
RFC: System (cache, etc.) model for LLVM
...ning buffers or prefetch streams. If there multiple such bottlenecks, it would be the minimum of such streams. At the moment I cannot think of an optimization where the difference matters (which doesn't mean there isn't a case where it does). > That seems ok to me. As I understand it, L1P is a little awkward in > that L2 data doesn't get moved to L1P, it gets moved to L1. L1P is > really a prefetch buffer, right? One wouldn't do, say, cache blocking > for L1P. In that sense maybe modeling it as a cache level isn't the > right thing. The L1P (4 KiB) is sma...
2018 Nov 02
2
RFC: System (cache, etc.) model for LLVM
...for writing to RAM). Is it the maximum number of operations in the queue? > >> The intent is that getCacheLevel(0) is the L1 cache, getCacheLevel(1) is > >> the L2 cache and so on. > > > > Can passes rely on it? > > Yes. Naively, I'd put Blue Gene/Q's L1P cache between the L1 and the L2, i.e. the L1P would be getCacheLevel(1) and getCacheLevel(2) would be L2. How would you model it instead? Michael
2007 Jan 12
0
[PATCH] xc_ptrace PAE awareness
...maddr = page_array[maddr >> PAGE_SHIFT] << PAGE_SHIFT; + maddr = (uint64_t)page_array[maddr >> PAGE_SHIFT] << PAGE_SHIFT; return maddr; } @@ -225,7 +222,8 @@ map_domain_va_pae( void *guest_va, int perm) { - unsigned long l3e, l2e, l1e, l2p, l1p, p, va = (unsigned long)guest_va; + uint64_t l3e, l2e, l1e, l2p, l1p, p; + unsigned long va = (unsigned long)guest_va; uint64_t *l3, *l2, *l1; static void *v[MAX_VIRT_CPUS]; @@ -380,12 +378,12 @@ map_domain_va( if (!paging_enabled(&ctxt[cpu])) { static void * v...
2018 Nov 01
3
RFC: System (cache, etc.) model for LLVM
...hers/prefetch instructructions > > for different levels. > > Probably. Most X86 implementations direct all data prefetches to the > same cache level so we didn't find a need to model this, but it makes > sense to allow for it. Again the Blue Gene/Q: Streams prefetch into the L1P cache (P for prefetch), but a dcbt instruction is necessary to establish the cache line into the L1 cache. > >> An open question is how to handle different SKUs within a subtarget > >> family. We modeled the limited number of SKUs used in our products > >> via mult...
2012 Dec 10
26
[PATCH 00/11] Add virtual EPT support Xen.
From: Zhang Xiantao <xiantao.zhang@intel.com> With virtual EPT support, L1 hyerpvisor can use EPT hardware for L2 guest''s memory virtualization. In this way, L2 guest''s performance can be improved sharply. According to our testing, some benchmarks can show > 5x performance gain. Signed-off-by: Zhang Xiantao <xiantao.zhang@intel.com> Zhang Xiantao (11):
2018 Nov 01
2
RFC: System (cache, etc.) model for LLVM
Hi, thank you for sharing the system hierarchy model. IMHO it makes a lot of sense, although I don't know which of today's passes would make use of it. Here are my remarks. I am wondering how one could model the following features using this model, or whether they should be part of a performance model at all: * ARM's big.LITTLE * NUMA hierarchies (are the NUMA domains