search for: cold

Displaying 20 results from an estimated 1432 matches for "cold".

Did you mean: cld
2020 Aug 05
10
[RFC] Machine Function Splitter - Split out cold blocks from machine functions using profile data
Greetings, We present “Machine Function Splitter”, a codegen optimization pass which splits functions into hot and cold parts. This pass leverages the basic block sections feature recently introduced in LLVM from the Propeller project. The pass targets functions with profile coverage, identifies cold blocks and moves them to a separate section. The linker groups all cold blocks across functions together, decreasing...
2019 Feb 05
2
Status update on the hot/cold splitting pass
...; > On Feb 5, 2019, at 2:38 PM, Teresa Johnson via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > > > > > On Mon, Jan 28, 2019 at 11:03 AM Aditya K via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > >> > The splitting pass currently doesn’t move cold symbols into a separate >> section. Is that affecting your results? >> Maybe partly, the main reason is that, in the absence of good profile >> info, we aren't finding many cold blocks. >> > > We noticed that the split cold functions are ending up in the regular &g...
2019 Jan 28
2
Status update on the hot/cold splitting pass
The splitting pass currently doesn’t move cold symbols into a separate section. Is that affecting your results? On Darwin, we plan on using a symbol attribute to provide an ordering hint to the linker (see r352227, N_COLD_FUNC). vedant > On Jan 28, 2019, at 10:51 AM, Aditya K via llvm-dev <llvm-dev at lists.llvm.org> wrote: > &g...
2019 Jan 26
2
Status update on the hot/cold splitting pass
Hello, I’d like to give a status update to the community about the recently-added hot/cold splitting pass. I'll provide some motivation for the pass, describe its implementation, summarize recent/ongoing work, and share early results. # Motivation We (at Apple) have found that memory pressure from resident pages of code is significant on embedded devices. In particular, this pressu...
2019 Feb 05
2
Status update on the hot/cold splitting pass
On Mon, Jan 28, 2019 at 11:03 AM Aditya K via llvm-dev < llvm-dev at lists.llvm.org> wrote: > > The splitting pass currently doesn’t move cold symbols into a separate > section. Is that affecting your results? > Maybe partly, the main reason is that, in the absence of good profile > info, we aren't finding many cold blocks. > We noticed that the split cold functions are ending up in the regular .text section instead of .t...
2020 Jun 01
2
Improve hot cold splitting to aggressively outline small blocks
Hello, I am Ruijie Fang, a GSoC student working on "Improve hot cold splitting to aggressively outline small blocks." Over the course of last week, I met with my mentor and co-mentor, Aditya Kumar, and Rodrigo Rocha, and we made a preliminary plan on improving the existing hot/cold splitting pass in LLVM through identifying patterns of cold blocks in real-world...
2020 Aug 10
2
[RFC] Machine Function Splitter - Split out cold blocks from machine functions using profile data
>Exceptions >All eh pads are grouped together regardless of their coldness and are part of the original function. There are outstanding issues with splitting eh pads if they reside in separate sections in the binary. This remains as part of future work. Can you elaborate more on the outstanding issues with splitting eh pads? From my dip into the unwind map in gc...
2020 Aug 05
3
[RFC] Machine Function Splitter - Split out cold blocks from machine functions using profile data
...lar. Cost model can sometimes be quite tricky though -- it is hard to compare the cost with the actual benefit brought by the splitting. The beauty of machine splitting is that it does not depend on sophisticated cost/benefit model. > > In contrast, the machine function splitter extracts cold code into a separate > section. > HCS also adds a section prefix to all the cold functions. It is possible > that the cold functions are still in the same section as the hot one > depending on the linker. Ruijie has a patch to move all the cold functions > to a separate section, we a...
2018 Feb 27
1
On sharded tiered volume, only first shard of new file goes on hot tier.
...ered volume, only first shard of new file goes on hot tier. https://bugzilla.redhat.com/show_bug.cgi?id=1549714 On sharded tiered volume, only first shard of new file goes on hot tier. On a sharded tiered volume, only the first shard of a new file goes on the hot tier, the rest are written to the cold tier. This is unfortunate for archival applications where the hot tier is fast, but the cold tier is very slow. After the tier- promote-frequency (default 120 seconds), all of the shards do migrate to hot tier, but for archival applications, this migration is not helpful since the file is likely t...
2020 Sep 09
2
[RFC] New Feature Proposal: De-Optimizing Cold Functions using PGO Info
On Wed, 9 Sep 2020 at 19:26, Nemanja Ivanovic <nemanja.i.ibm at gmail.com> wrote: > - Default (i.e. no -profile-deopt-cold): do nothing > - Option with no arg (i.e. -profile-deopt-cold): add attribute only to > functions that have an execution count of zero > - Option with an arg (i.e. -profile-deopt-cold=<N>): add attribute to > functions that account for <N>% of total execution counts > I...
2020 Jun 02
2
Improve hot cold splitting to aggressively outline small blocks
...rinceton.edu On Mon, Jun 1, 2020 at 11:28 AM Teresa Johnson <tejohnson at google.com> wrote: > > > On Sun, May 31, 2020 at 11:37 PM Ruijie Fang <ruijief at princeton.edu> > wrote: > >> Hello, >> I am Ruijie Fang, a GSoC student working on "Improve hot cold >> splitting to aggressively outline small blocks." Over the course of >> last week, I met with my mentor and co-mentor, Aditya Kumar, and >> Rodrigo Rocha, and we made a preliminary plan on improving the >> existing hot/cold splitting pass in LLVM through identifying p...
2020 Sep 09
5
[RFC] New Feature Proposal: De-Optimizing Cold Functions using PGO Info
We would like to propose a new feature to disable optimizations on IR Functions that are considered “cold” by PGO profiles. The primary goal for this work is to improve code optimization speed (which also improves compilation and LTO speed) without making too much impact on target code performance. The mechanism is pretty simple: In the second phase (i.e. optimization phase) of PGO, we would add `optn...
2020 Jun 02
2
Improve hot cold splitting to aggressively outline small blocks
...son at google.com> >> wrote: >> >>> >>> >>> On Sun, May 31, 2020 at 11:37 PM Ruijie Fang <ruijief at princeton.edu> >>> wrote: >>> >>>> Hello, >>>> I am Ruijie Fang, a GSoC student working on "Improve hot cold >>>> splitting to aggressively outline small blocks." Over the course of >>>> last week, I met with my mentor and co-mentor, Aditya Kumar, and >>>> Rodrigo Rocha, and we made a preliminary plan on improving the >>>> existing hot/cold splitting pa...
2018 Mar 05
2
Why files goes to hot tier and cold tier at same time
Hi Guys Got a quick question regarding hot tier and cold tier. I got a gluster volume with 1 x 3 hot tier and 1 x 3 cold tier. watermark-low is 75 and watermark-hi is 90. usage of volume is very less. My files always go to hot tier and cold tier at same time As I understand, data should go to hot tier only until demoted. Could someone please shed some li...
2014 Nov 19
5
[LLVMdev] Odd code layout requirements for MCJIT
...rter than that now, but this description is good enough for the problem at hand). We're under pretty severe icache/iTLB pressure, so we do whatever we can to keep the hot path as compact as possible. One of the ways we do this is by dividing our code cache into three fixed-size areas: main, cold, and frozen. Our current, non-llvm codegen backend has one area tag per basic block, and most tracelets we compile will span all three areas. This means that if we emit code for tracelets A and then B, A's main code will be followed immediately by B's main code (and the same for cold/f...
2005 May 31
2
Centos4 SMP Kernel OOM
....) A reboot helped though, but still... :) [this is the normal CentOS4 i686 SMP kernel 2.6.9-5.0.5.ELsmp] May 31 22:31:25 tcs kernel: oom-killer: gfp_mask=0xd0 May 31 22:31:25 tcs kernel: DMA per-cpu: May 31 22:31:25 tcs kernel: cpu 0 hot: low 2, high 6, batch 1 May 31 22:31:25 tcs kernel: cpu 0 cold: low 0, high 2, batch 1 May 31 22:31:25 tcs kernel: cpu 1 hot: low 2, high 6, batch 1 May 31 22:31:25 tcs kernel: cpu 1 cold: low 0, high 2, batch 1 May 31 22:31:25 tcs kernel: cpu 2 hot: low 2, high 6, batch 1 May 31 22:31:25 tcs kernel: cpu 2 cold: low 0, high 2, batch 1 May 31 22:31:25 tcs kerne...
2018 Mar 05
0
Why files goes to hot tier and cold tier at same time
Hi, The actual data will be in the hot tier only till demotion. The file that you see on the cold tier is just a linkto file of the file on the hot tier. These linkto file are necessary for the internal working of the tier. On Mon, Mar 5, 2018 at 1:16 PM, Sherin George <allmyforums at outlook.in> wrote: > Hi Guys > > Got a quick question regarding hot tier and cold tier. > I...
2017 Aug 15
8
[RFC] Enhance Partial Inliner by using a general outlining scheme for cold blocks
...d BranchProbability information is only used when attempting to inline the early return code, and not used to determine whether to outline the slow code. ### Proposed changes In addition to looking for early returns, we should utilize profile information to outline blocks that are considered cold. If we can sufficiently reduce the size of the original function via this type of outlining, inlining should be able to inline the rest of the hot code. ## Details With the presence of profile information, we have a view of what code is infrequently executed and make better decisions on wha...
2017 Aug 24
1
[RFC] Enhance Partial Inliner by using a general outlining scheme for cold blocks
Hi David, So I've began doing some implementation on the outlining portion of the code. Currently, I got the partial inliner to outline cold regions (single entry, single exit) of the code, based solely on the existence of ProfileSummaryInfo (ie. profiling data). However, I have some concerns on how this will co-exist with the existing code that peels early returns. The control flow looks something like this: // New Code: find cold r...
2017 Aug 24
3
[RFC] Enhance Partial Inliner by using a general outlining scheme for cold blocks
Hi David, The only reason I can see to use the 'pattern matching' part as a fall-back is in case we cannot inline the (what I'm assuming would be) a much bigger hot-path-only cloned function for whatever reason. What I'm assuming here is that after cold-region outlining, we may still have a large portion of the original function body to attempt to inline, whereas the pattern matching method will only contain a few basic blocks, giving a better chance to inline something. For your (2) point, I think we'll have to be careful here. Without a se...