Thanks for clarifying, Craig! +Shengchen: It looks like for the rest of the suggestions we'd need to make changes to tablegen. We will address them after the merge as we will have a use case in monorepo. Just to clarify: our understanding is that you support the merge but just have suggestions. We don't want to be blocked right now as we've started the merge process this week. Does that sound right to you? On Mon, Jan 10, 2022 at 2:02 PM Craig Topper <craig.topper at gmail.com> wrote:> > On Mon, Jan 10, 2022 at 1:12 PM Amir Aupov via llvm-dev < > llvm-dev at lists.llvm.org> wrote: > >> Hi Shengchen, >> >> Thank you for the code review and your suggestions! >> We've addressed some of them (see inline) but still in the process of >> addressing them all. >> >> On Tue, Jan 4, 2022 at 7:56 PM Kan, Shengchen <shengchen.kan at intel.com> >> wrote: >> >>> Hi Amir >>> >>> >>> >>> I read the MC-related code and it looks fine to me in general. I’m also >>> supportive for the merge. The following are some suggestions. >>> >>> >>> >>> 1. >>> https://llvm.org/docs/CodingStandards.html#braced-initializer-lists >>> If you use a braced initializer list when initializing a variable, use an >>> equals before the open curly brace. e.g BinaryBasicBlock.h >>> >>> Addressed, soon to be published. >> >> >>> >>> 1. >>> https://llvm.org/docs/ProgrammersManual.html#fine-grained-debug-info-with-debug-type-and-the-debug-only-option >>> DEBUG_TYPE should not be target specific, >>> >>> e.g, the definition of DEBUG_TYPE should be same in X86MCPlusBuilder.cpp >>> and AArch64MCPlusBuilder.cpp >>> >> Addressed in >> https://github.com/facebookincubator/BOLT/commit/7ca0c5a26714754ec1dd72576ddecaa8b459842c >> >> >>> >>> 1. Functions getShortArithOpcode, getShortBranchOpcode in >>> X86MCPlusBuilder.cpp use reversed mappings as functions >>> getRelaxedOpcodeBranch, getRelaxedOpcodeArith in X86AsmBackend.cpp. Writing >>> twice in two places makes the code difficult to maintain, could we generate >>> the table with tablgen? The memory folding and unfolding tables in >>> X86InstrFoldTables.cpp give an example. >>> >>> To make sure I get it right: the tables in X86InstrFoldTables.cpp were >> generated with tablegen at some point, but are checked in as source code? >> Is there a way to rebuild these tables, so I can use that as an example? >> > > There is a tablegen backend that can generate it, but it has some bugs. > Especially around some oddities with the SSE movd/movq instructions if I > remember right. I used to run it after adding new instructions and picked > the table entries I thought were ok. The code is in llvm/utils/TableGen/X86FoldTablesEmitter.cpp > but I'm not sure how helpful it is. It's mostly trying to find pairs of > instructions that different in the register vs memory form of the modrm > byte with rest of the encoding being the same. For your case, the > instructions differ in the opcode byte. I guess you'd have to write a > tablegen backend that knew rules like if opcode==0x81 or 0x83, the other > instruction is the one with the other opcode, but rest of the encoding is > the same. > > >> >>> 1. It would better if functions isADD, isAND, isCMP, isDEC, isINC, >>> isSUB and isTEST can be generated with tablgen. >>> >>> Removed our definitions of isINC/isDEC: >> https://github.com/facebookincubator/BOLT/commit/9fec2d58964d643a7436ba2804f8fb967f23881e >> How do we approach generating isADD/... with tablegen? There are >> appropriate definitions (`defm ADD`, etc) >> in llvm/lib/Target/X86/X86InstrArithmetic.td but I'm not sure how to get >> tablegen to generate the tables. >> > > I think you'd have to add isAdd, isCmp, etc. fields to the X86Inst class > and set them on the appropriate instructions. Then have a tablegen backend > collect all the instructions with each property. Or you could figure out > the different encoding rules like if the opcode is 0x80-0x83 than the > format being MRM4r/MRM4m/MRM4X mean the opcode is an AND. > > >> Thanks, >> -Amir >> >> >>> >>> Best >>> >>> Shengchen >>> >>> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of *Wang, >>> Phoebe via llvm-dev >>> *Sent:* Thursday, December 30, 2021 5:33 PM >>> *To:* Amir Aupov <amir.aupov at gmail.com> >>> *Cc:* llvm-dev at lists.llvm.org >>> *Subject:* Re: [llvm-dev] Preparing BOLT for LLVM monorepo >>> >>> >>> >>> Hi Amir, >>> >>> >>> >>> Amazing job! Thanks for the quick and perfect changes. I don’t have any >>> other problems now. Hoping seeing the code in main trunk soon. >>> >>> >>> >>> Thanks >>> >>> Phoebe >>> >>> >>> >>> *From:* Amir Aupov <amir.aupov at gmail.com> >>> *Sent:* Thursday, December 30, 2021 4:32 PM >>> *To:* Wang, Phoebe <phoebe.wang at intel.com> >>> *Cc:* llvm-dev at lists.llvm.org >>> *Subject:* Re: [llvm-dev] Preparing BOLT for LLVM monorepo >>> >>> >>> >>> Hi Phoebe, >>> >>> >>> >>> Thank you again for reviewing the codebase and for your feedback. We've >>> addressed it in recent commits: >>> >>> > 1. Use the monorepo of LLVM in the example for convenience in https://github.com/facebookincubator/BOLT/blob/main/bolt/docs/OptimizingClang.md#getting-clang-7-sources >>> >>> E.g, git clone --branch=release/7.x https://github.com/llvm/llvm-project.git >>> >>> https://github.com/facebookincubator/BOLT/commit/c4ed6a11eb264660f5cfe7f5165df35b63d52608 >>> >>> >>> >>> > 1. I found there are 142 code with “not implemented”, most of which are in Core/MCPlusBuilder.h. >>> >>> * Do they affect the functionality of BOLT? >>> >>> * Do you have plan to implement them recently or can they be removed instead? >>> >>> Explained above. >>> >>> > 2. I noticed some inconsistent use of braces in the code. Maybe better to follow with LLVM coding standard<https://llvm.org/docs/CodingStandards.html#don-t-use-braces-on-simple-single-statement-bodies-of-if-else-loop-statements>. >>> >>> * https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/BinaryBasicBlock.cpp#L241, L289<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/BinaryBasicBlock.cpp#L289>, etc. >>> >>> * https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/DebugData.cpp#L193 >>> >>> https://github.com/facebookincubator/BOLT/commit/fe3fcf094e320af4da4a98a5387820cd0ca826b9 >>> >>> https://github.com/facebookincubator/BOLT/commit/b40699ed90882465e49ec46e4d3ad28623e54c69 >>> >>> https://github.com/facebookincubator/BOLT/commit/c21ee425083276a7af700e232df15e2a3d31dacb >>> >>> https://github.com/facebookincubator/BOLT/commit/ebc06afcfc35fbb4360b3788adeb6228c27ad830 >>> >>> https://github.com/facebookincubator/BOLT/commit/dca3003fce711daee387bbbf66bf4ecbe1b9ab9b >>> >>> https://github.com/facebookincubator/BOLT/commit/cd526a4550455b0f203771088cef6e512b0dd1a2 >>> >>> > 3. Some files don’t have a descriptions in the first line, e.g. DynoStats.cpp<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/DynoStats.cpp#L1>, ParallelUtilities.cpp<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/ParallelUtilities.cpp#L1>, etc. >>> >>> > 4. Leaving without descriptions might be fine, but the format should be consistent. Leaving with spaces like in BinaryFunctionProfile.cpp<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/BinaryFunctionProfile.cpp#L1> doesn’t make sense. >>> >>> >>> https://github.com/facebookincubator/BOLT/commit/349c8abc696919e989418895f4ade24a4c910617 >>> >>> >>> >>> On Tue, Dec 21, 2021 at 4:22 AM Wang, Phoebe <phoebe.wang at intel.com> >>> wrote: >>> >>> Hi Amir, >>> >>> >>> >>> It makes sense to me. Thanks! >>> >>> >>> >>> *From:* llvm-dev <llvm-dev-bounces at lists.llvm.org> *On Behalf Of *Amir >>> Aupov via llvm-dev >>> *Sent:* Tuesday, December 21, 2021 8:57 AM >>> *To:* llvm-dev at lists.llvm.org >>> *Subject:* Re: [llvm-dev] Preparing BOLT for LLVM monorepo >>> >>> >>> >>> Hi Phoebe, >>> >>> >>> >>> Thanks again for taking the time to look at the codebase. We've added >>> your feedback to our upstreaming tracking issue on github: >>> https://github.com/facebookincubator/BOLT/issues/248. We've started >>> addressing the issues. >>> >>> >>> >>> Regarding "not implemented” in Core/MCPlusBuilder.h: it's a design >>> decision to prevent the use of non-implemented target-specific methods >>> (these are virtual methods). Target-specific implementation goes to >>> X86MCPlusBuilder and AArch64MCPlusBuilder. Some methods only make sense for >>> specific targets, so we intend to keep llvm_unreachable statements in >>> MCPlusBuilder.h. >>> >>> _______________________________________________ >> LLVM Developers mailing list >> llvm-dev at lists.llvm.org >> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev >> >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20220110/6d0691ed/attachment.html>
Kan, Shengchen via llvm-dev
2022-Jan-11 02:22 UTC
[llvm-dev] Preparing BOLT for LLVM monorepo
Yes, I am supportive for the merge and those are only suggestions and are not the block issues for the merge. See more comments below. Best Shengchen From: Amir Aupov <amir.aupov at gmail.com> Sent: Tuesday, January 11, 2022 6:24 AM To: Craig Topper <craig.topper at gmail.com> Cc: Kan, Shengchen <shengchen.kan at intel.com>; llvm-dev at lists.llvm.org Subject: Re: [llvm-dev] Preparing BOLT for LLVM monorepo Thanks for clarifying, Craig! +Shengchen: It looks like for the rest of the suggestions we'd need to make changes to tablegen. We will address them after the merge as we will have a use case in monorepo. Just to clarify: our understanding is that you support the merge but just have suggestions. We don't want to be blocked right now as we've started the merge process this week. Does that sound right to you? On Mon, Jan 10, 2022 at 2:02 PM Craig Topper <craig.topper at gmail.com<mailto:craig.topper at gmail.com>> wrote: On Mon, Jan 10, 2022 at 1:12 PM Amir Aupov via llvm-dev <llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org>> wrote: Hi Shengchen, Thank you for the code review and your suggestions! We've addressed some of them (see inline) but still in the process of addressing them all. On Tue, Jan 4, 2022 at 7:56 PM Kan, Shengchen <shengchen.kan at intel.com<mailto:shengchen.kan at intel.com>> wrote: Hi Amir I read the MC-related code and it looks fine to me in general. I’m also supportive for the merge. The following are some suggestions. 1. https://llvm.org/docs/CodingStandards.html#braced-initializer-lists If you use a braced initializer list when initializing a variable, use an equals before the open curly brace. e.g BinaryBasicBlock.h Addressed, soon to be published. 1. https://llvm.org/docs/ProgrammersManual.html#fine-grained-debug-info-with-debug-type-and-the-debug-only-option DEBUG_TYPE should not be target specific, e.g, the definition of DEBUG_TYPE should be same in X86MCPlusBuilder.cpp and AArch64MCPlusBuilder.cpp Addressed in https://github.com/facebookincubator/BOLT/commit/7ca0c5a26714754ec1dd72576ddecaa8b459842c 1. Functions getShortArithOpcode, getShortBranchOpcode in X86MCPlusBuilder.cpp use reversed mappings as functions getRelaxedOpcodeBranch, getRelaxedOpcodeArith in X86AsmBackend.cpp. Writing twice in two places makes the code difficult to maintain, could we generate the table with tablgen? The memory folding and unfolding tables in X86InstrFoldTables.cpp give an example. To make sure I get it right: the tables in X86InstrFoldTables.cpp were generated with tablegen at some point, but are checked in as source code? Is there a way to rebuild these tables, so I can use that as an example? There is a tablegen backend that can generate it, but it has some bugs. Especially around some oddities with the SSE movd/movq instructions if I remember right. I used to run it after adding new instructions and picked the table entries I thought were ok. The code is in llvm/utils/TableGen/X86FoldTablesEmitter.cpp but I'm not sure how helpful it is. It's mostly trying to find pairs of instructions that different in the register vs memory form of the modrm byte with rest of the encoding being the same. For your case, the instructions differ in the opcode byte. I guess you'd have to write a tablegen backend that knew rules like if opcode==0x81 or 0x83, the other instruction is the one with the other opcode, but rest of the encoding is the same. I think we can just check the instruction name rather than opcode in the tablgen, e,g. ADD32ri and ADD32ri8 have the same prefix “ADD32ri”, but one of them has suffix “8”. If you do not want to hand-write the table rather than generate it w/ tablgen, at least getShortArithOpcode can share the same table with getRelaxedOpcodeArith. The function pair lookupUnfoldTable and lookupFoldTable is an example. 1. It would better if functions isADD, isAND, isCMP, isDEC, isINC, isSUB and isTEST can be generated with tablgen. Removed our definitions of isINC/isDEC: https://github.com/facebookincubator/BOLT/commit/9fec2d58964d643a7436ba2804f8fb967f23881e How do we approach generating isADD/... with tablegen? There are appropriate definitions (`defm ADD`, etc) in llvm/lib/Target/X86/X86InstrArithmetic.td but I'm not sure how to get tablegen to generate the tables. I think you'd have to add isAdd, isCmp, etc. fields to the X86Inst class and set them on the appropriate instructions. Then have a tablegen backend collect all the instructions with each property. Or you could figure out the different encoding rules like if the opcode is 0x80-0x83 than the format being MRM4r/MRM4m/MRM4X mean the opcode is an AND. I think we can just check the instruction name in tablgen, e.g, ADD8ri, ADD16ri, ADD32ri, ADD32ri8 and ADD64ri have same prefix “ADD” followed by a number . Thanks, -Amir Best Shengchen From: llvm-dev <llvm-dev-bounces at lists.llvm.org<mailto:llvm-dev-bounces at lists.llvm.org>> On Behalf Of Wang, Phoebe via llvm-dev Sent: Thursday, December 30, 2021 5:33 PM To: Amir Aupov <amir.aupov at gmail.com<mailto:amir.aupov at gmail.com>> Cc: llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] Preparing BOLT for LLVM monorepo Hi Amir, Amazing job! Thanks for the quick and perfect changes. I don’t have any other problems now. Hoping seeing the code in main trunk soon. Thanks Phoebe From: Amir Aupov <amir.aupov at gmail.com<mailto:amir.aupov at gmail.com>> Sent: Thursday, December 30, 2021 4:32 PM To: Wang, Phoebe <phoebe.wang at intel.com<mailto:phoebe.wang at intel.com>> Cc: llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] Preparing BOLT for LLVM monorepo Hi Phoebe, Thank you again for reviewing the codebase and for your feedback. We've addressed it in recent commits:> 1. Use the monorepo of LLVM in the example for convenience in https://github.com/facebookincubator/BOLT/blob/main/bolt/docs/OptimizingClang.md#getting-clang-7-sourcesE.g, git clone --branch=release/7.x https://github.com/llvm/llvm-project.git https://github.com/facebookincubator/BOLT/commit/c4ed6a11eb264660f5cfe7f5165df35b63d52608> 1. I found there are 142 code with “not implemented”, most of which are in Core/MCPlusBuilder.h.* Do they affect the functionality of BOLT? * Do you have plan to implement them recently or can they be removed instead? Explained above.> 2. I noticed some inconsistent use of braces in the code. Maybe better to follow with LLVM coding standard<https://llvm.org/docs/CodingStandards.html#don-t-use-braces-on-simple-single-statement-bodies-of-if-else-loop-statements>.* https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/BinaryBasicBlock.cpp#L241, L289<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/BinaryBasicBlock.cpp#L289>, etc. * https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/DebugData.cpp#L193 https://github.com/facebookincubator/BOLT/commit/fe3fcf094e320af4da4a98a5387820cd0ca826b9 https://github.com/facebookincubator/BOLT/commit/b40699ed90882465e49ec46e4d3ad28623e54c69 https://github.com/facebookincubator/BOLT/commit/c21ee425083276a7af700e232df15e2a3d31dacb https://github.com/facebookincubator/BOLT/commit/ebc06afcfc35fbb4360b3788adeb6228c27ad830 https://github.com/facebookincubator/BOLT/commit/dca3003fce711daee387bbbf66bf4ecbe1b9ab9b https://github.com/facebookincubator/BOLT/commit/cd526a4550455b0f203771088cef6e512b0dd1a2> 3. Some files don’t have a descriptions in the first line, e.g. DynoStats.cpp<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/DynoStats.cpp#L1>, ParallelUtilities.cpp<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/ParallelUtilities.cpp#L1>, etc.> 4. Leaving without descriptions might be fine, but the format should be consistent. Leaving with spaces like in BinaryFunctionProfile.cpp<https://github.com/facebookincubator/BOLT/blob/main/bolt/lib/Core/BinaryFunctionProfile.cpp#L1> doesn’t make sense.https://github.com/facebookincubator/BOLT/commit/349c8abc696919e989418895f4ade24a4c910617 On Tue, Dec 21, 2021 at 4:22 AM Wang, Phoebe <phoebe.wang at intel.com<mailto:phoebe.wang at intel.com>> wrote: Hi Amir, It makes sense to me. Thanks! From: llvm-dev <llvm-dev-bounces at lists.llvm.org<mailto:llvm-dev-bounces at lists.llvm.org>> On Behalf Of Amir Aupov via llvm-dev Sent: Tuesday, December 21, 2021 8:57 AM To: llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org> Subject: Re: [llvm-dev] Preparing BOLT for LLVM monorepo Hi Phoebe, Thanks again for taking the time to look at the codebase. We've added your feedback to our upstreaming tracking issue on github: https://github.com/facebookincubator/BOLT/issues/248. We've started addressing the issues. Regarding "not implemented” in Core/MCPlusBuilder.h: it's a design decision to prevent the use of non-implemented target-specific methods (these are virtual methods). Target-specific implementation goes to X86MCPlusBuilder and AArch64MCPlusBuilder. Some methods only make sense for specific targets, so we intend to keep llvm_unreachable statements in MCPlusBuilder.h. _______________________________________________ LLVM Developers mailing list llvm-dev at lists.llvm.org<mailto:llvm-dev at lists.llvm.org> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20220111/31f03ba8/attachment-0001.html>