Initially we intended this to go into 4.3 but it didn''t make it, so here it is rebased onto current staging. Other than the rebasing the only significant change is a major howler in the entry.S code -- we weren''t restoring any of the outer stack frame guest state for 32- or 64-bit guests! Most of the 32-bit guest state is in the inner frame and the outer frame contains "only" the SPSR_* registers. Amazing how far you can get without context switching those! For a 64-bit guest the outer frame includes ELR_el1, so the SMC trap injection code exposed the problem... Ian.
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 01/15] xen: arm: tweak arm64 stack frame layout
Correct definition of UREGS_kernel_sizeof and use it. Correct adjustment of stack on entry and exit. Add 64-bit versions of the build time checks for stack pointer alignment correctness when pushing the stack frames. Lastly, correct the padding in the stack frames to properly align the inner and outer frames and also avoid an unnecessary 64bit padding field. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/arm64/asm-offsets.c | 2 +- xen/arch/arm/arm64/entry.S | 9 +++++---- xen/arch/arm/domain.c | 2 ++ xen/arch/arm/traps.c | 7 +++++++ xen/include/asm-arm/arm64/processor.h | 7 +++---- 5 files changed, 18 insertions(+), 9 deletions(-) diff --git a/xen/arch/arm/arm64/asm-offsets.c b/xen/arch/arm/arm64/asm-offsets.c index 7aacc79..2a225b6 100644 --- a/xen/arch/arm/arm64/asm-offsets.c +++ b/xen/arch/arm/arm64/asm-offsets.c @@ -39,7 +39,7 @@ void __dummy__(void) OFFSET(UREGS_SP_el1, struct cpu_user_regs, sp_el1); OFFSET(UREGS_ELR_el1, struct cpu_user_regs, elr_el1); - OFFSET(UREGS_kernel_sizeof, struct cpu_user_regs, cpsr); + OFFSET(UREGS_kernel_sizeof, struct cpu_user_regs, spsr_el1); DEFINE(UREGS_user_sizeof, sizeof(struct cpu_user_regs)); BLANK(); diff --git a/xen/arch/arm/arm64/entry.S b/xen/arch/arm/arm64/entry.S index 5656f45..b5af1e2 100644 --- a/xen/arch/arm/arm64/entry.S +++ b/xen/arch/arm/arm64/entry.S @@ -35,7 +35,7 @@ lr .req x30 // link register mrs x22, SP_el0 str x22, [x21] - add x21, sp, #UREGS_ELR_el1 + add x21, sp, #UREGS_SP_el1 mrs x22, SP_el1 mrs x23, ELR_el1 stp x22, x23, [x21] @@ -60,7 +60,7 @@ lr .req x30 // link register * Save state on entry to hypervisor */ .macro entry, hyp, compat - sub sp, sp, #(UREGS_SPSR_el1 - UREGS_SP) + sub sp, sp, #(UREGS_SPSR_el1 - UREGS_LR) /* CPSR, PC, SP, LR */ push x28, x29 push x26, x27 push x24, x25 @@ -79,7 +79,7 @@ lr .req x30 // link register .if \hyp == 1 /* Hypervisor mode */ - add x21, sp, #(UREGS_X0 - UREGS_SP) + add x21, sp, #UREGS_kernel_sizeof .else /* Guest mode */ @@ -214,7 +214,8 @@ ENTRY(return_to_hypervisor) pop x26, x27 pop x28, x29 - ldr lr, [sp], #(UREGS_SPSR_el1 - UREGS_SP) + ldr lr, [sp], #(UREGS_SPSR_el1 - UREGS_LR) /* CPSR, PC, SP, LR */ + eret /* diff --git a/xen/arch/arm/domain.c b/xen/arch/arm/domain.c index 6cdfdb5..70fbc0d 100644 --- a/xen/arch/arm/domain.c +++ b/xen/arch/arm/domain.c @@ -434,6 +434,8 @@ int vcpu_initialise(struct vcpu *v) { int rc = 0; + BUILD_BUG_ON( sizeof(struct cpu_info) > STACK_SIZE ); + v->arch.stack = alloc_xenheap_pages(STACK_ORDER, MEMF_node(vcpu_to_node(v))); if ( v->arch.stack == NULL ) return -ENOMEM; diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index bbd60aa..db00c68 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -45,9 +45,16 @@ * entry.S) and struct cpu_info (which lives at the bottom of a Xen * stack) must be doubleword-aligned in size. */ static inline void check_stack_alignment_constraints(void) { +#ifdef CONFIG_ARM_64 + BUILD_BUG_ON((sizeof (struct cpu_user_regs)) & 0xf); + BUILD_BUG_ON((offsetof(struct cpu_user_regs, spsr_el1)) & 0xf); + BUILD_BUG_ON((offsetof(struct cpu_user_regs, lr)) & 0xf); + BUILD_BUG_ON((sizeof (struct cpu_info)) & 0xf); +#else BUILD_BUG_ON((sizeof (struct cpu_user_regs)) & 0x7); BUILD_BUG_ON((offsetof(struct cpu_user_regs, sp_usr)) & 0x7); BUILD_BUG_ON((sizeof (struct cpu_info)) & 0x7); +#endif } static int debug_stack_lines = 20; diff --git a/xen/include/asm-arm/arm64/processor.h b/xen/include/asm-arm/arm64/processor.h index d9fbcb2..c8d4609 100644 --- a/xen/include/asm-arm/arm64/processor.h +++ b/xen/include/asm-arm/arm64/processor.h @@ -51,6 +51,7 @@ struct cpu_user_regs __DECL_REG(x27, r11_fiq); __DECL_REG(x28, r12_fiq); __DECL_REG(/* x29 */ fp, /* r13_fiq */ sp_fiq); + __DECL_REG(/* x30 */ lr, /* r14_fiq */ lr_fiq); register_t sp; /* Valid for hypervisor frames */ @@ -59,7 +60,7 @@ struct cpu_user_regs __DECL_REG(pc, pc32); /* ELR_EL2 */ uint32_t cpsr; /* SPSR_EL2 */ - uint64_t pad0; + uint32_t pad0; /* Align end of kernel frame. */ /* Outer guest frame only from here on... */ @@ -68,7 +69,7 @@ struct cpu_user_regs uint32_t spsr_svc; /* AArch32 */ }; - uint32_t pad1; /* Align */ + uint32_t pad1; /* Doubleword-align the user half of the frame */ /* AArch32 guests only */ uint32_t spsr_fiq, spsr_irq, spsr_und, spsr_abt; @@ -76,8 +77,6 @@ struct cpu_user_regs /* AArch64 guests only */ uint64_t sp_el0; uint64_t sp_el1, elr_el1; - - uint64_t pad2; /* Doubleword-align the user half of the frame */ }; #undef __DECL_REG -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 02/15] xen: arm: rename 32-bit specific zImage field offset constants
This will help avoid confusion when 64-bit Image support is added. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/kernel.c | 28 ++++++++++++++-------------- 1 files changed, 14 insertions(+), 14 deletions(-) diff --git a/xen/arch/arm/kernel.c b/xen/arch/arm/kernel.c index 3953f0f..641b1f0 100644 --- a/xen/arch/arm/kernel.c +++ b/xen/arch/arm/kernel.c @@ -20,12 +20,12 @@ #define KERNEL_FLASH_ADDRESS 0x00000000UL #define KERNEL_FLASH_SIZE 0x00800000UL -#define ZIMAGE_MAGIC_OFFSET 0x24 -#define ZIMAGE_START_OFFSET 0x28 -#define ZIMAGE_END_OFFSET 0x2c -#define ZIMAGE_HEADER_LEN 0x30 +#define ZIMAGE32_MAGIC_OFFSET 0x24 +#define ZIMAGE32_START_OFFSET 0x28 +#define ZIMAGE32_END_OFFSET 0x2c +#define ZIMAGE32_HEADER_LEN 0x30 -#define ZIMAGE_MAGIC 0x016f2818 +#define ZIMAGE32_MAGIC 0x016f2818 struct minimal_dtb_header { uint32_t magic; @@ -115,26 +115,26 @@ static void kernel_zimage_load(struct kernel_info *info) } } -/** - * Check the image is a zImage and return the load address and length +/* + * Check if the image is a 32-bit zImage and setup kernel_info */ -static int kernel_try_zimage_prepare(struct kernel_info *info, +static int kernel_try_zimage32_prepare(struct kernel_info *info, paddr_t addr, paddr_t size) { - uint32_t zimage[ZIMAGE_HEADER_LEN/4]; + uint32_t zimage[ZIMAGE32_HEADER_LEN/4]; uint32_t start, end; struct minimal_dtb_header dtb_hdr; - if ( size < ZIMAGE_HEADER_LEN ) + if ( size < ZIMAGE32_HEADER_LEN ) return -EINVAL; copy_from_paddr(zimage, addr, sizeof(zimage), DEV_SHARED); - if (zimage[ZIMAGE_MAGIC_OFFSET/4] != ZIMAGE_MAGIC) + if (zimage[ZIMAGE32_MAGIC_OFFSET/4] != ZIMAGE32_MAGIC) return -EINVAL; - start = zimage[ZIMAGE_START_OFFSET/4]; - end = zimage[ZIMAGE_END_OFFSET/4]; + start = zimage[ZIMAGE32_START_OFFSET/4]; + end = zimage[ZIMAGE32_END_OFFSET/4]; if ( (end - start) > size ) return -EINVAL; @@ -254,7 +254,7 @@ int kernel_prepare(struct kernel_info *info) info->load_attr = BUFFERABLE; } - rc = kernel_try_zimage_prepare(info, start, size); + rc = kernel_try_zimage32_prepare(info, start, size); if (rc < 0) rc = kernel_try_elf_prepare(info, start, size); -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 03/15] xen: arm: support for loading 64-bit zImage dom0
This is defined in linux/Documentation/arm64/booting.txt. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/kernel.c | 80 +++++++++++++++++++++++++++++++++++++++++++++--- 1 files changed, 75 insertions(+), 5 deletions(-) diff --git a/xen/arch/arm/kernel.c b/xen/arch/arm/kernel.c index 641b1f0..f6ff294 100644 --- a/xen/arch/arm/kernel.c +++ b/xen/arch/arm/kernel.c @@ -27,6 +27,8 @@ #define ZIMAGE32_MAGIC 0x016f2818 +#define ZIMAGE64_MAGIC 0x14000008 + struct minimal_dtb_header { uint32_t magic; uint32_t total_size; @@ -115,6 +117,57 @@ static void kernel_zimage_load(struct kernel_info *info) } } +#ifdef CONFIG_ARM_64 +/* + * Check if the image is a 64-bit zImage and setup kernel_info + */ +static int kernel_try_zimage64_prepare(struct kernel_info *info, + paddr_t addr, paddr_t size) +{ + /* linux/Documentation/arm64/booting.txt */ + struct { + uint32_t magic; + uint32_t res0; + uint64_t text_offset; /* Image load offset */ + uint64_t res1; + uint64_t res2; + } zimage; + uint64_t start, end; + + if ( size < sizeof(zimage) ) + return -EINVAL; + + copy_from_paddr(&zimage, addr, sizeof(zimage), DEV_SHARED); + + if (zimage.magic != ZIMAGE64_MAGIC) + return -EINVAL; + + /* Currently there is no length in the header, so just use the size */ + start = 0; + end = size; + + /* + * Given the above this check is a bit pointless, but leave it + * here in case someone adds a length field in the future. + */ + if ( (end - start) > size ) + return -EINVAL; + + info->zimage.kernel_addr = addr; + + info->zimage.load_addr = info->mem.bank[0].start + + zimage.text_offset; + info->zimage.len = end - start; + + info->entry = info->zimage.load_addr; + info->load = kernel_zimage_load; + + info->type = DOMAIN_PV64; + + return 0; +} +#endif + /* * Check if the image is a 32-bit zImage and setup kernel_info */ @@ -170,6 +223,10 @@ static int kernel_try_zimage32_prepare(struct kernel_info *info, info->load = kernel_zimage_load; info->check_overlap = kernel_zimage_check_overlap; +#ifdef CONFIG_ARM_64 + info->type = DOMAIN_PV32; +#endif + return 0; } @@ -208,6 +265,19 @@ static int kernel_try_elf_prepare(struct kernel_info *info, if ( (rc = elf_xen_parse(&info->elf.elf, &info->elf.parms)) != 0 ) goto err; +#ifdef CONFIG_ARM_64 + if ( elf_32bit(&info->elf.elf) ) + info->type = DOMAIN_PV32; + else if ( elf_64bit(&info->elf.elf) ) + info->type = DOMAIN_PV64; + else + { + printk("Unknown ELF class\n"); + rc = -EINVAL; + goto err; + } +#endif + /* * TODO: can the ELF header be used to find the physical address * to load the image to? Instead of assuming virt == phys. @@ -254,13 +324,13 @@ int kernel_prepare(struct kernel_info *info) info->load_attr = BUFFERABLE; } - rc = kernel_try_zimage32_prepare(info, start, size); - if (rc < 0) - rc = kernel_try_elf_prepare(info, start, size); - #ifdef CONFIG_ARM_64 - info->type = DOMAIN_PV32; /* No 64-bit guest support yet */ + rc = kernel_try_zimage64_prepare(info, start, size); + if (rc < 0) #endif + rc = kernel_try_zimage32_prepare(info, start, size); + if (rc < 0) + rc = kernel_try_elf_prepare(info, start, size); return rc; } -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 04/15] xen: arm: support building a 64-bit dom0 domain
Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- v3: rebased. as part of this move PSR_GUEST_INIT out of the public interface --- xen/arch/arm/domain_build.c | 10 +++++++--- xen/include/asm-arm/processor.h | 2 ++ xen/include/public/arch-arm.h | 2 -- 3 files changed, 9 insertions(+), 5 deletions(-) diff --git a/xen/arch/arm/domain_build.c b/xen/arch/arm/domain_build.c index 5e486c4..7f3035c 100644 --- a/xen/arch/arm/domain_build.c +++ b/xen/arch/arm/domain_build.c @@ -593,9 +593,7 @@ int construct_dom0(struct domain *d) memset(regs, 0, sizeof(*regs)); - regs->pc = (uint32_t)kinfo.entry; - - regs->cpsr = PSR_GUEST_INIT; + regs->pc = (register_t)kinfo.entry; #ifdef CONFIG_ARM_64 d->arch.type = kinfo.type; @@ -603,6 +601,11 @@ int construct_dom0(struct domain *d) if ( is_pv32_domain(d) ) { + regs->cpsr = PSR_GUEST_INIT|PSR_MODE_SVC; + + /* Pretend to be a Cortex A15 */ + d->arch.vpidr = 0x410fc0f0; + /* FROM LINUX head.S * * Kernel startup entry point. @@ -621,6 +624,7 @@ int construct_dom0(struct domain *d) #ifdef CONFIG_ARM_64 else { + regs->cpsr = PSR_GUEST_INIT|PSR_MODE_EL1h; /* From linux/Documentation/arm64/booting.txt */ regs->x0 = kinfo.dtb_paddr; regs->x1 = 0; /* Reserved for future use */ diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index 263bd03..8b9b320 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -37,6 +37,8 @@ #define SCTLR_BASE 0x00c50078 #define HSCTLR_BASE 0x30c51878 +#define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK) + /* HCR Hyp Configuration Register */ #define HCR_TGE (1<<27) #define HCR_TVM (1<<26) diff --git a/xen/include/public/arch-arm.h b/xen/include/public/arch-arm.h index 8aa62d3..cea12b2 100644 --- a/xen/include/public/arch-arm.h +++ b/xen/include/public/arch-arm.h @@ -237,8 +237,6 @@ typedef uint64_t xen_callback_t; #define PSR_IT_MASK (0x0600fc00) /* Thumb If-Then Mask */ #define PSR_JAZELLE (1<<24) /* Jazelle Mode */ -#define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK|PSR_MODE_SVC) - #endif /* __XEN_PUBLIC_ARCH_ARM_H__ */ /* -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 05/15] xen: arm: precalculate VTTBR_EL2 for a domain when setting up its p2m
Mostly just to help with upcoming vcpu_show_registers changes. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/p2m.c | 16 +++++++++------- xen/include/asm-arm/domain.h | 3 +++ 2 files changed, 12 insertions(+), 7 deletions(-) diff --git a/xen/arch/arm/p2m.c b/xen/arch/arm/p2m.c index 9fc5534..307c6d4 100644 --- a/xen/arch/arm/p2m.c +++ b/xen/arch/arm/p2m.c @@ -23,13 +23,10 @@ void dump_p2m_lookup(struct domain *d, paddr_t addr) void p2m_load_VTTBR(struct domain *d) { - struct p2m_domain *p2m = &d->arch.p2m; - paddr_t maddr = page_to_maddr(p2m->first_level); - uint64_t vttbr = maddr; - - vttbr |= ((uint64_t)p2m->vmid&0xff)<<48; - - WRITE_SYSREG64(vttbr, VTTBR_EL2); + if ( is_idle_domain(d) ) + return; + BUG_ON(!d->arch.vttbr); + WRITE_SYSREG64(d->arch.vttbr, VTTBR_EL2); isb(); /* Ensure update is visible */ } @@ -301,6 +298,9 @@ int p2m_alloc_table(struct domain *d) p2m->first_level = page; + d->arch.vttbr = page_to_maddr(p2m->first_level) + | ((uint64_t)p2m->vmid&0xff)<<48; + spin_unlock(&p2m->lock); return 0; @@ -332,6 +332,8 @@ int p2m_init(struct domain *d) /* Zero is reserved */ p2m->vmid = d->domain_id + 1; + d->arch.vttbr = 0; + p2m->first_level = NULL; return 0; diff --git a/xen/include/asm-arm/domain.h b/xen/include/asm-arm/domain.h index 339b6e6..31a2d66 100644 --- a/xen/include/asm-arm/domain.h +++ b/xen/include/asm-arm/domain.h @@ -62,7 +62,10 @@ struct arch_domain enum domain_type type; #endif + /* Virtual MMU */ struct p2m_domain p2m; + uint64_t vttbr; + struct hvm_domain hvm_domain; xen_pfn_t *grant_table_gpfn; -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 06/15] xen: arm: improve register dump output for 64-bit guest (and more generally too)
Signed-off-by: Ian Campbell <ian.campbell@citrix.com> --- xen/arch/arm/traps.c | 171 +++++++++++++++++++++++++++-------------- xen/include/asm-arm/cpregs.h | 1 + 2 files changed, 113 insertions(+), 59 deletions(-) diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index db00c68..3d33fba 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -241,7 +241,7 @@ void panic_PAR(uint64_t par) msg = decode_fsc( (par&PAR_FSC_MASK) >> PAR_FSC_SHIFT, &level); - printk("PAR: %010"PRIx64": %s stage %d%s%s\n", + printk("PAR: %016"PRIx64": %s stage %d%s%s\n", par, msg, stage, second_in_first ? " during second stage lookup" : "", @@ -299,33 +299,60 @@ static void inject_undef_exception(struct cpu_user_regs *regs, } struct reg_ctxt { - uint32_t sctlr, tcr; - uint64_t ttbr0, ttbr1; + /* Guest-side state */ + uint32_t sctlr_el1, tcr_el1; + uint64_t ttbr0_el1, ttbr1_el1; #ifdef CONFIG_ARM_32 + uint32_t dfsr, ifsr; uint32_t dfar, ifar; #else + uint32_t esr_el1; uint64_t far; + uint32_t ifsr32_el2; #endif + + /* Hypervisor-side state */ + uint64_t vttbr_el2; }; +static const char *mode_string(uint32_t cpsr) +{ + uint32_t mode; + static const char *mode_strings[] = { + [PSR_MODE_USR] = "32-bit Guest USR", + [PSR_MODE_FIQ] = "32-bit Guest FIQ", + [PSR_MODE_IRQ] = "32-bit Guest IRQ", + [PSR_MODE_SVC] = "32-bit Guest SVC", + [PSR_MODE_MON] = "32-bit Monitor", + [PSR_MODE_ABT] = "32-bit Guest ABT", + [PSR_MODE_HYP] = "Hypervisor", + [PSR_MODE_UND] = "32-bit Guest UND", + [PSR_MODE_SYS] = "32-bit Guest SYS", +#ifdef CONFIG_ARM_64 + [PSR_MODE_EL3h] = "64-bit EL3h (Monitor, handler)", + [PSR_MODE_EL3t] = "64-bit EL3t (Monitor, thread)", + [PSR_MODE_EL2h] = "64-bit EL2h (Hypervisor, handler)", + [PSR_MODE_EL2t] = "64-bit EL2t (Hypervisor, thread)", + [PSR_MODE_EL1h] = "64-bit EL1h (Guest Kernel, handler)", + [PSR_MODE_EL1t] = "64-bit EL1t (Guest Kernel, thread)", + [PSR_MODE_EL0t] = "64-bit EL0t (Guest User)", +#endif + }; + mode = cpsr & PSR_MODE_MASK; + + if ( mode > ARRAY_SIZE(mode_strings) ) + return "Unknown"; + return mode_strings[mode] ? : "Unknown"; +} + static void show_registers_32(struct cpu_user_regs *regs, struct reg_ctxt *ctxt, int guest_mode, const struct vcpu *v) { - static const char *mode_strings[] = { - [PSR_MODE_USR] = "USR", - [PSR_MODE_FIQ] = "FIQ", - [PSR_MODE_IRQ] = "IRQ", - [PSR_MODE_SVC] = "SVC", - [PSR_MODE_MON] = "MON", - [PSR_MODE_ABT] = "ABT", - [PSR_MODE_HYP] = "HYP", - [PSR_MODE_UND] = "UND", - [PSR_MODE_SYS] = "SYS" - }; #ifdef CONFIG_ARM_64 + BUG_ON( ! (regs->cpsr & PSR_MODE_BIT) ); printk("PC: %08"PRIx32"\n", regs->pc32); #else printk("PC: %08"PRIx32, regs->pc); @@ -333,9 +360,8 @@ static void show_registers_32(struct cpu_user_regs *regs, print_symbol(" %s", regs->pc); printk("\n"); #endif - printk("CPSR: %08"PRIx32" MODE:%s%s\n", regs->cpsr, - guest_mode ? "32-bit Guest " : "Hypervisor", - guest_mode ? mode_strings[regs->cpsr & PSR_MODE_MASK] : ""); + printk("CPSR: %08"PRIx32" MODE:%s\n", regs->cpsr, + mode_string(regs->cpsr)); printk(" R0: %08"PRIx32" R1: %08"PRIx32" R2: %08"PRIx32" R3: %08"PRIx32"\n", regs->r0, regs->r1, regs->r2, regs->r3); printk(" R4: %08"PRIx32" R5: %08"PRIx32" R6: %08"PRIx32" R7: %08"PRIx32"\n", @@ -376,15 +402,19 @@ static void show_registers_32(struct cpu_user_regs *regs, if ( guest_mode ) { - printk("TTBR0 %010"PRIx64" TTBR1 %010"PRIx64" TCR %08"PRIx32"\n", - ctxt->ttbr0, ctxt->ttbr1, ctxt->tcr); - printk("SCTLR %08"PRIx32"\n", ctxt->sctlr); - printk("IFAR %08"PRIx32" DFAR %08"PRIx32"\n", + printk(" SCTLR: %08"PRIx32"\n", ctxt->sctlr_el1); + printk(" TCR: %08"PRIx32"\n", ctxt->tcr_el1); + printk(" TTBR0: %016"PRIx64"\n", ctxt->ttbr0_el1); + printk(" TTBR1: %016"PRIx64"\n", ctxt->ttbr1_el1); + printk(" IFAR: %08"PRIx32", IFSR: %08"PRIx32"\n" + " DFAR: %08"PRIx32", DFSR: %08"PRIx32"\n", #ifdef CONFIG_ARM_64 (uint32_t)(ctxt->far >> 32), - (uint32_t)(ctxt->far & 0xffffffff) + ctxt->ifsr32_el2, + (uint32_t)(ctxt->far & 0xffffffff), + ctxt->esr_el1 #else - ctxt->ifar, ctxt->dfar + ctxt->ifar, ctxt->ifsr, ctxt->dfar, ctxt->dfsr #endif ); printk("\n"); @@ -397,13 +427,25 @@ static void show_registers_64(struct cpu_user_regs *regs, int guest_mode, const struct vcpu *v) { + + BUG_ON( (regs->cpsr & PSR_MODE_BIT) ); + printk("PC: %016"PRIx64, regs->pc); if ( !guest_mode ) print_symbol(" %s", regs->pc); printk("\n"); - printk("SP: %08"PRIx64"\n", regs->sp); + printk("LR: %016"PRIx64"\n", regs->lr); + if ( guest_mode ) + { + printk("SP_EL0: %016"PRIx64"\n", regs->sp_el0); + printk("SP_EL1: %016"PRIx64"\n", regs->sp_el1); + } + else + { + printk("SP: %016"PRIx64"\n", regs->sp); + } printk("CPSR: %08"PRIx32" MODE:%s\n", regs->cpsr, - guest_mode ? "64-bit Guest" : "Hypervisor"); + mode_string(regs->cpsr)); printk(" X0: %016"PRIx64" X1: %016"PRIx64" X2: %016"PRIx64"\n", regs->x0, regs->x1, regs->x2); printk(" X3: %016"PRIx64" X4: %016"PRIx64" X5: %016"PRIx64"\n", @@ -422,17 +464,20 @@ static void show_registers_64(struct cpu_user_regs *regs, regs->x21, regs->x22, regs->x23); printk(" X24: %016"PRIx64" X25: %016"PRIx64" X26: %016"PRIx64"\n", regs->x24, regs->x25, regs->x26); - printk(" X27: %016"PRIx64" X28: %016"PRIx64" X29: %016"PRIx64"\n", - regs->x27, regs->x28, regs->lr); + printk(" X27: %016"PRIx64" X28: %016"PRIx64" FP: %016"PRIx64"\n", + regs->x27, regs->x28, regs->fp); printk("\n"); if ( guest_mode ) { - printk("SCTLR_EL1: %08"PRIx32"\n", ctxt->sctlr); - printk(" TCR_EL1: %08"PRIx32"\n", ctxt->tcr); - printk("TTBR0_EL1: %010"PRIx64"\n", ctxt->ttbr0); - printk("TTBR1_EL1: %010"PRIx64"\n", ctxt->ttbr1); - printk(" FAR_EL1: %010"PRIx64"\n", ctxt->far); + printk(" ELR_EL1: %016"PRIx64"\n", regs->elr_el1); + printk(" ESR_EL1: %08"PRIx32"\n", ctxt->esr_el1); + printk(" FAR_EL1: %016"PRIx64"\n", ctxt->far); + printk("\n"); + printk(" SCTLR_EL1: %08"PRIx32"\n", ctxt->sctlr_el1); + printk(" TCR_EL1: %08"PRIx32"\n", ctxt->tcr_el1); + printk(" TTBR0_EL1: %016"PRIx64"\n", ctxt->ttbr0_el1); + printk(" TTBR1_EL1: %016"PRIx64"\n", ctxt->ttbr1_el1); printk("\n"); } } @@ -464,60 +509,68 @@ static void _show_registers(struct cpu_user_regs *regs, show_registers_32(regs, ctxt, guest_mode, v); #endif } - -#ifdef CONFIG_ARM_32 - printk("HTTBR %"PRIx64"\n", READ_CP64(HTTBR)); - printk("HDFAR %"PRIx32"\n", READ_CP32(HDFAR)); - printk("HIFAR %"PRIx32"\n", READ_CP32(HIFAR)); - printk("HPFAR %"PRIx32"\n", READ_CP32(HPFAR)); - printk("HCR %08"PRIx32"\n", READ_CP32(HCR)); - printk("HSR %"PRIx32"\n", READ_CP32(HSR)); - printk("VTTBR %010"PRIx64"\n", READ_CP64(VTTBR)); + printk(" VTCR_EL2: %08"PRIx32"\n", READ_SYSREG32(VTCR_EL2)); + printk(" VTTBR_EL2: %016"PRIx64"\n", ctxt->vttbr_el2); printk("\n"); - printk("DFSR %"PRIx32" DFAR %"PRIx32"\n", READ_CP32(DFSR), READ_CP32(DFAR)); - printk("IFSR %"PRIx32" IFAR %"PRIx32"\n", READ_CP32(IFSR), READ_CP32(IFAR)); + printk(" SCTLR_EL2: %08"PRIx32"\n", READ_SYSREG32(SCTLR_EL2)); + printk(" HCR_EL2: %016"PRIregister"\n", READ_SYSREG(HCR_EL2)); + printk(" TTBR0_EL2: %016"PRIx64"\n", READ_SYSREG64(TTBR0_EL2)); printk("\n"); + printk(" ESR_EL2: %08"PRIx32"\n", READ_SYSREG32(ESR_EL2)); + printk(" HPFAR_EL2: %016"PRIregister"\n", READ_SYSREG(HPFAR_EL2)); + +#ifdef CONFIG_ARM_32 + printk(" HDFAR: %08"PRIx32"\n", READ_CP32(HDFAR)); + printk(" HIFAR: %08"PRIx32"\n", READ_CP32(HIFAR)); #else - printk("TTBR0_EL2: %"PRIx64"\n", READ_SYSREG64(TTBR0_EL2)); - printk(" FAR_EL2: %"PRIx64"\n", READ_SYSREG64(FAR_EL2)); - printk("HPFAR_EL2: %"PRIx64"\n", READ_SYSREG64(HPFAR_EL2)); - printk(" HCR_EL2: %"PRIx64"\n", READ_SYSREG64(HCR_EL2)); - printk(" ESR_EL2: %"PRIx64"\n", READ_SYSREG64(ESR_EL2)); - printk("VTTBR_EL2: %"PRIx64"\n", READ_SYSREG64(VTTBR_EL2)); - printk("\n"); + printk(" FAR_EL2: %016"PRIx64"\n", READ_SYSREG64(FAR_EL2)); #endif + printk("\n"); } void show_registers(struct cpu_user_regs *regs) { struct reg_ctxt ctxt; - ctxt.sctlr = READ_SYSREG(SCTLR_EL1); - ctxt.tcr = READ_SYSREG(TCR_EL1); - ctxt.ttbr0 = READ_SYSREG64(TTBR0_EL1); - ctxt.ttbr1 = READ_SYSREG64(TTBR1_EL1); + ctxt.sctlr_el1 = READ_SYSREG(SCTLR_EL1); + ctxt.tcr_el1 = READ_SYSREG(TCR_EL1); + ctxt.ttbr0_el1 = READ_SYSREG64(TTBR0_EL1); + ctxt.ttbr1_el1 = READ_SYSREG64(TTBR1_EL1); #ifdef CONFIG_ARM_32 ctxt.dfar = READ_CP32(DFAR); ctxt.ifar = READ_CP32(IFAR); + ctxt.dfsr = READ_CP32(DFSR); + ctxt.ifsr = READ_CP32(IFSR); #else ctxt.far = READ_SYSREG(FAR_EL1); + ctxt.esr_el1 = READ_SYSREG(ESR_EL1); + ctxt.ifsr32_el2 = READ_SYSREG(IFSR32_EL2); #endif + ctxt.vttbr_el2 = READ_SYSREG64(VTTBR_EL2); + _show_registers(regs, &ctxt, guest_mode(regs), current); } void vcpu_show_registers(const struct vcpu *v) { struct reg_ctxt ctxt; - ctxt.sctlr = v->arch.sctlr; - ctxt.tcr = v->arch.ttbcr; - ctxt.ttbr0 = v->arch.ttbr0; - ctxt.ttbr1 = v->arch.ttbr1; + ctxt.sctlr_el1 = v->arch.sctlr; + ctxt.tcr_el1 = v->arch.ttbcr; + ctxt.ttbr0_el1 = v->arch.ttbr0; + ctxt.ttbr1_el1 = v->arch.ttbr1; #ifdef CONFIG_ARM_32 ctxt.dfar = v->arch.dfar; ctxt.ifar = v->arch.ifar; + ctxt.dfsr = v->arch.dfsr; + ctxt.ifsr = v->arch.ifsr; #else ctxt.far = v->arch.far; + ctxt.esr_el1 = v->arch.esr; + ctxt.ifsr32_el2 = v->arch.ifsr; #endif + + ctxt.vttbr_el2 = v->domain->arch.vttbr; + _show_registers(&v->arch.cpu_info->guest_cpu_user_regs, &ctxt, 1, v); } @@ -957,7 +1010,7 @@ void dump_guest_s1_walk(struct domain *d, vaddr_t addr) printk("dom%d VA 0x%08"PRIvaddr"\n", d->domain_id, addr); printk(" TTBCR: 0x%08"PRIx32"\n", ttbcr); - printk(" TTBR0: 0x%010"PRIx64" = 0x%"PRIpaddr"\n", + printk(" TTBR0: 0x%016"PRIx64" = 0x%"PRIpaddr"\n", ttbr0, p2m_lookup(d, ttbr0 & PAGE_MASK)); if ( ttbcr & TTBCR_EAE ) diff --git a/xen/include/asm-arm/cpregs.h b/xen/include/asm-arm/cpregs.h index 122dd1a..2960492 100644 --- a/xen/include/asm-arm/cpregs.h +++ b/xen/include/asm-arm/cpregs.h @@ -259,6 +259,7 @@ #define DACR32_EL2 DACR #define ESR_EL2 HSR #define HCR_EL2 HCR +#define HPFAR_EL2 HPFAR #define ID_AFR0_EL1 ID_AFR0 #define ID_DFR0_EL1 ID_DFR0 #define ID_ISAR0_EL1 ID_ISAR0 -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 07/15] xen: arm: support dumping 64-bit guest stack
Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/traps.c | 80 +++++++++++++++++++++++++++++++++++++++++++++++-- 1 files changed, 76 insertions(+), 4 deletions(-) diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index 3d33fba..a757d47 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -574,9 +574,81 @@ void vcpu_show_registers(const struct vcpu *v) _show_registers(&v->arch.cpu_info->guest_cpu_user_regs, &ctxt, 1, v); } -static void show_guest_stack(struct cpu_user_regs *regs) +static void show_guest_stack(struct vcpu *v, struct cpu_user_regs *regs) { - printk("GUEST STACK GOES HERE\n"); + int i; + vaddr_t sp; + paddr_t stack_phys; + void *mapped; + unsigned long *stack, addr; + + switch ( regs->cpsr & PSR_MODE_MASK ) + { + case PSR_MODE_USR: + case PSR_MODE_SYS: +#ifdef CONFIG_ARM_64 + case PSR_MODE_EL0t: +#endif + printk("No stack trace for guest user-mode\n"); + return; + + case PSR_MODE_FIQ: + case PSR_MODE_IRQ: + case PSR_MODE_SVC: + case PSR_MODE_ABT: + case PSR_MODE_UND: + printk("No stack trace for 32-bit guest kernel-mode\n"); + return; + +#ifdef CONFIG_ARM_64 + case PSR_MODE_EL1t: + sp = regs->sp_el0; + break; + case PSR_MODE_EL1h: + sp = regs->sp_el1; + break; +#endif + + case PSR_MODE_HYP: + case PSR_MODE_MON: +#ifdef CONFIG_ARM_64 + case PSR_MODE_EL3h: + case PSR_MODE_EL3t: + case PSR_MODE_EL2h: + case PSR_MODE_EL2t: +#endif + default: + BUG(); + return; + } + + printk("Guest stack trace from sp=%"PRIvaddr":\n ", sp); + + if ( gvirt_to_maddr(sp, &stack_phys) ) + { + printk("Failed to convert stack to physical address\n"); + return; + } + + mapped = map_domain_page(stack_phys >> PAGE_SHIFT); + + stack = mapped + (sp & ~PAGE_MASK); + + for ( i = 0; i < (debug_stack_lines*stack_words_per_line); i++ ) + { + if ( (((long)stack - 1) ^ ((long)(stack + 1) - 1)) & PAGE_SIZE ) + break; + addr = *stack; + if ( (i != 0) && ((i % stack_words_per_line) == 0) ) + printk("\n "); + printk(" %p", _p(addr)); + stack++; + } + if ( i == 0 ) + printk("Stack empty."); + printk("\n"); + unmap_domain_page(mapped); + } #define STACK_BEFORE_EXCEPTION(regs) ((register_t*)(regs)->sp) @@ -659,7 +731,7 @@ void show_stack(struct cpu_user_regs *regs) int i; if ( guest_mode(regs) ) - return show_guest_stack(regs); + return show_guest_stack(current, regs); printk("Xen stack trace from sp=%p:\n ", stack); @@ -701,7 +773,7 @@ void vcpu_show_execution_state(struct vcpu *v) vcpu_show_registers(v); if ( !usr_mode(&v->arch.cpu_info->guest_cpu_user_regs) ) - show_guest_stack(&v->arch.cpu_info->guest_cpu_user_regs); + show_guest_stack(v, &v->arch.cpu_info->guest_cpu_user_regs); vcpu_unpause(v); } -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 08/15] xen: arm: show less words in a line of a stack trace in 64-bit builds
Words are twice as wide so this ensures that a line is still <80 characters. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/traps.c | 9 +++++++-- 1 files changed, 7 insertions(+), 2 deletions(-) diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index a757d47..abbaadd 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -57,10 +57,15 @@ static inline void check_stack_alignment_constraints(void) { #endif } +#ifdef CONFIG_ARM_32 static int debug_stack_lines = 20; -integer_param("debug_stack_lines", debug_stack_lines); - #define stack_words_per_line 8 +#else +static int debug_stack_lines = 40; +#define stack_words_per_line 4 +#endif + +integer_param("debug_stack_lines", debug_stack_lines); void __cpuinit init_traps(void) -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 09/15] xen: arm: Set EL1 register width in HCR_EL2 during context switch.
Signed-off-by: Ian Campbell <ian.campbell@citrix.com> --- v3: Remove ifdef and rely on is_pv32_domain == true on arm32 --- xen/arch/arm/domain.c | 5 +++++ xen/include/asm-arm/processor.h | 1 + 2 files changed, 6 insertions(+), 0 deletions(-) diff --git a/xen/arch/arm/domain.c b/xen/arch/arm/domain.c index 70fbc0d..a409d11 100644 --- a/xen/arch/arm/domain.c +++ b/xen/arch/arm/domain.c @@ -207,6 +207,11 @@ static void ctxt_switch_to(struct vcpu *n) isb(); + if ( is_pv32_domain(n->domain) ) + hcr &= ~HCR_RW; + else + hcr |= HCR_RW; + WRITE_SYSREG(hcr, HCR_EL2); isb(); diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index 8b9b320..9e4995c 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -40,6 +40,7 @@ #define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK) /* HCR Hyp Configuration Register */ +#define HCR_RW (1<<31) /* ARM64 only */ #define HCR_TGE (1<<27) #define HCR_TVM (1<<26) #define HCR_TTLB (1<<25) -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 10/15] xen: arm: handle traps from 64-bit guests
While there observe that we weren''t ever restoring the outer stack frame, even for 32-bit guests when running a 64-bit hypervisor! The outer stack frame "only" contains most of the SPSR registers for 32-bit... We can also remove the logic at "return_to_new_vcpu" which detects returns to hypervisor mode -- seemingly trying to handle hypervisor threads which aren''t an thing which we have. The idle VCPUs do not take this path. This simplifies the return_to_new_vcpu code, we also split it into 32- and 64-bit VCPU paths. There is no need to export return_to_hypervisor (now renamed return_from_trap on arm64) and likewise stop doing so on 32-bit as well. Also tweak the case of some system registers for consistency. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> --- v3: actually restore the outer stack frame! Due to this change I have dropped the previous Acks. --- xen/arch/arm/arm32/entry.S | 4 +- xen/arch/arm/arm64/entry.S | 95 ++++++++++++++++++++++++++++++++----------- xen/arch/arm/domain.c | 6 ++- 3 files changed, 77 insertions(+), 28 deletions(-) diff --git a/xen/arch/arm/arm32/entry.S b/xen/arch/arm/arm32/entry.S index 76814dd..cc0a99f 100644 --- a/xen/arch/arm/arm32/entry.S +++ b/xen/arch/arm/arm32/entry.S @@ -87,7 +87,7 @@ DEFINE_TRAP_ENTRY_NOIRQ(fiq) return_from_trap: mov sp, r11 -ENTRY(return_to_new_vcpu) +ENTRY(return_to_new_vcpu32) ldr r11, [sp, #UREGS_cpsr] and r11, #PSR_MODE_MASK cmp r11, #PSR_MODE_HYP @@ -108,7 +108,7 @@ ENTRY(return_to_guest) RESTORE_ONE_BANKED(R8_fiq); RESTORE_ONE_BANKED(R9_fiq); RESTORE_ONE_BANKED(R10_fiq) RESTORE_ONE_BANKED(R11_fiq); RESTORE_ONE_BANKED(R12_fiq); /* Fall thru */ -ENTRY(return_to_hypervisor) +return_to_hypervisor: cpsid i ldr lr, [sp, #UREGS_lr] ldr r11, [sp, #UREGS_pc] diff --git a/xen/arch/arm/arm64/entry.S b/xen/arch/arm/arm64/entry.S index b5af1e2..9cda8f1 100644 --- a/xen/arch/arm/arm64/entry.S +++ b/xen/arch/arm/arm64/entry.S @@ -26,7 +26,7 @@ lr .req x30 // link register .macro entry_guest, compat add x21, sp, #UREGS_SPSR_el1 - mrs x23, SPSR_EL1 + mrs x23, SPSR_el1 str x23, [x21] .if \compat == 0 /* Aarch64 mode */ @@ -40,24 +40,56 @@ lr .req x30 // link register mrs x23, ELR_el1 stp x22, x23, [x21] - .else /* Aarch32 mode */ + .else /* Aarch32 mode */ add x21, sp, #UREGS_SPSR_fiq - mrs x22, spsr_fiq - mrs x23, spsr_irq + mrs x22, SPSR_fiq + mrs x23, SPSR_irq stp w22, w23, [x21] add x21, sp, #UREGS_SPSR_und - mrs x22, spsr_und - mrs x23, spsr_abt + mrs x22, SPSR_und + mrs x23, SPSR_abt stp w22, w23, [x21] .endif .endm + .macro exit_guest, compat + + add x21, sp, #UREGS_SPSR_el1 + ldr x23, [x21] + msr SPSR_el1, x23 + + .if \compat == 0 /* Aarch64 mode */ + + add x21, sp, #UREGS_SP_el0 + ldr x22, [x21] + msr SP_el0, x22 + + add x21, sp, #UREGS_SP_el1 + ldp x22, x23, [x21] + msr SP_el1, x22 + msr ELR_el1, x23 + + .else /* Aarch32 mode */ + + add x21, sp, #UREGS_SPSR_fiq + ldp w22, w23, [x21] + msr SPSR_fiq, x22 + msr SPSR_irq, x23 + + add x21, sp, #UREGS_SPSR_und + ldp w22, w23, [x21] + msr SPSR_und, x22 + msr SPSR_abt, x23 + + .endif + + .endm /* - * Save state on entry to hypervisor + * Save state on entry to hypervisor, restore on exit */ .macro entry, hyp, compat sub sp, sp, #(UREGS_SPSR_el1 - UREGS_LR) /* CPSR, PC, SP, LR */ @@ -96,6 +128,20 @@ lr .req x30 // link register .endm + .macro exit, hyp, compat + + .if \hyp == 0 /* Guest mode */ + + bl leave_hypervisor_tail /* Disables interrupts on return */ + + exit_guest \compat + + .endif + + b return_from_trap + + .endm + /* * Bad Abort numbers *----------------- @@ -133,21 +179,26 @@ hyp_sync: msr daifclr, #2 mov x0, sp bl do_trap_hypervisor - b return_to_hypervisor + exit hyp=1 hyp_irq: entry hyp=1 mov x0, sp bl do_trap_irq - b return_to_hypervisor + exit hyp=1 guest_sync: entry hyp=0, compat=0 - invalid BAD_SYNC /* No AArch64 guest support yet */ + msr daifclr, #2 + mov x0, sp + bl do_trap_hypervisor + exit hyp=0, compat=0 guest_irq: entry hyp=0, compat=0 - invalid BAD_IRQ /* No AArch64 guest support yet */ + mov x0, sp + bl do_trap_irq + exit hyp=0, compat=0 guest_fiq_invalid: entry hyp=0, compat=0 @@ -162,13 +213,13 @@ guest_sync_compat: msr daifclr, #2 mov x0, sp bl do_trap_hypervisor - b return_to_guest + exit hyp=0, compat=1 guest_irq_compat: entry hyp=0, compat=1 mov x0, sp bl do_trap_irq - b return_to_guest + exit hyp=0, compat=1 guest_fiq_invalid_compat: entry hyp=0, compat=1 @@ -178,18 +229,12 @@ guest_error_invalid_compat: entry hyp=0, compat=1 invalid BAD_ERROR -ENTRY(return_to_new_vcpu) - ldr x21, [sp, #UREGS_CPSR] - and x21, x21, #PSR_MODE_MASK - /* Returning to EL2? */ - cmp x21, #PSR_MODE_EL2t - ccmp x21, #PSR_MODE_EL2h, #0x4, ne - b.eq return_to_hypervisor /* Yes */ - /* Fall thru */ -ENTRY(return_to_guest) - bl leave_hypervisor_tail /* Disables interrupts on return */ - /* Fall thru */ -ENTRY(return_to_hypervisor) +ENTRY(return_to_new_vcpu32) + exit hyp=0, compat=1 +ENTRY(return_to_new_vcpu64) + exit hyp=0, compat=0 + +return_from_trap: msr daifset, #2 /* Mask interrupts */ ldp x21, x22, [sp, #UREGS_PC] // load ELR, SPSR diff --git a/xen/arch/arm/domain.c b/xen/arch/arm/domain.c index a409d11..cf82e21 100644 --- a/xen/arch/arm/domain.c +++ b/xen/arch/arm/domain.c @@ -250,9 +250,13 @@ static void continue_new_vcpu(struct vcpu *prev) if ( is_idle_vcpu(current) ) reset_stack_and_jump(idle_loop); + else if is_pv32_domain(current->domain) + /* check_wakeup_from_wait(); */ + reset_stack_and_jump(return_to_new_vcpu32); else /* check_wakeup_from_wait(); */ - reset_stack_and_jump(return_to_new_vcpu); + reset_stack_and_jump(return_to_new_vcpu64); + } void context_switch(struct vcpu *prev, struct vcpu *next) -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 11/15] xen: arm: handle hypercalls from 64-bit guests
Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/traps.c | 70 +++++++++++++++++++++++++++++---------- xen/include/asm-arm/processor.h | 7 +++- 2 files changed, 57 insertions(+), 20 deletions(-) diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index abbaadd..32de87f 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -796,8 +796,8 @@ unsigned long do_arch_0(unsigned int cmd, unsigned long long value) return 0; } -typedef unsigned long (*arm_hypercall_fn_t)( - unsigned int, unsigned int, unsigned int, unsigned int, unsigned int); +typedef register_t (*arm_hypercall_fn_t)( + register_t, register_t, register_t, register_t, register_t); typedef struct { arm_hypercall_fn_t fn; @@ -853,6 +853,7 @@ static arm_psci_t arm_psci_table[] = { PSCI(cpu_on, 2), }; +#ifndef NDEBUG static void do_debug_trap(struct cpu_user_regs *regs, unsigned int code) { register_t *r; @@ -881,6 +882,7 @@ static void do_debug_trap(struct cpu_user_regs *regs, unsigned int code) break; } } +#endif static void do_trap_psci(struct cpu_user_regs *regs) { @@ -901,30 +903,49 @@ static void do_trap_psci(struct cpu_user_regs *regs) regs->r0 = psci_call(regs->r1, regs->r2); } -static void do_trap_hypercall(struct cpu_user_regs *regs, unsigned long iss) +#ifdef CONFIG_ARM_64 +#define HYPERCALL_RESULT_REG(r) (r)->x0 +#define HYPERCALL_ARG1(r) (r)->x0 +#define HYPERCALL_ARG2(r) (r)->x1 +#define HYPERCALL_ARG3(r) (r)->x2 +#define HYPERCALL_ARG4(r) (r)->x3 +#define HYPERCALL_ARG5(r) (r)->x4 +#define HYPERCALL_ARGS(r) (r)->x0, (r)->x1, (r)->x2, (r)->x3, (r)->x4 +#else +#define HYPERCALL_RESULT_REG(r) (r)->r0 +#define HYPERCALL_ARG1(r) (r)->r0 +#define HYPERCALL_ARG2(r) (r)->r1 +#define HYPERCALL_ARG3(r) (r)->r2 +#define HYPERCALL_ARG4(r) (r)->r3 +#define HYPERCALL_ARG5(r) (r)->r4 +#define HYPERCALL_ARGS(r) (r)->r0, (r)->r1, (r)->r2, (r)->r3, (r)->r4 +#endif + +static void do_trap_hypercall(struct cpu_user_regs *regs, register_t *nr, + unsigned long iss) { arm_hypercall_fn_t call = NULL; #ifndef NDEBUG - uint32_t orig_pc = regs->pc; + register_t orig_pc = regs->pc; #endif if ( iss != XEN_HYPERCALL_TAG ) domain_crash_synchronous(); - if ( regs->r12 >= ARRAY_SIZE(arm_hypercall_table) ) + if ( *nr >= ARRAY_SIZE(arm_hypercall_table) ) { - regs->r0 = -ENOSYS; + HYPERCALL_RESULT_REG(regs) = -ENOSYS; return; } - call = arm_hypercall_table[regs->r12].fn; + call = arm_hypercall_table[*nr].fn; if ( call == NULL ) { - regs->r0 = -ENOSYS; + HYPERCALL_RESULT_REG(regs) = -ENOSYS; return; } - regs->r0 = call(regs->r0, regs->r1, regs->r2, regs->r3, regs->r4); + HYPERCALL_RESULT_REG(regs) = call(HYPERCALL_ARGS(regs)); #ifndef NDEBUG /* @@ -933,16 +954,16 @@ static void do_trap_hypercall(struct cpu_user_regs *regs, unsigned long iss) */ if ( orig_pc == regs->pc ) { - switch ( arm_hypercall_table[regs->r12].nr_args ) { - case 5: regs->r4 = 0xDEADBEEF; - case 4: regs->r3 = 0xDEADBEEF; - case 3: regs->r2 = 0xDEADBEEF; - case 2: regs->r1 = 0xDEADBEEF; - case 1: /* Don''t clobber r0 -- it''s the return value */ + switch ( arm_hypercall_table[*nr].nr_args ) { + case 5: HYPERCALL_ARG5(regs) = 0xDEADBEEF; + case 4: HYPERCALL_ARG4(regs) = 0xDEADBEEF; + case 3: HYPERCALL_ARG3(regs) = 0xDEADBEEF; + case 2: HYPERCALL_ARG2(regs) = 0xDEADBEEF; + case 1: /* Don''t clobber x0/r0 -- it''s the return value */ break; default: BUG(); } - regs->r12 = 0xDEADBEEF; + *nr = 0xDEADBEEF; } #endif } @@ -1221,13 +1242,26 @@ asmlinkage void do_trap_hypervisor(struct cpu_user_regs *regs) */ inject_undef_exception(regs, regs->pc32); break; - case HSR_EC_HVC: + case HSR_EC_HVC32: +#ifndef NDEBUG if ( (hsr.iss & 0xff00) == 0xff00 ) return do_debug_trap(regs, hsr.iss & 0x00ff); +#endif if ( hsr.iss == 0 ) return do_trap_psci(regs); - do_trap_hypercall(regs, hsr.iss); + do_trap_hypercall(regs, (register_t *)®s->r12, hsr.iss); break; +#ifdef CONFIG_ARM_64 + case HSR_EC_HVC64: +#ifndef NDEBUG + if ( (hsr.iss & 0xff00) == 0xff00 ) + return do_debug_trap(regs, hsr.iss & 0x00ff); +#endif + if ( hsr.iss == 0 ) + return do_trap_psci(regs); + do_trap_hypercall(regs, ®s->x16, hsr.iss); + break; +#endif case HSR_EC_DATA_ABORT_GUEST: do_trap_data_abort_guest(regs, hsr.dabt); break; diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index 9e4995c..6296a32 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -83,9 +83,12 @@ #define HSR_EC_JAZELLE 0x09 #define HSR_EC_BXJ 0x0a #define HSR_EC_CP14_64 0x0c -#define HSR_EC_SVC 0x11 -#define HSR_EC_HVC 0x12 +#define HSR_EC_SVC32 0x11 +#define HSR_EC_HVC32 0x12 #define HSR_EC_SMC 0x13 +#ifdef CONFIG_ARM_64 +#define HSR_EC_HVC64 0x16 +#endif #define HSR_EC_INSTR_ABORT_GUEST 0x20 #define HSR_EC_INSTR_ABORT_HYP 0x21 #define HSR_EC_DATA_ABORT_GUEST 0x24 -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 12/15] xen: arm: handle 64-bit system register access traps.
Wire up the vtimer handling to it. Use a simplified version of the 32-bit cp-register macros to have convenient decoding of HSR register values. (simplified because we don''t need them for passing to the assembler on 64-bit) Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/arch/arm/traps.c | 39 ++++++++++++ xen/arch/arm/vtimer.c | 130 ++++++++++++++++++++++++++------------- xen/include/asm-arm/processor.h | 32 ++++++++++ xen/include/asm-arm/sysregs.h | 56 +++++++++++++++++ 4 files changed, 213 insertions(+), 44 deletions(-) create mode 100644 xen/include/asm-arm/sysregs.h diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index 32de87f..ac70984 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -1098,6 +1098,39 @@ static void do_cp15_64(struct cpu_user_regs *regs, } +#ifdef CONFIG_ARM_64 +static void do_sysreg(struct cpu_user_regs *regs, + union hsr hsr) +{ + struct hsr_sysreg sysreg = hsr.sysreg; + + switch ( hsr.bits & HSR_SYSREG_REGS_MASK ) + { + case CNTP_CTL_EL0: + case CNTP_TVAL_EL0: + if ( !vtimer_emulate(regs, hsr) ) + { + dprintk(XENLOG_ERR, + "failed emulation of 64-bit vtimer sysreg access\n"); + domain_crash_synchronous(); + } + break; + default: + printk("%s %d, %d, c%d, c%d, %d %s x%d @ 0x%"PRIregister"\n", + sysreg.read ? "mrs" : "msr", + sysreg.op0, sysreg.op1, + sysreg.crn, sysreg.crm, + sysreg.op2, + sysreg.read ? "=>" : "<=", + sysreg.reg, regs->pc); + panic("unhandled 64-bit sysreg access %#x\n", + hsr.bits & HSR_SYSREG_REGS_MASK); + } + + regs->pc += 4; +} +#endif + void dump_guest_s1_walk(struct domain *d, vaddr_t addr) { uint32_t ttbcr = READ_SYSREG32(TCR_EL1); @@ -1261,7 +1294,13 @@ asmlinkage void do_trap_hypervisor(struct cpu_user_regs *regs) return do_trap_psci(regs); do_trap_hypercall(regs, ®s->x16, hsr.iss); break; + case HSR_EC_SYSREG: + if ( is_pv32_domain(current->domain) ) + goto bad_trap; + do_sysreg(regs, hsr); + break; #endif + case HSR_EC_DATA_ABORT_GUEST: do_trap_data_abort_guest(regs, hsr.dabt); break; diff --git a/xen/arch/arm/vtimer.c b/xen/arch/arm/vtimer.c index aee762a..d58a630 100644 --- a/xen/arch/arm/vtimer.c +++ b/xen/arch/arm/vtimer.c @@ -112,57 +112,67 @@ int virt_timer_restore(struct vcpu *v) return 0; } -static int vtimer_emulate_32(struct cpu_user_regs *regs, union hsr hsr) +static void vtimer_cntp_ctl(struct cpu_user_regs *regs, uint32_t *r, int read) { struct vcpu *v = current; - struct hsr_cp32 cp32 = hsr.cp32; - uint32_t *r = (uint32_t *)select_user_reg(regs, cp32.reg); - s_time_t now; - - switch ( hsr.bits & HSR_CP32_REGS_MASK ) + if ( read ) { - case HSR_CPREG32(CNTP_CTL): - if ( cp32.read ) + *r = v->arch.phys_timer.ctl; + } + else + { + uint32_t ctl = *r & ~CNTx_CTL_PENDING; + if ( ctl & CNTx_CTL_ENABLE ) + ctl |= v->arch.phys_timer.ctl & CNTx_CTL_PENDING; + v->arch.phys_timer.ctl = ctl; + + if ( v->arch.phys_timer.ctl & CNTx_CTL_ENABLE ) { - *r = v->arch.phys_timer.ctl; + set_timer(&v->arch.phys_timer.timer, + v->arch.phys_timer.cval + v->domain->arch.phys_timer_base.offset); } else + stop_timer(&v->arch.phys_timer.timer); + } +} + +static void vtimer_cntp_tval(struct cpu_user_regs *regs, uint32_t *r, int read) +{ + struct vcpu *v = current; + s_time_t now; + + now = NOW() - v->domain->arch.phys_timer_base.offset; + + if ( read ) + { + *r = (uint32_t)(ns_to_ticks(v->arch.phys_timer.cval - now) & 0xffffffffull); + } + else + { + v->arch.phys_timer.cval = now + ticks_to_ns(*r); + if ( v->arch.phys_timer.ctl & CNTx_CTL_ENABLE ) { - uint32_t ctl = *r & ~CNTx_CTL_PENDING; - if ( ctl & CNTx_CTL_ENABLE ) - ctl |= v->arch.phys_timer.ctl & CNTx_CTL_PENDING; - v->arch.phys_timer.ctl = ctl; - - if ( v->arch.phys_timer.ctl & CNTx_CTL_ENABLE ) - { - set_timer(&v->arch.phys_timer.timer, - v->arch.phys_timer.cval + - v->domain->arch.phys_timer_base.offset); - } - else - stop_timer(&v->arch.phys_timer.timer); + v->arch.phys_timer.ctl &= ~CNTx_CTL_PENDING; + set_timer(&v->arch.phys_timer.timer, + v->arch.phys_timer.cval + + v->domain->arch.phys_timer_base.offset); } + } +} +static int vtimer_emulate_cp32(struct cpu_user_regs *regs, union hsr hsr) +{ + struct hsr_cp32 cp32 = hsr.cp32; + uint32_t *r = (uint32_t *)select_user_reg(regs, cp32.reg); + + switch ( hsr.bits & HSR_CP32_REGS_MASK ) + { + case HSR_CPREG32(CNTP_CTL): + vtimer_cntp_ctl(regs, r, cp32.read); return 1; case HSR_CPREG32(CNTP_TVAL): - now = NOW() - v->domain->arch.phys_timer_base.offset; - if ( cp32.read ) - { - *r = (uint32_t)(ns_to_ticks(v->arch.phys_timer.cval - now) & 0xffffffffull); - } - else - { - v->arch.phys_timer.cval = now + ticks_to_ns(*r); - if ( v->arch.phys_timer.ctl & CNTx_CTL_ENABLE ) - { - v->arch.phys_timer.ctl &= ~CNTx_CTL_PENDING; - set_timer(&v->arch.phys_timer.timer, - v->arch.phys_timer.cval + - v->domain->arch.phys_timer_base.offset); - } - } - + vtimer_cntp_tval(regs, r, cp32.read); return 1; default: @@ -170,7 +180,7 @@ static int vtimer_emulate_32(struct cpu_user_regs *regs, union hsr hsr) } } -static int vtimer_emulate_64(struct cpu_user_regs *regs, union hsr hsr) +static int vtimer_emulate_cp64(struct cpu_user_regs *regs, union hsr hsr) { struct vcpu *v = current; struct hsr_cp64 cp64 = hsr.cp64; @@ -201,16 +211,48 @@ static int vtimer_emulate_64(struct cpu_user_regs *regs, union hsr hsr) } } +#ifdef CONFIG_ARM_64 +static int vtimer_emulate_sysreg(struct cpu_user_regs *regs, union hsr hsr) +{ + struct hsr_sysreg sysreg = hsr.sysreg; + register_t *x = select_user_reg(regs, sysreg.reg); + uint32_t r = (uint32_t)*x; + + switch ( hsr.bits & HSR_SYSREG_REGS_MASK ) + { + case CNTP_CTL_EL0: + vtimer_cntp_ctl(regs, &r, sysreg.read); + *x = r; + return 1; + case CNTP_TVAL_EL0: + vtimer_cntp_tval(regs, &r, sysreg.read); + *x = r; + return 1; + default: + return 0; + } + +} +#endif + int vtimer_emulate(struct cpu_user_regs *regs, union hsr hsr) { - if ( !is_pv32_domain(current->domain) ) - return -EINVAL; switch (hsr.ec) { case HSR_EC_CP15_32: - return vtimer_emulate_32(regs, hsr); + if ( !is_pv32_domain(current->domain) ) + return 0; + return vtimer_emulate_cp32(regs, hsr); case HSR_EC_CP15_64: - return vtimer_emulate_64(regs, hsr); + if ( !is_pv32_domain(current->domain) ) + return 0; + return vtimer_emulate_cp64(regs, hsr); +#ifdef CONFIG_ARM_64 + case HSR_EC_SYSREG: + if ( is_pv32_domain(current->domain) ) + return 0; + return vtimer_emulate_sysreg(regs, hsr); +#endif default: return 0; } diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index 6296a32..8a002a9 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -2,6 +2,7 @@ #define __ASM_ARM_PROCESSOR_H #include <asm/cpregs.h> +#include <asm/sysregs.h> /* TTBCR Translation Table Base Control Register */ #define TTBCR_EAE 0x80000000 @@ -88,6 +89,7 @@ #define HSR_EC_SMC 0x13 #ifdef CONFIG_ARM_64 #define HSR_EC_HVC64 0x16 +#define HSR_EC_SYSREG 0x18 #endif #define HSR_EC_INSTR_ABORT_GUEST 0x20 #define HSR_EC_INSTR_ABORT_HYP 0x21 @@ -247,6 +249,21 @@ union hsr { unsigned long ec:6; /* Exception Class */ } cp64; /* HSR_EC_CP15_64, HSR_EC_CP14_64 */ +#ifdef CONFIG_ARM_64 + struct hsr_sysreg { + unsigned long read:1; /* Direction */ + unsigned long crm:4; /* CRm */ + unsigned long reg:5; /* Rt */ + unsigned long crn:4; /* CRn */ + unsigned long op1:3; /* Op1 */ + unsigned long op2:3; /* Op2 */ + unsigned long op0:2; /* Op0 */ + unsigned long res0:3; + unsigned long len:1; /* Instruction length */ + unsigned long ec:6; + } sysreg; /* HSR_EC_SYSREG */ +#endif + struct hsr_dabt { unsigned long dfsc:6; /* Data Fault Status Code */ unsigned long write:1; /* Write / not Read */ @@ -289,6 +306,21 @@ union hsr { #define HSR_CP64_CRM_SHIFT (1) #define HSR_CP64_REGS_MASK (HSR_CP64_OP1_MASK|HSR_CP64_CRM_MASK) +/* HSR.EC == HSR_SYSREG */ +#define HSR_SYSREG_OP0_MASK (0x00300000) +#define HSR_SYSREG_OP0_SHIFT (20) +#define HSR_SYSREG_OP1_MASK (0x0001c000) +#define HSR_SYSREG_OP1_SHIFT (14) +#define HSR_SYSREG_CRN_MASK (0x00003800) +#define HSR_SYSREG_CRN_SHIFT (10) +#define HSR_SYSREG_CRM_MASK (0x0000001e) +#define HSR_SYSREG_CRM_SHIFT (1) +#define HSR_SYSREG_OP2_MASK (0x000e0000) +#define HSR_SYSREG_OP2_SHIFT (17) +#define HSR_SYSREG_REGS_MASK (HSR_SYSREG_OP0_MASK|HSR_SYSREG_OP1_MASK|\ + HSR_SYSREG_CRN_MASK|HSR_SYSREG_CRM_MASK|\ + HSR_SYSREG_OP2_MASK) + /* Physical Address Register */ #define PAR_F (1<<0) diff --git a/xen/include/asm-arm/sysregs.h b/xen/include/asm-arm/sysregs.h new file mode 100644 index 0000000..9c64777 --- /dev/null +++ b/xen/include/asm-arm/sysregs.h @@ -0,0 +1,56 @@ +#ifndef __ASM_ARM_SYSREGS_H +#define __ASM_ARM_SYSREGS_H + +#ifdef CONFIG_ARM_64 + +#include <xen/stringify.h> + +/* AArch 64 System Register Encodings */ +#define __HSR_SYSREG_c0 0 +#define __HSR_SYSREG_c1 1 +#define __HSR_SYSREG_c2 2 +#define __HSR_SYSREG_c3 3 +#define __HSR_SYSREG_c4 4 +#define __HSR_SYSREG_c5 5 +#define __HSR_SYSREG_c6 6 +#define __HSR_SYSREG_c7 7 +#define __HSR_SYSREG_c8 8 +#define __HSR_SYSREG_c9 9 +#define __HSR_SYSREG_c10 10 +#define __HSR_SYSREG_c11 11 +#define __HSR_SYSREG_c12 12 +#define __HSR_SYSREG_c13 13 +#define __HSR_SYSREG_c14 14 +#define __HSR_SYSREG_c15 15 + +#define __HSR_SYSREG_0 0 +#define __HSR_SYSREG_1 1 +#define __HSR_SYSREG_2 2 +#define __HSR_SYSREG_3 3 +#define __HSR_SYSREG_4 4 +#define __HSR_SYSREG_5 5 +#define __HSR_SYSREG_6 6 +#define __HSR_SYSREG_7 7 + +/* These are used to decode traps with HSR.EC==HSR_EC_SYSREG */ +#define HSR_SYSREG(op0,op1,crn,crm,op2) \ + ((__HSR_SYSREG_##op0) << HSR_SYSREG_OP0_SHIFT) | \ + ((__HSR_SYSREG_##op1) << HSR_SYSREG_OP1_SHIFT) | \ + ((__HSR_SYSREG_##crn) << HSR_SYSREG_CRN_SHIFT) | \ + ((__HSR_SYSREG_##crm) << HSR_SYSREG_CRM_SHIFT) | \ + ((__HSR_SYSREG_##op2) << HSR_SYSREG_OP2_SHIFT) + +#define CNTP_CTL_EL0 HSR_SYSREG(3,3,c14,c2,1) +#define CNTP_TVAL_EL0 HSR_SYSREG(3,3,c14,c2,0) +#endif + +#endif + +/* + * Local variables: + * mode: C + * c-set-style: "BSD" + * c-basic-offset: 4 + * indent-tabs-mode: nil + * End: + */ -- 1.7.2.5
Signed-off-by: Ian Campbell <ian.campbell@citrix.com> Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> --- xen/include/asm-arm/processor.h | 8 ++++---- 1 files changed, 4 insertions(+), 4 deletions(-) diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index 8a002a9..f84ad99 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -242,11 +242,11 @@ union hsr { unsigned long reg1:5; /* Rt1 */ unsigned long reg2:5; /* Rt2 */ unsigned long sbzp2:1; - unsigned long op1:4; /* Op1 */ - unsigned long cc:4; /* Condition Code */ + unsigned long op1:4; /* Op1 */ + unsigned long cc:4; /* Condition Code */ unsigned long ccvalid:1;/* CC Valid */ - unsigned long len:1; /* Instruction length */ - unsigned long ec:6; /* Exception Class */ + unsigned long len:1; /* Instruction length */ + unsigned long ec:6; /* Exception Class */ } cp64; /* HSR_EC_CP15_64, HSR_EC_CP14_64 */ #ifdef CONFIG_ARM_64 -- 1.7.2.5
I was mostly interested in commenting the RW bit which is Register Width and not Read/Write as a reader might initially expect. Thought I might as well do the others... Signed-off-by: Ian Campbell <ian.campbell@citrix.com> --- xen/include/asm-arm/processor.h | 56 +++++++++++++++++++------------------- 1 files changed, 28 insertions(+), 28 deletions(-) diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index f84ad99..c9d406c 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -41,38 +41,38 @@ #define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK) /* HCR Hyp Configuration Register */ -#define HCR_RW (1<<31) /* ARM64 only */ -#define HCR_TGE (1<<27) -#define HCR_TVM (1<<26) -#define HCR_TTLB (1<<25) -#define HCR_TPU (1<<24) -#define HCR_TPC (1<<23) -#define HCR_TSW (1<<22) -#define HCR_TAC (1<<21) -#define HCR_TIDCP (1<<20) -#define HCR_TSC (1<<19) -#define HCR_TID3 (1<<18) -#define HCR_TID2 (1<<17) -#define HCR_TID1 (1<<16) -#define HCR_TID0 (1<<15) -#define HCR_TWE (1<<14) -#define HCR_TWI (1<<13) -#define HCR_DC (1<<12) -#define HCR_BSU_MASK (3<<10) +#define HCR_RW (1<<31) /* Register Width, ARM64 only */ +#define HCR_TGE (1<<27) /* Trap General Exceptions */ +#define HCR_TVM (1<<26) /* Trap Virtual Memory Controls */ +#define HCR_TTLB (1<<25) /* Trap TLB Maintenance Operations */ +#define HCR_TPU (1<<24) /* Trap Cache Maintenance Operations to PoU */ +#define HCR_TPC (1<<23) /* Trap Cache Maintenance Operations to PoC */ +#define HCR_TSW (1<<22) /* Trap Set/Way Cache Maintenance Operations */ +#define HCR_TAC (1<<21) /* Trap ACTLR Accesses */ +#define HCR_TIDCP (1<<20) /* Trap lockdown */ +#define HCR_TSC (1<<19) /* Trap SMC instruction */ +#define HCR_TID3 (1<<18) /* Trap ID Register Group 3 */ +#define HCR_TID2 (1<<17) /* Trap ID Register Group 2 */ +#define HCR_TID1 (1<<16) /* Trap ID Register Group 1 */ +#define HCR_TID0 (1<<15) /* Trap ID Register Group 0 */ +#define HCR_TWE (1<<14) /* Trap WFE instruction */ +#define HCR_TWI (1<<13) /* Trap WFI instruction */ +#define HCR_DC (1<<12) /* Default cacheable */ +#define HCR_BSU_MASK (3<<10) /* Barrier Shareability Upgrade */ #define HCR_BSU_NONE (0<<10) #define HCR_BSU_INNER (1<<10) #define HCR_BSU_OUTER (2<<10) #define HCR_BSU_FULL (3<<10) -#define HCR_FB (1<<9) -#define HCR_VA (1<<8) -#define HCR_VI (1<<7) -#define HCR_VF (1<<6) -#define HCR_AMO (1<<5) -#define HCR_IMO (1<<4) -#define HCR_FMO (1<<3) -#define HCR_PTW (1<<2) -#define HCR_SWIO (1<<1) -#define HCR_VM (1<<0) +#define HCR_FB (1<<9) /* Force Broadcast of Cache/BP/TLB operations */ +#define HCR_VA (1<<8) /* Virtual Asynchronous Abort */ +#define HCR_VI (1<<7) /* Virtual IRQ */ +#define HCR_VF (1<<6) /* Virtual FIQ */ +#define HCR_AMO (1<<5) /* Override CPSR.A */ +#define HCR_IMO (1<<4) /* Override CPSR.I */ +#define HCR_FMO (1<<3) /* Override CPSR.F */ +#define HCR_PTW (1<<2) /* Protected Walk */ +#define HCR_SWIO (1<<1) /* Set/Way Invalidation Override */ +#define HCR_VM (1<<0) /* Virtual MMU Enable */ #define HSR_EC_WFI_WFE 0x01 #define HSR_EC_CP15_32 0x03 -- 1.7.2.5
Ian Campbell
2013-Jul-19 11:44 UTC
[PATCH v3 15/15] xen: arm: Handle SMC from 64-bit guests
Similarly to arm32 guests handle it by injecting an undefined instruction trap. Signed-off-by: Ian Campbell <ian.campbell@citrix.com> --- xen/arch/arm/traps.c | 40 +++++++++++++++++++++++++++++++------- xen/include/asm-arm/processor.h | 14 ++++++++++++- xen/include/public/arch-arm.h | 3 ++ 3 files changed, 48 insertions(+), 9 deletions(-) diff --git a/xen/arch/arm/traps.c b/xen/arch/arm/traps.c index ac70984..00a2a73 100644 --- a/xen/arch/arm/traps.c +++ b/xen/arch/arm/traps.c @@ -284,25 +284,49 @@ static vaddr_t exception_handler(vaddr_t offset) * pipeline adjustments). See TakeUndefInstrException pseudocode in * ARM. */ -static void inject_undef_exception(struct cpu_user_regs *regs, - register_t preferred_return) +static void inject_undef32_exception(struct cpu_user_regs *regs) { uint32_t spsr = regs->cpsr; int is_thumb = (regs->cpsr & PSR_THUMB); /* Saved PC points to the instruction past the faulting instruction. */ uint32_t return_offset = is_thumb ? 2 : 4; + BUG_ON( !is_pv32_domain(current->domain) ); + /* Update processor mode */ cpsr_switch_mode(regs, PSR_MODE_UND); /* Update banked registers */ regs->spsr_und = spsr; - regs->lr_und = preferred_return + return_offset; + regs->lr_und = regs->pc32 + return_offset; /* Branch to exception vector */ regs->pc32 = exception_handler(VECTOR32_UND); } +#ifdef CONFIG_ARM_64 +/* Inject an undefined exception into a 64 bit guest */ +static void inject_undef64_exception(struct cpu_user_regs *regs, int instr_len) +{ + union hsr esr = { + .iss = 0, + .len = instr_len, + .ec = HSR_EC_UNKNOWN, + }; + + BUG_ON( is_pv32_domain(current->domain) ); + + regs->spsr_el1 = regs->cpsr; + regs->elr_el1 = regs->pc; + + regs->cpsr = PSR_MODE_EL1h | PSR_ABT_MASK | PSR_FIQ_MASK | \ + PSR_IRQ_MASK | PSR_DBG_MASK; + regs->pc = READ_SYSREG(VBAR_EL1) + VECTOR64_CURRENT_SPx_SYNC; + + WRITE_SYSREG32(esr.bits, ESR_EL1); +} +#endif + struct reg_ctxt { /* Guest-side state */ uint32_t sctlr_el1, tcr_el1; @@ -1269,11 +1293,8 @@ asmlinkage void do_trap_hypervisor(struct cpu_user_regs *regs) goto bad_trap; do_cp15_64(regs, hsr); break; - case HSR_EC_SMC: - /* PC32 already contains the preferred exception return - * address, so no need to adjust here. - */ - inject_undef_exception(regs, regs->pc32); + case HSR_EC_SMC32: + inject_undef32_exception(regs); break; case HSR_EC_HVC32: #ifndef NDEBUG @@ -1294,6 +1315,9 @@ asmlinkage void do_trap_hypervisor(struct cpu_user_regs *regs) return do_trap_psci(regs); do_trap_hypercall(regs, ®s->x16, hsr.iss); break; + case HSR_EC_SMC64: + inject_undef64_exception(regs, hsr.len); + break; case HSR_EC_SYSREG: if ( is_pv32_domain(current->domain) ) goto bad_trap; diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h index c9d406c..4dd82bd 100644 --- a/xen/include/asm-arm/processor.h +++ b/xen/include/asm-arm/processor.h @@ -74,6 +74,7 @@ #define HCR_SWIO (1<<1) /* Set/Way Invalidation Override */ #define HCR_VM (1<<0) /* Virtual MMU Enable */ +#define HSR_EC_UNKNOWN 0x00 #define HSR_EC_WFI_WFE 0x01 #define HSR_EC_CP15_32 0x03 #define HSR_EC_CP15_64 0x04 @@ -86,9 +87,10 @@ #define HSR_EC_CP14_64 0x0c #define HSR_EC_SVC32 0x11 #define HSR_EC_HVC32 0x12 -#define HSR_EC_SMC 0x13 +#define HSR_EC_SMC32 0x13 #ifdef CONFIG_ARM_64 #define HSR_EC_HVC64 0x16 +#define HSR_EC_SMC64 0x17 #define HSR_EC_SYSREG 0x18 #endif #define HSR_EC_INSTR_ABORT_GUEST 0x20 @@ -379,11 +381,21 @@ union hsr { #define CNTx_CTL_PENDING (1u<<2) /* IRQ pending */ /* Exception Vector offsets */ +/* ... ARM32 */ #define VECTOR32_RST 0 #define VECTOR32_UND 4 #define VECTOR32_SVC 8 #define VECTOR32_PABT 12 #define VECTOR32_DABT 16 +/* ... ARM64 */ +#define VECTOR64_CURRENT_SP0_SYNC 0x000 +#define VECTOR64_CURRENT_SP0_IRQ 0x080 +#define VECTOR64_CURRENT_SP0_FIQ 0x100 +#define VECTOR64_CURRENT_SP0_ERROR 0x180 +#define VECTOR64_CURRENT_SPx_SYNC 0x200 +#define VECTOR64_CURRENT_SPx_IRQ 0x280 +#define VECTOR64_CURRENT_SPx_FIQ 0x300 +#define VECTOR64_CURRENT_SPx_ERROR 0x380 #if defined(CONFIG_ARM_32) # include <asm/arm32/processor.h> diff --git a/xen/include/public/arch-arm.h b/xen/include/public/arch-arm.h index cea12b2..cbd53a9 100644 --- a/xen/include/public/arch-arm.h +++ b/xen/include/public/arch-arm.h @@ -234,6 +234,9 @@ typedef uint64_t xen_callback_t; #define PSR_IRQ_MASK (1<<7) /* Interrupt mask */ #define PSR_ABT_MASK (1<<8) /* Asynchronous Abort mask */ #define PSR_BIG_ENDIAN (1<<9) /* Big Endian Mode */ +#ifdef __aarch64__ /* For Aarch64 bit 9 is repurposed. */ +#define PSR_DBG_MASK (1<<9) +#endif #define PSR_IT_MASK (0x0600fc00) /* Thumb If-Then Mask */ #define PSR_JAZELLE (1<<24) /* Jazelle Mode */ -- 1.7.2.5
Konrad Rzeszutek Wilk
2013-Jul-19 12:59 UTC
Re: [PATCH v3 04/15] xen: arm: support building a 64-bit dom0 domain
On Fri, Jul 19, 2013 at 12:44:33PM +0100, Ian Campbell wrote:> Signed-off-by: Ian Campbell <ian.campbell@citrix.com> > Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> > --- > v3: rebased. as part of this move PSR_GUEST_INIT out of the public interface > --- > xen/arch/arm/domain_build.c | 10 +++++++--- > xen/include/asm-arm/processor.h | 2 ++ > xen/include/public/arch-arm.h | 2 -- > 3 files changed, 9 insertions(+), 5 deletions(-) > > diff --git a/xen/arch/arm/domain_build.c b/xen/arch/arm/domain_build.c > index 5e486c4..7f3035c 100644 > --- a/xen/arch/arm/domain_build.c > +++ b/xen/arch/arm/domain_build.c > @@ -593,9 +593,7 @@ int construct_dom0(struct domain *d) > > memset(regs, 0, sizeof(*regs)); > > - regs->pc = (uint32_t)kinfo.entry; > - > - regs->cpsr = PSR_GUEST_INIT; > + regs->pc = (register_t)kinfo.entry; > > #ifdef CONFIG_ARM_64 > d->arch.type = kinfo.type; > @@ -603,6 +601,11 @@ int construct_dom0(struct domain *d) > > if ( is_pv32_domain(d) ) > { > + regs->cpsr = PSR_GUEST_INIT|PSR_MODE_SVC; > + > + /* Pretend to be a Cortex A15 */Um, should there by a TODO for that?> + d->arch.vpidr = 0x410fc0f0; > + > /* FROM LINUX head.S > * > * Kernel startup entry point. > @@ -621,6 +624,7 @@ int construct_dom0(struct domain *d) > #ifdef CONFIG_ARM_64 > else > { > + regs->cpsr = PSR_GUEST_INIT|PSR_MODE_EL1h; > /* From linux/Documentation/arm64/booting.txt */ > regs->x0 = kinfo.dtb_paddr; > regs->x1 = 0; /* Reserved for future use */ > diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h > index 263bd03..8b9b320 100644 > --- a/xen/include/asm-arm/processor.h > +++ b/xen/include/asm-arm/processor.h > @@ -37,6 +37,8 @@ > #define SCTLR_BASE 0x00c50078 > #define HSCTLR_BASE 0x30c51878 > > +#define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK) > + > /* HCR Hyp Configuration Register */ > #define HCR_TGE (1<<27) > #define HCR_TVM (1<<26) > diff --git a/xen/include/public/arch-arm.h b/xen/include/public/arch-arm.h > index 8aa62d3..cea12b2 100644 > --- a/xen/include/public/arch-arm.h > +++ b/xen/include/public/arch-arm.h > @@ -237,8 +237,6 @@ typedef uint64_t xen_callback_t; > #define PSR_IT_MASK (0x0600fc00) /* Thumb If-Then Mask */ > #define PSR_JAZELLE (1<<24) /* Jazelle Mode */ > > -#define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK|PSR_MODE_SVC) > - > #endif /* __XEN_PUBLIC_ARCH_ARM_H__ */ > > /* > -- > 1.7.2.5 > > > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xen.org > http://lists.xen.org/xen-devel
Julien Grall
2013-Jul-19 13:30 UTC
Re: [PATCH v3 03/15] xen: arm: support for loading 64-bit zImage dom0
On 07/19/2013 12:44 PM, Ian Campbell wrote:> This is defined in linux/Documentation/arm64/booting.txt. > > Signed-off-by: Ian Campbell <ian.campbell@citrix.com> > Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com> > --- > xen/arch/arm/kernel.c | 80 +++++++++++++++++++++++++++++++++++++++++++++--- > 1 files changed, 75 insertions(+), 5 deletions(-) > > diff --git a/xen/arch/arm/kernel.c b/xen/arch/arm/kernel.c > index 641b1f0..f6ff294 100644 > --- a/xen/arch/arm/kernel.c > +++ b/xen/arch/arm/kernel.c > @@ -27,6 +27,8 @@ > > #define ZIMAGE32_MAGIC 0x016f2818 > > +#define ZIMAGE64_MAGIC 0x14000008 > + > struct minimal_dtb_header { > uint32_t magic; > uint32_t total_size; > @@ -115,6 +117,57 @@ static void kernel_zimage_load(struct kernel_info *info) > } > } > > +#ifdef CONFIG_ARM_64 > +/* > + * Check if the image is a 64-bit zImage and setup kernel_info > + */ > +static int kernel_try_zimage64_prepare(struct kernel_info *info, > + paddr_t addr, paddr_t size) > +{ > + /* linux/Documentation/arm64/booting.txt */ > + struct { > + uint32_t magic; > + uint32_t res0; > + uint64_t text_offset; /* Image load offset */ > + uint64_t res1; > + uint64_t res2; > + } zimage; > + uint64_t start, end; > + > + if ( size < sizeof(zimage) ) > + return -EINVAL; > + > + copy_from_paddr(&zimage, addr, sizeof(zimage), DEV_SHARED); > + > + if (zimage.magic != ZIMAGE64_MAGIC) > + return -EINVAL; > + > + /* Currently there is no length in the header, so just use the size */ > + start = 0; > + end = size; > + > + /* > + * Given the above this check is a bit pointless, but leave it > + * here in case someone adds a length field in the future. > + */ > + if ( (end - start) > size ) > + return -EINVAL; > + > + info->zimage.kernel_addr = addr; > + > + info->zimage.load_addr = info->mem.bank[0].start > + + zimage.text_offset; > + info->zimage.len = end - start; > + > + info->entry = info->zimage.load_addr; > + info->load = kernel_zimage_load;Could you fill check_overlap callback? For this purpose, you can reuse kernel_zimage_check_overlap.> + info->type = DOMAIN_PV64; > + > + return 0; > +} > +#endif > + > /* > * Check if the image is a 32-bit zImage and setup kernel_info > */ > @@ -170,6 +223,10 @@ static int kernel_try_zimage32_prepare(struct kernel_info *info, > info->load = kernel_zimage_load; > info->check_overlap = kernel_zimage_check_overlap; > > +#ifdef CONFIG_ARM_64 > + info->type = DOMAIN_PV32; > +#endif > + > return 0; > } > > @@ -208,6 +265,19 @@ static int kernel_try_elf_prepare(struct kernel_info *info, > if ( (rc = elf_xen_parse(&info->elf.elf, &info->elf.parms)) != 0 ) > goto err; > > +#ifdef CONFIG_ARM_64 > + if ( elf_32bit(&info->elf.elf) ) > + info->type = DOMAIN_PV32; > + else if ( elf_64bit(&info->elf.elf) ) > + info->type = DOMAIN_PV64; > + else > + { > + printk("Unknown ELF class\n"); > + rc = -EINVAL; > + goto err; > + } > +#endif > + > /* > * TODO: can the ELF header be used to find the physical address > * to load the image to? Instead of assuming virt == phys. > @@ -254,13 +324,13 @@ int kernel_prepare(struct kernel_info *info) > info->load_attr = BUFFERABLE; > } > > - rc = kernel_try_zimage32_prepare(info, start, size); > - if (rc < 0) > - rc = kernel_try_elf_prepare(info, start, size); > - > #ifdef CONFIG_ARM_64 > - info->type = DOMAIN_PV32; /* No 64-bit guest support yet */ > + rc = kernel_try_zimage64_prepare(info, start, size); > + if (rc < 0) > #endif > + rc = kernel_try_zimage32_prepare(info, start, size); > + if (rc < 0) > + rc = kernel_try_elf_prepare(info, start, size); > > return rc; > } >
On 07/19/2013 12:44 PM, Ian Campbell wrote:> I was mostly interested in commenting the RW bit which is Register Width and > not Read/Write as a reader might initially expect. Thought I might as well do > the others... > > Signed-off-by: Ian Campbell <ian.campbell@citrix.com>Thanks ! It will save time to understand each bit. :) Acked-by: Julien Grall <julien.grall@linaro.org>> --- > xen/include/asm-arm/processor.h | 56 +++++++++++++++++++------------------- > 1 files changed, 28 insertions(+), 28 deletions(-) > > diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h > index f84ad99..c9d406c 100644 > --- a/xen/include/asm-arm/processor.h > +++ b/xen/include/asm-arm/processor.h > @@ -41,38 +41,38 @@ > #define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK) > > /* HCR Hyp Configuration Register */ > -#define HCR_RW (1<<31) /* ARM64 only */ > -#define HCR_TGE (1<<27) > -#define HCR_TVM (1<<26) > -#define HCR_TTLB (1<<25) > -#define HCR_TPU (1<<24) > -#define HCR_TPC (1<<23) > -#define HCR_TSW (1<<22) > -#define HCR_TAC (1<<21) > -#define HCR_TIDCP (1<<20) > -#define HCR_TSC (1<<19) > -#define HCR_TID3 (1<<18) > -#define HCR_TID2 (1<<17) > -#define HCR_TID1 (1<<16) > -#define HCR_TID0 (1<<15) > -#define HCR_TWE (1<<14) > -#define HCR_TWI (1<<13) > -#define HCR_DC (1<<12) > -#define HCR_BSU_MASK (3<<10) > +#define HCR_RW (1<<31) /* Register Width, ARM64 only */ > +#define HCR_TGE (1<<27) /* Trap General Exceptions */ > +#define HCR_TVM (1<<26) /* Trap Virtual Memory Controls */ > +#define HCR_TTLB (1<<25) /* Trap TLB Maintenance Operations */ > +#define HCR_TPU (1<<24) /* Trap Cache Maintenance Operations to PoU */ > +#define HCR_TPC (1<<23) /* Trap Cache Maintenance Operations to PoC */ > +#define HCR_TSW (1<<22) /* Trap Set/Way Cache Maintenance Operations */ > +#define HCR_TAC (1<<21) /* Trap ACTLR Accesses */ > +#define HCR_TIDCP (1<<20) /* Trap lockdown */ > +#define HCR_TSC (1<<19) /* Trap SMC instruction */ > +#define HCR_TID3 (1<<18) /* Trap ID Register Group 3 */ > +#define HCR_TID2 (1<<17) /* Trap ID Register Group 2 */ > +#define HCR_TID1 (1<<16) /* Trap ID Register Group 1 */ > +#define HCR_TID0 (1<<15) /* Trap ID Register Group 0 */ > +#define HCR_TWE (1<<14) /* Trap WFE instruction */ > +#define HCR_TWI (1<<13) /* Trap WFI instruction */ > +#define HCR_DC (1<<12) /* Default cacheable */ > +#define HCR_BSU_MASK (3<<10) /* Barrier Shareability Upgrade */ > #define HCR_BSU_NONE (0<<10) > #define HCR_BSU_INNER (1<<10) > #define HCR_BSU_OUTER (2<<10) > #define HCR_BSU_FULL (3<<10) > -#define HCR_FB (1<<9) > -#define HCR_VA (1<<8) > -#define HCR_VI (1<<7) > -#define HCR_VF (1<<6) > -#define HCR_AMO (1<<5) > -#define HCR_IMO (1<<4) > -#define HCR_FMO (1<<3) > -#define HCR_PTW (1<<2) > -#define HCR_SWIO (1<<1) > -#define HCR_VM (1<<0) > +#define HCR_FB (1<<9) /* Force Broadcast of Cache/BP/TLB operations */ > +#define HCR_VA (1<<8) /* Virtual Asynchronous Abort */ > +#define HCR_VI (1<<7) /* Virtual IRQ */ > +#define HCR_VF (1<<6) /* Virtual FIQ */ > +#define HCR_AMO (1<<5) /* Override CPSR.A */ > +#define HCR_IMO (1<<4) /* Override CPSR.I */ > +#define HCR_FMO (1<<3) /* Override CPSR.F */ > +#define HCR_PTW (1<<2) /* Protected Walk */ > +#define HCR_SWIO (1<<1) /* Set/Way Invalidation Override */ > +#define HCR_VM (1<<0) /* Virtual MMU Enable */ > > #define HSR_EC_WFI_WFE 0x01 > #define HSR_EC_CP15_32 0x03 >
Ian Campbell
2013-Jul-19 13:39 UTC
Re: [PATCH v3 03/15] xen: arm: support for loading 64-bit zImage dom0
On Fri, 2013-07-19 at 14:30 +0100, Julien Grall wrote:> > + info->entry = info->zimage.load_addr; > > + info->load = kernel_zimage_load; > > Could you fill check_overlap callback? For this purpose, you can reuse > kernel_zimage_check_overlap.Sure, missed this during the rebase because it didn''t actually cause an issue. Ian.
Ian Campbell
2013-Jul-19 13:42 UTC
Re: [PATCH v3 04/15] xen: arm: support building a 64-bit dom0 domain
On Fri, 2013-07-19 at 08:59 -0400, Konrad Rzeszutek Wilk wrote:> > + regs->cpsr = PSR_GUEST_INIT|PSR_MODE_SVC; > > + > > + /* Pretend to be a Cortex A15 */ > > Um, should there by a TODO for that?I''m not a big fan of sprinkling TODOs through the code. Given that CortexA15/A7 (which are architecturally identical) are the only processor cores which we currently support (or which exist) and that real 64-bit hardware isn''t available yet I think changing this would be part of a bigger effort anyway.> > + d->arch.vpidr = 0x410fc0f0; > > + > > /* FROM LINUX head.S
Stefano Stabellini
2013-Jul-22 10:44 UTC
Re: [PATCH v3 09/15] xen: arm: Set EL1 register width in HCR_EL2 during context switch.
On Fri, 19 Jul 2013, Ian Campbell wrote:> Signed-off-by: Ian Campbell <ian.campbell@citrix.com>Acked-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>> v3: Remove ifdef and rely on is_pv32_domain == true on arm32 > --- > xen/arch/arm/domain.c | 5 +++++ > xen/include/asm-arm/processor.h | 1 + > 2 files changed, 6 insertions(+), 0 deletions(-) > > diff --git a/xen/arch/arm/domain.c b/xen/arch/arm/domain.c > index 70fbc0d..a409d11 100644 > --- a/xen/arch/arm/domain.c > +++ b/xen/arch/arm/domain.c > @@ -207,6 +207,11 @@ static void ctxt_switch_to(struct vcpu *n) > > isb(); > > + if ( is_pv32_domain(n->domain) ) > + hcr &= ~HCR_RW; > + else > + hcr |= HCR_RW; > + > WRITE_SYSREG(hcr, HCR_EL2); > isb(); > > diff --git a/xen/include/asm-arm/processor.h b/xen/include/asm-arm/processor.h > index 8b9b320..9e4995c 100644 > --- a/xen/include/asm-arm/processor.h > +++ b/xen/include/asm-arm/processor.h > @@ -40,6 +40,7 @@ > #define PSR_GUEST_INIT (PSR_ABT_MASK|PSR_FIQ_MASK|PSR_IRQ_MASK) > > /* HCR Hyp Configuration Register */ > +#define HCR_RW (1<<31) /* ARM64 only */ > #define HCR_TGE (1<<27) > #define HCR_TVM (1<<26) > #define HCR_TTLB (1<<25) > -- > 1.7.2.5 >
Ian Campbell
2013-Jul-22 21:33 UTC
Re: [PATCH v3 00/15] xen: arm: 64-bit dom0 kernel support
On Fri, 2013-07-19 at 12:44 +0100, Ian Campbell wrote:> Initially we intended this to go into 4.3 but it didn''t make it, so here > it is rebased onto current staging. > > Other than the rebasing the only significant change is a major howler in > the entry.S code -- we weren''t restoring any of the outer stack frame > guest state for 32- or 64-bit guests! Most of the 32-bit guest state is > in the inner frame and the outer frame contains "only" the SPSR_* > registers. Amazing how far you can get without context switching those!Most of this series is now acked, anyone got any comments acks/nacks on nos 6, 10 or 15? That is: xen: arm: improve register dump output for 64-bit guest (and more generally too) xen: arm: handle traps from 64-bit guests xen: arm: Handle SMC from 64-bit guests Thanks, Ian
Stefano Stabellini
2013-Jul-24 11:30 UTC
Re: [PATCH v3 10/15] xen: arm: handle traps from 64-bit guests
On Fri, 19 Jul 2013, Ian Campbell wrote:> While there observe that we weren''t ever restoring the outer stack frame, even > for 32-bit guests when running a 64-bit hypervisor! The outer stack frame > "only" contains most of the SPSR registers for 32-bit... > > We can also remove the logic at "return_to_new_vcpu" which detects returns to > hypervisor mode -- seemingly trying to handle hypervisor threads which aren''t > an thing which we have. The idle VCPUs do not take this path. This simplifies > the return_to_new_vcpu code, we also split it into 32- and 64-bit VCPU paths. > > There is no need to export return_to_hypervisor (now renamed return_from_trap > on arm64) and likewise stop doing so on 32-bit as well. > > Also tweak the case of some system registers for consistency. > > Signed-off-by: Ian Campbell <ian.campbell@citrix.com> > --- > v3: actually restore the outer stack frame! Due to this change I have dropped > the previous Acks.It looks OK. However the code refactoring, the naming changes, and the new code to save the outer stack frame are difficult to follow together. Maybe separate out the latter?> xen/arch/arm/arm32/entry.S | 4 +- > xen/arch/arm/arm64/entry.S | 95 ++++++++++++++++++++++++++++++++----------- > xen/arch/arm/domain.c | 6 ++- > 3 files changed, 77 insertions(+), 28 deletions(-) > > diff --git a/xen/arch/arm/arm32/entry.S b/xen/arch/arm/arm32/entry.S > index 76814dd..cc0a99f 100644 > --- a/xen/arch/arm/arm32/entry.S > +++ b/xen/arch/arm/arm32/entry.S > @@ -87,7 +87,7 @@ DEFINE_TRAP_ENTRY_NOIRQ(fiq) > > return_from_trap: > mov sp, r11 > -ENTRY(return_to_new_vcpu) > +ENTRY(return_to_new_vcpu32) > ldr r11, [sp, #UREGS_cpsr] > and r11, #PSR_MODE_MASK > cmp r11, #PSR_MODE_HYP > @@ -108,7 +108,7 @@ ENTRY(return_to_guest) > RESTORE_ONE_BANKED(R8_fiq); RESTORE_ONE_BANKED(R9_fiq); RESTORE_ONE_BANKED(R10_fiq) > RESTORE_ONE_BANKED(R11_fiq); RESTORE_ONE_BANKED(R12_fiq); > /* Fall thru */ > -ENTRY(return_to_hypervisor) > +return_to_hypervisor: > cpsid i > ldr lr, [sp, #UREGS_lr] > ldr r11, [sp, #UREGS_pc] > diff --git a/xen/arch/arm/arm64/entry.S b/xen/arch/arm/arm64/entry.S > index b5af1e2..9cda8f1 100644 > --- a/xen/arch/arm/arm64/entry.S > +++ b/xen/arch/arm/arm64/entry.S > @@ -26,7 +26,7 @@ lr .req x30 // link register > .macro entry_guest, compat > > add x21, sp, #UREGS_SPSR_el1 > - mrs x23, SPSR_EL1 > + mrs x23, SPSR_el1 > str x23, [x21] > > .if \compat == 0 /* Aarch64 mode */ > @@ -40,24 +40,56 @@ lr .req x30 // link register > mrs x23, ELR_el1 > stp x22, x23, [x21] > > - .else /* Aarch32 mode */ > + .else /* Aarch32 mode */ > > add x21, sp, #UREGS_SPSR_fiq > - mrs x22, spsr_fiq > - mrs x23, spsr_irq > + mrs x22, SPSR_fiq > + mrs x23, SPSR_irq > stp w22, w23, [x21] > > add x21, sp, #UREGS_SPSR_und > - mrs x22, spsr_und > - mrs x23, spsr_abt > + mrs x22, SPSR_und > + mrs x23, SPSR_abt > stp w22, w23, [x21] > > .endif > > .endm > > + .macro exit_guest, compat > + > + add x21, sp, #UREGS_SPSR_el1 > + ldr x23, [x21] > + msr SPSR_el1, x23 > + > + .if \compat == 0 /* Aarch64 mode */ > + > + add x21, sp, #UREGS_SP_el0 > + ldr x22, [x21] > + msr SP_el0, x22 > + > + add x21, sp, #UREGS_SP_el1 > + ldp x22, x23, [x21] > + msr SP_el1, x22 > + msr ELR_el1, x23 > + > + .else /* Aarch32 mode */ > + > + add x21, sp, #UREGS_SPSR_fiq > + ldp w22, w23, [x21] > + msr SPSR_fiq, x22 > + msr SPSR_irq, x23 > + > + add x21, sp, #UREGS_SPSR_und > + ldp w22, w23, [x21] > + msr SPSR_und, x22 > + msr SPSR_abt, x23 > + > + .endif > + > + .endm > /* > - * Save state on entry to hypervisor > + * Save state on entry to hypervisor, restore on exit > */ > .macro entry, hyp, compat > sub sp, sp, #(UREGS_SPSR_el1 - UREGS_LR) /* CPSR, PC, SP, LR */ > @@ -96,6 +128,20 @@ lr .req x30 // link register > > .endm > > + .macro exit, hyp, compat > + > + .if \hyp == 0 /* Guest mode */ > + > + bl leave_hypervisor_tail /* Disables interrupts on return */ > + > + exit_guest \compat > + > + .endif > + > + b return_from_trap > + > + .endm > + > /* > * Bad Abort numbers > *----------------- > @@ -133,21 +179,26 @@ hyp_sync: > msr daifclr, #2 > mov x0, sp > bl do_trap_hypervisor > - b return_to_hypervisor > + exit hyp=1 > > hyp_irq: > entry hyp=1 > mov x0, sp > bl do_trap_irq > - b return_to_hypervisor > + exit hyp=1 > > guest_sync: > entry hyp=0, compat=0 > - invalid BAD_SYNC /* No AArch64 guest support yet */ > + msr daifclr, #2 > + mov x0, sp > + bl do_trap_hypervisor > + exit hyp=0, compat=0 > > guest_irq: > entry hyp=0, compat=0 > - invalid BAD_IRQ /* No AArch64 guest support yet */ > + mov x0, sp > + bl do_trap_irq > + exit hyp=0, compat=0 > > guest_fiq_invalid: > entry hyp=0, compat=0 > @@ -162,13 +213,13 @@ guest_sync_compat: > msr daifclr, #2 > mov x0, sp > bl do_trap_hypervisor > - b return_to_guest > + exit hyp=0, compat=1 > > guest_irq_compat: > entry hyp=0, compat=1 > mov x0, sp > bl do_trap_irq > - b return_to_guest > + exit hyp=0, compat=1 > > guest_fiq_invalid_compat: > entry hyp=0, compat=1 > @@ -178,18 +229,12 @@ guest_error_invalid_compat: > entry hyp=0, compat=1 > invalid BAD_ERROR > > -ENTRY(return_to_new_vcpu) > - ldr x21, [sp, #UREGS_CPSR] > - and x21, x21, #PSR_MODE_MASK > - /* Returning to EL2? */ > - cmp x21, #PSR_MODE_EL2t > - ccmp x21, #PSR_MODE_EL2h, #0x4, ne > - b.eq return_to_hypervisor /* Yes */ > - /* Fall thru */ > -ENTRY(return_to_guest) > - bl leave_hypervisor_tail /* Disables interrupts on return */ > - /* Fall thru */ > -ENTRY(return_to_hypervisor) > +ENTRY(return_to_new_vcpu32) > + exit hyp=0, compat=1 > +ENTRY(return_to_new_vcpu64) > + exit hyp=0, compat=0 > + > +return_from_trap: > msr daifset, #2 /* Mask interrupts */ > > ldp x21, x22, [sp, #UREGS_PC] // load ELR, SPSR > diff --git a/xen/arch/arm/domain.c b/xen/arch/arm/domain.c > index a409d11..cf82e21 100644 > --- a/xen/arch/arm/domain.c > +++ b/xen/arch/arm/domain.c > @@ -250,9 +250,13 @@ static void continue_new_vcpu(struct vcpu *prev) > > if ( is_idle_vcpu(current) ) > reset_stack_and_jump(idle_loop); > + else if is_pv32_domain(current->domain) > + /* check_wakeup_from_wait(); */ > + reset_stack_and_jump(return_to_new_vcpu32); > else > /* check_wakeup_from_wait(); */ > - reset_stack_and_jump(return_to_new_vcpu); > + reset_stack_and_jump(return_to_new_vcpu64); > + > } > > void context_switch(struct vcpu *prev, struct vcpu *next) > -- > 1.7.2.5 >
Ian Campbell
2013-Jul-24 17:50 UTC
Re: [PATCH v3 10/15] xen: arm: handle traps from 64-bit guests
On Wed, 2013-07-24 at 12:30 +0100, Stefano Stabellini wrote:> On Fri, 19 Jul 2013, Ian Campbell wrote: > > While there observe that we weren''t ever restoring the outer stack frame, even > > for 32-bit guests when running a 64-bit hypervisor! The outer stack frame > > "only" contains most of the SPSR registers for 32-bit... > > > > We can also remove the logic at "return_to_new_vcpu" which detects returns to > > hypervisor mode -- seemingly trying to handle hypervisor threads which aren''t > > an thing which we have. The idle VCPUs do not take this path. This simplifies > > the return_to_new_vcpu code, we also split it into 32- and 64-bit VCPU paths. > > > > There is no need to export return_to_hypervisor (now renamed return_from_trap > > on arm64) and likewise stop doing so on 32-bit as well. > > > > Also tweak the case of some system registers for consistency. > > > > Signed-off-by: Ian Campbell <ian.campbell@citrix.com> > > --- > > v3: actually restore the outer stack frame! Due to this change I have dropped > > the previous Acks. > > It looks OK. However the code refactoring, the naming changes, and the > new code to save the outer stack frame are difficult to follow together. > Maybe separate out the latter?Right, I''ll see what I can do. Ian