dolphin

mirror of https://github.com/dolphin-emu/dolphin.git synced 2026-04-11 23:00:18 -04:00

Author	SHA1	Message	Date
Sintendo	19dda51a0d	Jit64: subfx - Use LEA when possible Similar to what we do for addx. Since we're calculating b - a and because subtraction is not commutative, we can only apply this when source register a holds the constant. Before: 45 8B EE mov r13d,r14d 41 83 ED 08 sub r13d,8 After: 45 8D 6E F8 lea r13d,[r14-8]	2020-04-21 22:45:47 +02:00
Sintendo	89646c898f	Jit64: addx - Skip ADD after MOV when possible We can get away with skipping the addition when we know we're dealing with a constant zero. Just a MOV will suffice in this case. Once again, we don't bother to add separate handling for when overflow is needed, because no titles would ever hit that path during my testing. Before: 8B 7D F8 mov edi,dword ptr [rbp-8] 83 C7 00 add edi,0 After: 8B 7D F8 mov edi,dword ptr [rbp-8]	2020-04-21 22:45:47 +02:00
Sintendo	50f7a7d248	Jit64: addx - Prefer smaller MOV+ADD sequence ADD has a smaller encoding for immediates that can be expressed as an 8-bit signed integer (in other words, between -128 and 127). MOV lacks this compact representation. Since addition allows us to swap the source registers, we can always get the shortest sequence here by carefully checking if we're dealing with a small immediate first. If we are, move the other source into the destination and add the small immediate onto that. For large immediates the reverse is preferable. Before: 41 BE 40 00 00 00 mov r14d,40h 44 03 75 A8 add r14d,dword ptr [rbp-58h] After: 44 8B 75 A8 mov r14d,dword ptr [rbp-58h] 41 83 C6 40 add r14d,40h Before: 44 8B 7D F8 mov r15d,dword ptr [rbp-8] 41 81 C7 00 68 00 CC add r15d,0CC006800h After: 41 BF 00 68 00 CC mov r15d,0CC006800h 44 03 7D F8 add r15d,dword ptr [rbp-8]	2020-04-21 22:42:02 +02:00
Sintendo	2481660519	Jit64: addx - Emit MOV when possible When the source registers are a simple register and a constant zero and overflow isn't needed, emitting LEA is kinda silly. This will occasionally save a single byte for certain registers due to how x86 encoding works. More importantly, LEA takes up execution resources while MOV does not. Before: 41 8D 7D 00 lea edi,[r13] After: 41 8B FD mov edi,r13d	2020-04-21 22:36:20 +02:00
Sintendo	1c25e6352a	Jit64: addx - Emit nothing when possible When the destination register matches a source register, the other source register contains zero, and overflow isn't needed, the instruction becomes a nop and we don't need to emit anything. We could add specialized handling for the case where overflow is needed, but none of the titles I tried would hit this path. Before: 83 C7 00 add edi,0 After:	2020-04-21 22:35:17 +02:00
Sintendo	f1c3ab359d	Jit64: addx - Deduplicate branches part 2 No functional change, just simplify some repeated logic in the case where we're dealing with exactly one immediate and one simple register when overflow isn't needed.	2020-04-21 22:06:46 +02:00
Sintendo	72fbdf1a6b	Jit64: addx - Deduplicate branches part 1 No functional change, just simplify some repeated logic for the cases where the destination register matches one of the sources.	2020-04-21 22:06:39 +02:00
container1234	75a69b1145	Breakpoints: Fix crash after clearing all memory breakpoints	2020-03-14 21:57:09 +09:00
Tilka	e323f47ceb	Merge pull request #8472 from degasus/jitsetting Core/Jits: Adds an option to disable the register cache.	2020-02-08 13:49:33 +00:00
Techjar	a106c99826	Jit64: Don't use PEXT in DoubleToSingle on AMD Zen This was causing severe slowdown in some games.	2020-01-26 22:10:46 -05:00
Tilka	709862b818	Merge pull request #8120 from MerryMage/cdts Jit64: Make DoubleToSingle a common asm routine	2020-01-25 19:10:37 +00:00
Connor McLaughlin	efc1ee8e6a	Merge pull request #8537 from degasus/fastmem Core/HW -> PowerPC/JIT: Fastmem arena construction	2020-01-14 09:38:15 +10:00
Tilka	98f645daac	Merge pull request #8158 from Sintendo/jitopts x64 micro-optimizations	2020-01-06 14:09:43 +01:00
Sintendo	12fcbac2a3	Jit64: addx - Emit LEA for register + immediate Prefer LEA over MOV + ADD when dealing with immediates. Before: 44 8B EE mov r13d,esi 41 83 C5 20 add r13d,20h After: 44 8D 6E 20 lea r13d,[rsi+20h]	2020-01-05 23:39:13 +01:00
Sintendo	8e7b6f4178	Jit64: addx - Prefer ADD over LEA when possible The old logic would always emit LEA when both sources are in a register and OE is disabled. However, ADD is still preferable when one of the sources matches the destination. Before: 45 8D 6C 35 00 lea r13d,[r13+rsi] After: 44 03 EE add r13d,esi	2020-01-05 23:23:56 +01:00
David Korth	c2dd2e8a2e	Use std::istringstream or std::ostringstream instead of std::stringstream where possible. This removes std::iostream from the inheritance chain, which reduces overhead slightly.	2019-12-29 23:45:02 -05:00
David Korth	9f3b9acad9	PowerPC.cpp: No need to explicitly initialize ppcState. "ppcState{}" is stored in the .data segment, which means the full ~4 MB is stored in the executable. "ppcState" is stored in the .bss segment, which means it only stores a note that tells it to allocate and zero ~4 MB at runtime.	2019-12-29 23:45:02 -05:00
degasus	aad8aab698	Jit64: Disable the fast address check if fastmem is disabled. This was a huge speedup with disabled fastmem, but it still requires the fastmem arena. So let's disable it for now, even if this commit has a huge performance hit with disabled fastmem.	2019-12-28 13:41:57 +01:00
degasus	d735943aa2	Jit64: Use safe memory helpers for psq_l* without fastmem. RMEM won't help if there is no fastmem arena, so let's use our memory helpers.	2019-12-28 13:41:57 +01:00
degasus	74cb692591	Jit64: Only activate dcbz fastpath with fastmem. The code is safe not to create memory errors, but it accesses the fastmem area.	2019-12-28 13:41:57 +01:00
degasus	c6019f9814	PowerPC/Jit: Create fastmem arena on init.	2019-12-28 13:41:57 +01:00
degasus	9d88180df7	MMU: Use the Memory helpers for physical memory. physical_base is a fastmem helper. Its access is unsafe and might not be available without a Jit.	2019-12-28 12:57:51 +01:00
Stenzek	d744c5a148	Compile fixes for Windows-on-ARM64	2019-12-28 19:20:41 +10:00
Léo Lam	3cf2857aac	Merge pull request #8520 from lioncash/analyst-tidy PowerPC/PPCAnalyst: Remove unimplemented LogFunctionCall prototype	2019-12-15 12:07:38 +01:00
Lioncash	7c12081693	PowerPC/PPCAnalyst: Remove unimplemented LogFunctionCall prototype This doesn't have an implementation, so it can be removed.	2019-12-15 00:23:32 -05:00
Lioncash	474e210cd5	PPCCache: Make arrays constexpr where applicable Two of these arrays were stored within the save state when the exact same data is constructed all the time. We can just build this into the binary rather than the save state, shrinking a little bit of the save state's overall size.	2019-12-03 04:05:09 -05:00
Lioncash	1d5ce4708f	PPCCache: Make use of std::array where applicable Allows simplifying a bunch of initialization and filling code.	2019-12-03 04:05:05 -05:00
Léo Lam	a77108236e	Merge pull request #8418 from lioncash/core-fmt Core: Replace usages of StringFromFormat with fmt where applicable	2019-11-16 18:09:58 +01:00
degasus	3ebcc445de	Core/Jits: Adds an option to disable the register cache. This will help to disable all inter-instruction dependencies. So Android users can check if only a single instruction is broken without compiling Dolphin on their own.	2019-11-16 13:05:54 +01:00
Lioncash	febd1c3dba	Core: Replace usages of StringFromFormat with fmt where applicable Migrates usages of StringFromFormat over to using fmt.	2019-11-11 07:32:57 -05:00
Silent	45890c20cf	Remove obsolete "Windows compatibility" macros	2019-10-07 22:46:37 +02:00
Tillmann Karras	ee85ec95b4	Jit64: fix mcrfs It was deleting the wrong bits. This fixes issue 10074 (Super Monkey Ball 2 minigame). I introduced this regression in `0f2c656687`.	2019-08-18 03:49:32 +01:00
Connor McLaughlin	48ca2c6f2e	Merge pull request #8233 from JosJuice/stringutil-string-view StringUtil: Use std::string_view more	2019-08-09 23:39:01 +10:00
Connor McLaughlin	a8bbbd0bfc	Merge pull request #8262 from Tilka/gdbstub Fix GDB stub build	2019-07-25 13:54:02 +10:00
JosJuice	a2a1e04fc9	StringUtil: Use std::string_view more	2019-07-23 14:49:12 +02:00
Lioncash	a9a9b193bb	Common/DebugInterface: Use forward declarations where applicable We're allowed (by the standard) to forward declare types within std::vector, so we can replace direct includes with forward declarations and then include the types where they're directly needed. While we're at it, we can remove an unused inclusion of <cstring>, given nothing in the header uses anything from it. This also revealed an indirect inclusion, which this also resolves.	2019-07-08 18:59:11 -04:00
Tillmann Karras	da87638ea2	Fix GDB stub	2019-07-07 05:17:33 +01:00
Silent	6c21811090	Make DolphinAnalytics a true singleton - static local variables are initialized in a thread safe manner since C++11 Also works around a Visual Studio 2017 bug where static inline class fields are destructed multiple times	2019-06-23 21:43:47 +02:00
Sintendo	e9d4869965	Jit64AsmCommon: Micro-optimize GenFrsqrte Save 2 bytes by testing register against itself and branching on the sign flag. Before: 0: 48 0f ba e0 3f bt rax,0x3f After: 0: 48 85 c0 test rax,rax	2019-06-05 23:58:44 +02:00
MerryMage	f6afce781f	Jit64AsmCommon: Use PEXT in GenConvertDoubleToSingle	2019-05-25 23:07:50 +01:00
MerryMage	12314577c1	Jit64AsmCommon: Make ConvertDoubleToSingle use RSCRATCH as output	2019-05-25 23:07:50 +01:00
booto	c4799e5977	Jit64: Make DoubleToSingle a common asm routine	2019-05-25 23:07:50 +01:00
booto	27cb407ecf	PowerPC: Add Broadway's PVR (retail Wii)	2019-05-23 19:56:41 -04:00
Connor McLaughlin	68877c52d1	Merge pull request #8027 from MerryMage/MOVAPS Jit64: Prefer MOVAPS where possible	2019-05-22 15:05:17 +10:00
Techjar	ff972e3673	Reformat repo to clang-format 7.0 rules	2019-05-06 18:48:04 +00:00
Léo Lam	ab9ece9bca	Replace MathUtil::Clamp with std::clamp	2019-05-04 23:12:17 +02:00
Léo Lam	6f84984b7b	Use attribute [[fallthrough]]	2019-05-04 23:04:18 +02:00
Léo Lam	99a4ca8de7	Merge pull request #7839 from ShFil119/impr/redundant Remove redundant initialization	2019-05-04 22:50:51 +02:00
booto	2ff0486335	Debugger/Memory: Add support for address spaces Different address spaces can be chosen in the memory view panel. * Effective (or virtual): Probably the view people mostly want. Address translation goes through MMU. * Auxiliary: ARAM address space. Does not display anything in Wii mode. * Physical: Physical address space. Only supports mem1 and mem2 (Wii mode) so far.	2019-05-02 21:14:30 -04:00
Léo Lam	f4d7b537cc	Merge pull request #7958 from Craftyawesome/cpu-emulation-order Re-order cpu emulation options from fastest to slowest	2019-05-01 17:56:19 +02:00
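
The size trade-off described in commit 50f7a7d248 (addx - Prefer smaller MOV+ADD sequence) can be sketched in C++ roughly as follows. This is an illustration only, not Dolphin's emitter code: the names AddOrder, FitsInSignedImm8, ChooseAddOrder, and SequenceSize are hypothetical, and the byte counts assume the non-constant source is a [rbp+disp8] memory operand and leave out the one-byte REX prefix that each instruction in the commit's r14d/r15d listings carries.

```cpp
#include <cstdint>

// Hypothetical names for illustration; not part of Dolphin's code base.
enum class AddOrder
{
  MovOtherThenAddImm,  // mov dst, [other]; add dst, imm
  MovImmThenAddOther,  // mov dst, imm32;   add dst, [other]
};

// ADD has a sign-extended 8-bit immediate form (83 /0 ib); MOV does not.
constexpr bool FitsInSignedImm8(std::int32_t imm)
{
  return imm >= -128 && imm <= 127;
}

// Small immediate: load the other source first, then add the immediate.
// Large immediate: load the immediate first, then add the other source.
constexpr AddOrder ChooseAddOrder(std::int32_t imm)
{
  return FitsInSignedImm8(imm) ? AddOrder::MovOtherThenAddImm :
                                 AddOrder::MovImmThenAddOther;
}

// Approximate encoded size in bytes, without REX prefixes, assuming the
// other source is addressed as [rbp+disp8]:
//   mov r32, [rbp+disp8]  -> 8B /r disp8  = 3 bytes
//   add r32, [rbp+disp8]  -> 03 /r disp8  = 3 bytes
//   mov r32, imm32        -> B8+r id      = 5 bytes
//   add r32, imm8         -> 83 /0 ib     = 3 bytes
//   add r32, imm32        -> 81 /0 id     = 6 bytes
constexpr int SequenceSize(AddOrder order, std::int32_t imm)
{
  return order == AddOrder::MovOtherThenAddImm ?
             3 + (FitsInSignedImm8(imm) ? 3 : 6) :
             5 + 3;
}
```

Under these assumptions, the immediate 40h costs 6 bytes in the chosen order versus 8 in the other, and 0CC006800h costs 8 versus 9. Adding one REX byte per instruction gives 8 versus 10 and 10 versus 11, which agrees with the byte listings in the commit message.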

2228 Commits