OpenGrok

//===---------------------------------------------------------------------===//
// Random ideas for the X86 backend: FP stack related stuff
//===---------------------------------------------------------------------===//

//===---------------------------------------------------------------------===//

Some targets (e.g. athlons) prefer freep to fstp ST(0):
http://gcc.gnu.org/ml/gcc-patches/2004-04/msg00659.html

//===---------------------------------------------------------------------===//

This should use fiadd on chips where it is profitable:
double foo(double P, int *I) { return P+*I; }

We have fiadd patterns now but the followings have the same cost and
complexity. We need a way to specify the later is more profitable.

def FpADD32m  : FpI<(ops RFP:$dst, RFP:$src1, f32mem:$src2), OneArgFPRW,
                    [(set RFP:$dst, (fadd RFP:$src1,
                                     (extloadf64f32 addr:$src2)))]>;
                // ST(0) = ST(0) + [mem32]

def FpIADD32m : FpI<(ops RFP:$dst, RFP:$src1, i32mem:$src2), OneArgFPRW,
                    [(set RFP:$dst, (fadd RFP:$src1,
                                     (X86fild addr:$src2, i32)))]>;
                // ST(0) = ST(0) + [mem32int]

//===---------------------------------------------------------------------===//

The FP stackifier should handle simple permutates to reduce number of shuffle
instructions, e.g. turning:

fld P	->		fld Q
fld Q			fld P
fxch

or:

fxch	->		fucomi
fucomi			jl X
jg X

Ideas:
http://gcc.gnu.org/ml/gcc-patches/2004-11/msg02410.html


//===---------------------------------------------------------------------===//

Add a target specific hook to DAG combiner to handle SINT_TO_FP and
FP_TO_SINT when the source operand is already in memory.

//===---------------------------------------------------------------------===//

Open code rint,floor,ceil,trunc:
http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02006.html
http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02011.html

Opencode the sincos[f] libcall.

//===---------------------------------------------------------------------===//

None of the FPStack instructions are handled in
X86RegisterInfo::foldMemoryOperand, which prevents the spiller from
folding spill code into the instructions.

//===---------------------------------------------------------------------===//

Currently the x86 codegen isn't very good at mixing SSE and FPStack
code:

unsigned int foo(double x) { return x; }

foo:
	subl $20, %esp
	movsd 24(%esp), %xmm0
	movsd %xmm0, 8(%esp)
	fldl 8(%esp)
	fisttpll (%esp)
	movl (%esp), %eax
	addl $20, %esp
	ret

This just requires being smarter when custom expanding fptoui.

//===---------------------------------------------------------------------===//
`Name`	`Date`	`Size`
Up to higher level directory
`AsmParser/`	21-Aug-2018
`Disassembler/`	21-Aug-2018
`INSTALL.vcxproj.filters`	21-Aug-2018	`657`
`InstPrinter/`	21-Aug-2018
`LLVMX86CodeGen.vcxproj`	21-Aug-2018	`28K`
`LLVMX86CodeGen.vcxproj.filters`	21-Aug-2018	`3.6K`
`Makefile`	21-Aug-2018	`861`
`MCTargetDesc/`	21-Aug-2018
`PACKAGE.vcxproj.filters`	21-Aug-2018	`657`
`README-FPStack.txt`	21-Aug-2018	`2.7K`
`README-MMX.txt`	21-Aug-2018	`1.5K`
`README-SSE.txt`	21-Aug-2018	`26.4K`
`README-UNIMPLEMENTED.txt`	21-Aug-2018	`679`
`README-X86-64.txt`	21-Aug-2018	`6K`
`README.txt`	21-Aug-2018	`53.8K`
`TargetInfo/`	21-Aug-2018
`Utils/`	21-Aug-2018
`X86.h`	21-Aug-2018	`2.5K`
`X86.td`	21-Aug-2018	`13.2K`
`X86AsmPrinter.cpp`	21-Aug-2018	`25.6K`
`X86AsmPrinter.h`	21-Aug-2018	`2.9K`
`X86CallingConv.td`	21-Aug-2018	`15.7K`
`X86CodeEmitter.cpp`	21-Aug-2018	`35.6K`
`X86COFFMachineModuleInfo.cpp`	21-Aug-2018	`615`
`X86COFFMachineModuleInfo.h`	21-Aug-2018	`1.4K`
`X86CommonTableGen.vcxproj`	21-Aug-2018	`113.8K`
`X86CommonTableGen.vcxproj.filters`	21-Aug-2018	`2.8K`
`X86CompilationCallback_Win64.asm`	21-Aug-2018	`1.6K`
`X86ELFWriterInfo.cpp`	21-Aug-2018	`4.2K`
`X86ELFWriterInfo.h`	21-Aug-2018	`2.2K`
`X86FastISel.cpp`	21-Aug-2018	`72.6K`
`X86FloatingPoint.cpp`	21-Aug-2018	`65K`
`X86FrameLowering.cpp`	21-Aug-2018	`52.3K`
`X86FrameLowering.h`	21-Aug-2018	`2.4K`
`X86GenAsmMatcher.inc`	21-Aug-2018	`380.5K`
`X86GenAsmWriter.inc`	21-Aug-2018	`274.7K`
`X86GenAsmWriter1.inc`	21-Aug-2018	`293.1K`
`X86GenCallingConv.inc`	21-Aug-2018	`33.7K`
`X86GenDAGISel.inc`	21-Aug-2018	`3M`
`X86GenDisassemblerTables.inc`	21-Aug-2018	`7.4M`
`X86GenEDInfo.inc`	21-Aug-2018	`2.7M`
`X86GenFastISel.inc`	21-Aug-2018	`194.9K`
`X86GenInstrInfo.inc`	21-Aug-2018	`674.7K`
`X86GenRegisterInfo.inc`	21-Aug-2018	`253.3K`
`X86GenSubtargetInfo.inc`	21-Aug-2018	`12.8K`
`X86Instr3DNow.td`	21-Aug-2018	`4.3K`
`X86InstrArithmetic.td`	21-Aug-2018	`55.8K`
`X86InstrBuilder.h`	21-Aug-2018	`6.7K`
`X86InstrCMovSetCC.td`	21-Aug-2018	`4.9K`
`X86InstrCompiler.td`	21-Aug-2018	`77.7K`
`X86InstrControl.td`	21-Aug-2018	`13.9K`
`X86InstrExtension.td`	21-Aug-2018	`7.8K`
`X86InstrFMA.td`	21-Aug-2018	`2.8K`
`X86InstrFormats.td`	21-Aug-2018	`20.7K`
`X86InstrFPStack.td`	21-Aug-2018	`33.3K`
`X86InstrFragmentsSIMD.td`	21-Aug-2018	`21.1K`
`X86InstrInfo.cpp`	21-Aug-2018	`140.2K`
`X86InstrInfo.h`	21-Aug-2018	`16.7K`
`X86InstrInfo.td`	21-Aug-2018	`84.4K`
`X86InstrMMX.td`	21-Aug-2018	`22.8K`
`X86InstrShiftRotate.td`	21-Aug-2018	`37.1K`
`X86InstrSSE.td`	21-Aug-2018	`339K`
`X86InstrSystem.td`	21-Aug-2018	`21.5K`
`X86InstrVMX.td`	21-Aug-2018	`2.8K`
`X86ISelDAGToDAG.cpp`	21-Aug-2018	`81.4K`
`X86ISelLowering.cpp`	21-Aug-2018	`573.1K`
`X86ISelLowering.h`	21-Aug-2018	`42K`
`X86JITInfo.cpp`	21-Aug-2018	`18.5K`
`X86JITInfo.h`	21-Aug-2018	`3K`
`X86MachineFunctionInfo.h`	21-Aug-2018	`5.3K`
`X86MCInstLower.cpp`	21-Aug-2018	`28K`
`X86MCInstLower.h`	21-Aug-2018	`1.3K`
`X86RegisterInfo.cpp`	21-Aug-2018	`30.7K`
`X86RegisterInfo.h`	21-Aug-2018	`4.7K`
`X86RegisterInfo.td`	21-Aug-2018	`20.2K`
`X86Relocations.h`	21-Aug-2018	`2K`
`X86SelectionDAGInfo.cpp`	21-Aug-2018	`9.8K`
`X86SelectionDAGInfo.h`	21-Aug-2018	`1.9K`
`X86Subtarget.cpp`	21-Aug-2018	`12.1K`
`X86Subtarget.h`	21-Aug-2018	`10.1K`
`X86TargetMachine.cpp`	21-Aug-2018	`5.7K`
`X86TargetMachine.h`	21-Aug-2018	`4.4K`
`X86TargetObjectFile.cpp`	21-Aug-2018	`1.7K`
`X86TargetObjectFile.h`	21-Aug-2018	`1.3K`
`X86VZeroUpper.cpp`	21-Aug-2018	`3.3K`