LLVM 23.0.0git
Merges shuffle masks and emits the final shuffle instruction, if required.
Public Member Functions

| Return type | Member | Description |
| --- | --- | --- |
| | ShuffleCostEstimator (Type *ScalarTy, TargetTransformInfo &TTI, ArrayRef< Value * > VectorizedVals, BoUpSLP &R, SmallPtrSetImpl< Value * > &CheckedExtracts) | |
| Value * | adjustExtracts (const TreeEntry *E, MutableArrayRef< int > Mask, ArrayRef< std::optional< TTI::ShuffleKind > > ShuffleKinds, unsigned NumParts, bool &UseVecBaseAsInput) | |
| std::optional< InstructionCost > | needToDelay (const TreeEntry *, ArrayRef< SmallVector< const TreeEntry * > >) const | Checks if the specified entry E needs to be delayed because of its dependency nodes. |
| void | resetForSameNode () | Resets the builder to handle a perfect diamond match. |
| void | add (const TreeEntry &E1, const TreeEntry &E2, ArrayRef< int > Mask) | |
| void | add (const TreeEntry &E1, ArrayRef< int > Mask) | |
| void | add (Value *V1, Value *V2, ArrayRef< int > Mask) | Adds two input vectors and the mask for their shuffling. |
| void | add (Value *V1, ArrayRef< int > Mask, bool ForExtracts=false) | Adds one more input vector and the mask for its shuffling. |
| Value * | gather (ArrayRef< Value * > VL, unsigned MaskVF=0, Value *Root=nullptr) | |
| InstructionCost | createFreeze (InstructionCost Cost) | |
| InstructionCost | finalize (ArrayRef< int > ExtMask, ArrayRef< std::pair< const TreeEntry *, unsigned > > SubVectors, ArrayRef< int > SubVectorsMask, unsigned VF=0, function_ref< void(Value *&, SmallVectorImpl< int > &, function_ref< Value *(Value *, Value *, ArrayRef< int >)>)> Action={}) | Finalizes emission of the shuffles. |
| | ~ShuffleCostEstimator () | |
Merges shuffle masks and emits the final shuffle instruction, if required.
It supports shuffling of two input vectors. Shuffle emission is lazy: a shuffle instruction is generated only when it is actually required. Otherwise, emission is deferred until the end of the process, which reduces both the number of emitted instructions and the amount of further analysis and transformation.
Definition at line 15608 of file SLPVectorizer.cpp.
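The benefit of deferring emission can be illustrated with a minimal standalone sketch: two back-to-back single-source shuffles can be folded into one mask, so only one shuffle needs to be emitted (or costed). The helper `composeMasks` and the constant `kPoisonMaskElem` are hypothetical names introduced for this illustration and are not part of LLVM; the constant mirrors `llvm::PoisonMaskElem`, which is -1. This is not the code in SLPVectorizer.cpp, only a sketch of the general idea of merging masks.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Mirrors llvm::PoisonMaskElem: a lane whose value is unspecified.
constexpr int kPoisonMaskElem = -1;

// Hypothetical helper, not an LLVM API. Folds two consecutive single-source
// shuffles into one mask. `Inner` is applied to the source vector first and
// `Outer` is applied to that result, so lane I of the combined shuffle reads
// source lane Inner[Outer[I]].
std::vector<int> composeMasks(const std::vector<int> &Inner,
                              const std::vector<int> &Outer) {
  std::vector<int> Combined(Outer.size(), kPoisonMaskElem);
  for (std::size_t I = 0; I < Outer.size(); ++I) {
    const int Idx = Outer[I];
    if (Idx == kPoisonMaskElem)
      continue; // A poison lane in the outer mask stays poison.
    assert(Idx >= 0 && static_cast<std::size_t>(Idx) < Inner.size() &&
           "outer mask index out of range");
    Combined[I] = Inner[Idx]; // May itself be poison.
  }
  return Combined;
}
```

For example, composing the inner mask `{1, 0, 3, 2}` with the outer mask `{3, -1, 0, 0}` yields a single mask, so one shuffle replaces two.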
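ShuffleCostEstimator (Type *ScalarTy, TargetTransformInfo &TTI, ArrayRef< Value * > VectorizedVals, BoUpSLP &R, SmallPtrSetImpl< Value * > &CheckedExtracts) [inline]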
Definition at line 16155 of file SLPVectorizer.cpp.
References slpvectorizer::BoUpSLP::BoUpSLP().
~ShuffleCostEstimator () [inline]
Definition at line 16559 of file SLPVectorizer.cpp.
References assert().
void add (const TreeEntry &E1, ArrayRef< int > Mask) [inline]
Definition at line 16325 of file SLPVectorizer.cpp.
References assert(), llvm::find_if(), getNumberOfParts(), getPartNumElems(), getWidenedType(), llvm::not_equal_to(), and llvm::PoisonMaskElem.
void add (const TreeEntry &E1, const TreeEntry &E2, ArrayRef< int > Mask) [inline]
Definition at line 16300 of file SLPVectorizer.cpp.
References add(), llvm::all_of(), assert(), llvm::find_if(), getNumberOfParts(), getPartNumElems(), getWidenedType(), llvm::not_equal_to(), and llvm::PoisonMaskElem.
Referenced by add().
void add (Value *V1, ArrayRef< int > Mask, bool ForExtracts=false) [inline]
Adds one more input vector and the mask for its shuffling.
Definition at line 16361 of file SLPVectorizer.cpp.
References llvm::all_of(), assert(), llvm::cast(), llvm::enumerate(), llvm::isa(), P, llvm::PoisonMaskElem, and llvm::V1.
void add (Value *V1, Value *V2, ArrayRef< int > Mask) [inline]
Adds two input vectors and the mask for their shuffling.
Definition at line 16344 of file SLPVectorizer.cpp.
References llvm::all_of(), assert(), llvm::cast(), llvm::enumerate(), P, llvm::PoisonMaskElem, and llvm::V1.
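As a reading aid for two-input masks, the sketch below resolves each mask element to the input it selects, assuming the usual `shufflevector` convention: with inputs of `VF` lanes each, indices in `[0, VF)` pick from the first input, indices in `[VF, 2*VF)` pick from the second, and -1 (`llvm::PoisonMaskElem`) marks a poison lane. The type `LaneRef` and the function `resolveTwoSourceMask` are hypothetical names for this illustration only; whether this overload applies exactly this convention to its `Mask` argument is an assumption here.

```cpp
#include <cassert>
#include <vector>

// Mirrors llvm::PoisonMaskElem: a lane whose value is unspecified.
constexpr int kPoisonMaskElem = -1;

// Hypothetical type, not an LLVM API: where one result lane comes from.
struct LaneRef {
  int Source; // 0 = poison, 1 = first input, 2 = second input
  int Lane;   // Lane within that input, or -1 for poison.
};

bool operator==(const LaneRef &A, const LaneRef &B) {
  return A.Source == B.Source && A.Lane == B.Lane;
}

// Hypothetical helper: resolves a two-input shuffle mask over inputs that
// each have VF lanes into (source, lane) pairs.
std::vector<LaneRef> resolveTwoSourceMask(const std::vector<int> &Mask,
                                          int VF) {
  std::vector<LaneRef> Result;
  Result.reserve(Mask.size());
  for (int Idx : Mask) {
    if (Idx == kPoisonMaskElem) {
      Result.push_back(LaneRef{0, -1}); // Poison lane, no source.
      continue;
    }
    assert(Idx >= 0 && Idx < 2 * VF && "mask index out of range");
    if (Idx < VF)
      Result.push_back(LaneRef{1, Idx});      // Taken from the first input.
    else
      Result.push_back(LaneRef{2, Idx - VF}); // Taken from the second input.
  }
  return Result;
}
```

With `VF = 4`, the mask `{0, 5, -1, 3}` reads lane 0 of the first input, lane 1 of the second input, a poison lane, and lane 3 of the first input.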
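Value * adjustExtracts (const TreeEntry *E, MutableArrayRef< int > Mask, ArrayRef< std::optional< TTI::ShuffleKind > > ShuffleKinds, unsigned NumParts, bool &UseVecBaseAsInput) [inline]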
Definition at line 16161 of file SLPVectorizer.cpp.
References llvm::any_of(), llvm::ArrayRef(), and reorderScalars().
InstructionCost createFreeze (InstructionCost Cost) [inline]
Definition at line 16451 of file SLPVectorizer.cpp.
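InstructionCost finalize (ArrayRef< int > ExtMask, ArrayRef< std::pair< const TreeEntry *, unsigned > > SubVectors, ArrayRef< int > SubVectorsMask, unsigned VF=0, function_ref< void(Value *&, SmallVectorImpl< int > &, function_ref< Value *(Value *, Value *, ArrayRef< int >)>)> Action={}) [inline]
Finalizes emission of the shuffles.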
Definition at line 16453 of file SLPVectorizer.cpp.
Value * gather (ArrayRef< Value * > VL, unsigned MaskVF=0, Value *Root=nullptr) [inline]
Definition at line 16410 of file SLPVectorizer.cpp.
References assert(), llvm::ArrayRef< T >::begin(), llvm::cast(), llvm::dyn_cast(), llvm::ArrayRef< T >::end(), llvm::ArrayRef< T >::front(), llvm::ConstantVector::get(), llvm::PoisonValue::get(), llvm::UndefValue::get(), llvm::ElementCount::getFixed(), llvm::Constant::getNullValue(), llvm::Type::getScalarType(), llvm::ConstantVector::getSplat(), llvm::Value::getType(), llvm::isa(), llvm::SmallVectorTemplateBase< T, bool >::push_back(), replicateMask(), llvm::ArrayRef< T >::size(), SLPReVec, and llvm::ArrayRef< T >::take_front().
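std::optional< InstructionCost > needToDelay (const TreeEntry *, ArrayRef< SmallVector< const TreeEntry * > >) const [inline]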
Checks if the specified entry E needs to be delayed because of its dependency nodes.
Definition at line 16286 of file SLPVectorizer.cpp.
void resetForSameNode () [inline]
Resets the builder to handle a perfect diamond match.
Definition at line 16292 of file SLPVectorizer.cpp.