MyCaffe
1.12.2.41
Deep learning software for Windows C# programmers.
Specifies the parameters for the MultiHeadAttentionInterpLayer (Interpretable Multi-Head Attention Layer).
Public Member Functions
MultiHeadAttentionInterpParameter ()
Constructor for the parameter.
override object | Load (System.IO.BinaryReader br, bool bNewInstance=true)
Load the parameter from a binary reader.
override void | Copy (LayerParameterBase src)
Copy one parameter to another.
override LayerParameterBase | Clone ()
Creates a new copy of this instance of the parameter.
override RawProto | ToProto (string strName)
Convert the parameter into a RawProto.
Public Member Functions inherited from MyCaffe.param.LayerParameterBase
LayerParameterBase ()
Constructor for the parameter.
virtual string | PrepareRunModelInputs ()
This method gives derived classes a chance to specify the model inputs required by the run model.
virtual void | PrepareRunModel (LayerParameter p)
This method gives derived classes a chance to prepare the layer for a run model.
void | Save (BinaryWriter bw)
Save this parameter to a binary writer.
abstract object | Load (BinaryReader br, bool bNewInstance=true)
Load the parameter from a binary reader.
Public Member Functions inherited from MyCaffe.basecode.BaseParameter
BaseParameter ()
Constructor for the parameter.
virtual bool | Compare (BaseParameter p)
Compare this parameter to another parameter.
Static Public Member Functions
static MultiHeadAttentionInterpParameter | FromProto (RawProto rp)
Parses the parameter from a RawProto.
Static Public Member Functions inherited from MyCaffe.basecode.BaseParameter
static double | ParseDouble (string strVal)
Parse a double value, first using the US culture if the decimal separator is '.', then using the native culture, and finally falling back to the US culture to handle prototypes that contain '.' as the separator but are parsed in a culture that does not use '.' as the decimal separator.
static bool | TryParse (string strVal, out double df)
Parse a double value, first using the US culture if the decimal separator is '.', then using the native culture, and finally falling back to the US culture to handle prototypes that contain '.' as the separator but are parsed in a culture that does not use '.' as the decimal separator.
static float | ParseFloat (string strVal)
Parse a float value, first using the US culture if the decimal separator is '.', then using the native culture, and finally falling back to the US culture to handle prototypes that contain '.' as the separator but are parsed in a culture that does not use '.' as the decimal separator.
static bool | TryParse (string strVal, out float f)
Parse a float value, first using the US culture if the decimal separator is '.', then using the native culture, and finally falling back to the US culture to handle prototypes that contain '.' as the separator but are parsed in a culture that does not use '.' as the decimal separator.
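The culture fallback order described for the inherited parse helpers can be sketched as follows. This is a minimal illustration under stated assumptions, not the MyCaffe implementation: the class name `CultureTolerantParser` and the explicit `native` argument are hypothetical (the documented methods take only the string), and the invariant number format stands in for the US culture because both use '.' as the decimal separator.

```csharp
using System;
using System.Globalization;

public static class CultureTolerantParser
{
    // Hypothetical helper (not MyCaffe's code): tries '.'-based parsing first when the
    // native format already uses '.', then the native format, then '.'-based parsing again.
    public static bool TryParse(string strVal, NumberFormatInfo native, out double df)
    {
        // Assumption: the invariant format stands in for en-US ('.' decimal separator).
        NumberFormatInfo us = CultureInfo.InvariantCulture.NumberFormat;

        // NumberStyles.Float excludes group separators, so "1.5" is not read as 15
        // under a format whose group separator is '.'.
        const NumberStyles style = NumberStyles.Float;

        if (native.NumberDecimalSeparator == "." &&
            double.TryParse(strVal, style, us, out df))
            return true;

        if (double.TryParse(strVal, style, native, out df))
            return true;

        // Last resort: a '.'-separated value read under a non-'.' native format.
        return double.TryParse(strVal, style, us, out df);
    }
}
```

Passing the number format explicitly keeps the sketch deterministic; a caller mirroring the documented behavior would likely pass `CultureInfo.CurrentCulture.NumberFormat`.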
Properties
bool | enable_self_attention [get, set]
Specifies whether to enable self-attention (one input, default = true).
uint | num_historical_steps [get, set]
Specifies the number of historical steps.
uint | num_future_steps [get, set]
Specifies the number of future steps.
uint | embed_dim [get, set]
Specifies the state size corresponding to both the input and output sizes.
uint | num_heads [get, set]
Specifies the number of attention heads used in the multi-head attention.
bool | enable_noise [get, set]
Enable/disable noise in the inner-product layer (default = false).
double | sigma_init [get, set]
Specifies the initialization value for the sigma weight and sigma bias used when 'enable_noise' = true.
FillerParameter | weight_filler [get, set]
The filler for the weights.
FillerParameter | bias_filler [get, set]
The filler for the bias.
Additional Inherited Members
Public Types inherited from MyCaffe.param.LayerParameterBase
enum | LABEL_TYPE { NONE , SINGLE , MULTIPLE , ONLY_ONE }
Defines the label type.
Specifies the parameters for the MultiHeadAttentionInterpLayer (Interpretable Multi-Head Attention Layer).
The Multi-Head Attention layer learns long-term relationships across different time steps. This version of the layer is modified to enhance explainability: the 'values' signal is shared across all heads, and the heads are combined by additive aggregation. According to the paper by Lim et al., each head can learn different temporal patterns while attending to a common set of input features. This can be interpreted as a simple ensemble over attention weights into a combined matrix which, compared to the original multi-head attention matrix, yields increased representation capacity in an efficient way.
Definition at line 27 of file MultiHeadAttentionInterpParameter.cs.
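The shared-value, aggregated-head idea described above can be sketched in plain C#. This is an illustrative sketch only, not the layer's implementation: the class and method names are hypothetical, the per-head queries and keys are assumed to be already projected, the aggregation is taken to be a simple average over heads (one form of additive aggregation), and the final output projection is omitted.

```csharp
using System;

// Illustrative sketch only; names and shapes are assumptions, not MyCaffe's implementation.
public static class InterpretableAttentionSketch
{
    // Scaled dot-product attention weights: row-wise softmax(Q K^T / sqrt(d)).
    public static double[][] AttentionWeights(double[][] q, double[][] k)
    {
        int d = q[0].Length;
        var weights = new double[q.Length][];
        for (int i = 0; i < q.Length; i++)
        {
            var row = new double[k.Length];
            double max = double.NegativeInfinity;
            for (int j = 0; j < k.Length; j++)
            {
                double dot = 0.0;
                for (int c = 0; c < d; c++)
                    dot += q[i][c] * k[j][c];
                row[j] = dot / Math.Sqrt(d);
                max = Math.Max(max, row[j]);
            }
            double sum = 0.0;
            for (int j = 0; j < k.Length; j++)
            {
                row[j] = Math.Exp(row[j] - max); // subtract the max for numerical stability
                sum += row[j];
            }
            for (int j = 0; j < k.Length; j++)
                row[j] /= sum;
            weights[i] = row;
        }
        return weights;
    }

    // Averages the per-head weights and applies the averaged matrix to the
    // single value matrix shared by all heads.
    public static double[][] Combine(double[][][] headQ, double[][][] headK,
                                     double[][] sharedV, out double[][] avgWeights)
    {
        int heads = headQ.Length;
        int nQ = headQ[0].Length;
        int nK = headK[0].Length;
        avgWeights = new double[nQ][];
        for (int i = 0; i < nQ; i++)
            avgWeights[i] = new double[nK];

        for (int h = 0; h < heads; h++)
        {
            double[][] w = AttentionWeights(headQ[h], headK[h]);
            for (int i = 0; i < nQ; i++)
                for (int j = 0; j < nK; j++)
                    avgWeights[i][j] += w[i][j] / heads;
        }

        int dV = sharedV[0].Length;
        var output = new double[nQ][];
        for (int i = 0; i < nQ; i++)
        {
            output[i] = new double[dV];
            for (int j = 0; j < nK; j++)
                for (int c = 0; c < dV; c++)
                    output[i][c] += avgWeights[i][j] * sharedV[j][c];
        }
        return output;
    }
}
```

Because every head weights the same value matrix, the averaged matrix returned through `avgWeights` is a single attention pattern per query that can be inspected directly, which is the interpretability benefit the description refers to.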
MyCaffe.param.tft.MultiHeadAttentionInterpParameter.MultiHeadAttentionInterpParameter ( )
Constructor for the parameter.
Definition at line 40 of file MultiHeadAttentionInterpParameter.cs.
override LayerParameterBase MyCaffe.param.tft.MultiHeadAttentionInterpParameter.Clone ( ) [virtual]
Creates a new copy of this instance of the parameter.
Implements MyCaffe.param.LayerParameterBase.
Definition at line 174 of file MultiHeadAttentionInterpParameter.cs.
override void MyCaffe.param.tft.MultiHeadAttentionInterpParameter.Copy (LayerParameterBase src) [virtual]
Copy one parameter to another.
src | Specifies the parameter to copy. |
Implements MyCaffe.param.LayerParameterBase.
Definition at line 152 of file MultiHeadAttentionInterpParameter.cs.
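static MultiHeadAttentionInterpParameter MyCaffe.param.tft.MultiHeadAttentionInterpParameter.FromProto (RawProto rp) [static]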
Parses the parameter from a RawProto.
rp | Specifies the RawProto to parse. |
Definition at line 217 of file MultiHeadAttentionInterpParameter.cs.
override object MyCaffe.param.tft.MultiHeadAttentionInterpParameter.Load (System.IO.BinaryReader br, bool bNewInstance = true)
Load the parameter from a binary reader.
br | Specifies the binary reader. |
bNewInstance | When true a new instance is created (the default), otherwise the existing instance is loaded from the binary reader. |
Definition at line 140 of file MultiHeadAttentionInterpParameter.cs.
override RawProto MyCaffe.param.tft.MultiHeadAttentionInterpParameter.ToProto (string strName) [virtual]
Convert the parameter into a RawProto.
strName | Specifies the name to associate with the RawProto. |
Implements MyCaffe.basecode.BaseParameter.
Definition at line 186 of file MultiHeadAttentionInterpParameter.cs.
FillerParameter MyCaffe.param.tft.MultiHeadAttentionInterpParameter.bias_filler [get, set]
The filler for the bias.
Definition at line 133 of file MultiHeadAttentionInterpParameter.cs.
uint MyCaffe.param.tft.MultiHeadAttentionInterpParameter.embed_dim [get, set]
Specifies the state size corresponding to both the input and output sizes.
Definition at line 78 of file MultiHeadAttentionInterpParameter.cs.
bool MyCaffe.param.tft.MultiHeadAttentionInterpParameter.enable_noise [get, set]
Enable/disable noise in the inner-product layer (default = false).
When enabled, noise is only used during the training phase.
Definition at line 101 of file MultiHeadAttentionInterpParameter.cs.
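The source states only that noise applies to the inner-product layer, that it is used during training only, and that sigma values are initialized from 'sigma_init'. The sketch below assumes one plausible form: Gaussian perturbations scaled by per-weight sigma terms. The class name, method signature, and the Gaussian form are all assumptions for illustration, not the layer's confirmed behavior.

```csharp
using System;

// Hypothetical sketch; the noise form is an assumption, not MyCaffe's confirmed behavior.
public static class NoisyInnerProductSketch
{
    // Computes y = sum_i (w_i + sigmaW_i * epsW_i) * x_i + (b + sigmaB * epsB),
    // where the noise terms are applied only when training is true.
    public static double Forward(double[] x, double[] w, double b,
                                 double[] sigmaW, double sigmaB,
                                 bool training, Random rng)
    {
        double y = b;
        if (training)
            y += sigmaB * NextGaussian(rng);

        for (int i = 0; i < x.Length; i++)
        {
            double wi = w[i];
            if (training)
                wi += sigmaW[i] * NextGaussian(rng);
            y += wi * x[i];
        }
        return y;
    }

    // Standard normal sample via the Box-Muller transform.
    static double NextGaussian(Random rng)
    {
        double u1 = 1.0 - rng.NextDouble(); // in (0, 1], avoids log(0)
        double u2 = rng.NextDouble();
        return Math.Sqrt(-2.0 * Math.Log(u1)) * Math.Cos(2.0 * Math.PI * u2);
    }
}
```

In this reading, every entry of `sigmaW` and `sigmaB` would start at the 'sigma_init' value, and the non-training path reduces to an ordinary inner product.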
bool MyCaffe.param.tft.MultiHeadAttentionInterpParameter.enable_self_attention [get, set]
Specifies whether to enable self-attention (one input, default = true).
Definition at line 48 of file MultiHeadAttentionInterpParameter.cs.
uint MyCaffe.param.tft.MultiHeadAttentionInterpParameter.num_future_steps [get, set]
Specifies the number of future steps.
Definition at line 68 of file MultiHeadAttentionInterpParameter.cs.
uint MyCaffe.param.tft.MultiHeadAttentionInterpParameter.num_heads [get, set]
Specifies the number of attention heads used in the multi-head attention.
Definition at line 88 of file MultiHeadAttentionInterpParameter.cs.
uint MyCaffe.param.tft.MultiHeadAttentionInterpParameter.num_historical_steps [get, set]
Specifies the number of historical steps.
Definition at line 58 of file MultiHeadAttentionInterpParameter.cs.
double MyCaffe.param.tft.MultiHeadAttentionInterpParameter.sigma_init [get, set]
Specifies the initialization value for the sigma weight and sigma bias used when 'enable_noise' = true.
Definition at line 111 of file MultiHeadAttentionInterpParameter.cs.
FillerParameter MyCaffe.param.tft.MultiHeadAttentionInterpParameter.weight_filler [get, set]
The filler for the weights.
Definition at line 122 of file MultiHeadAttentionInterpParameter.cs.