![]() |
MyCaffe
1.12.2.41
Deep learning software for Windows C# programmers.
|
Specifies the parameters for the MultiHeadAttentionInterpLayer (Interpretable Multi-Head Attention Layer).
More...
Public Member Functions | |
| MultiHeadAttentionInterpParameter () | |
| Constructor for the parameter. More... | |
| override object | Load (System.IO.BinaryReader br, bool bNewInstance=true) |
| Load the parameter from a binary reader. More... | |
| override void | Copy (LayerParameterBase src) |
| Copy on parameter to another. More... | |
| override LayerParameterBase | Clone () |
| Creates a new copy of this instance of the parameter. More... | |
| override RawProto | ToProto (string strName) |
| Convert the parameter into a RawProto. More... | |
Public Member Functions inherited from MyCaffe.param.LayerParameterBase | |
| LayerParameterBase () | |
| Constructor for the parameter. More... | |
| virtual string | PrepareRunModelInputs () |
| This method gives derivative classes a chance specify model inputs required by the run model. More... | |
| virtual void | PrepareRunModel (LayerParameter p) |
| This method gives derivative classes a chance to prepare the layer for a run-model. More... | |
| void | Save (BinaryWriter bw) |
| Save this parameter to a binary writer. More... | |
| abstract object | Load (BinaryReader br, bool bNewInstance=true) |
| Load the parameter from a binary reader. More... | |
Public Member Functions inherited from MyCaffe.basecode.BaseParameter | |
| BaseParameter () | |
| Constructor for the parameter. More... | |
| virtual bool | Compare (BaseParameter p) |
| Compare this parameter to another parameter. More... | |
Static Public Member Functions | |
| static MultiHeadAttentionInterpParameter | FromProto (RawProto rp) |
| Parses the parameter from a RawProto. More... | |
Static Public Member Functions inherited from MyCaffe.basecode.BaseParameter | |
| static double | ParseDouble (string strVal) |
| Parse double values using the US culture if the decimal separator = '.', then using the native culture, and if then lastly trying the US culture to handle prototypes containing '.' as the separator, yet parsed in a culture that does not use '.' as a decimal. More... | |
| static bool | TryParse (string strVal, out double df) |
| Parse double values using the US culture if the decimal separator = '.', then using the native culture, and if then lastly trying the US culture to handle prototypes containing '.' as the separator, yet parsed in a culture that does not use '.' as a decimal. More... | |
| static float | ParseFloat (string strVal) |
| Parse float values using the US culture if the decimal separator = '.', then using the native culture, and if then lastly trying the US culture to handle prototypes containing '.' as the separator, yet parsed in a culture that does not use '.' as a decimal. More... | |
| static bool | TryParse (string strVal, out float f) |
| Parse doufloatble values using the US culture if the decimal separator = '.', then using the native culture, and if then lastly trying the US culture to handle prototypes containing '.' as the separator, yet parsed in a culture that does not use '.' as a decimal. More... | |
Properties | |
| bool | enable_self_attention [getset] |
| Specifies to enable self attention (one input, default = true). More... | |
| uint | num_historical_steps [getset] |
| Specifies the number of historical steps More... | |
| uint | num_future_steps [getset] |
| Specifies the number of future steps More... | |
| uint | embed_dim [getset] |
| Specifies the state size corresponding to both the input and output sizes. More... | |
| uint | num_heads [getset] |
| Specifies number of attention heads used in the multi-attention. More... | |
| bool | enable_noise [getset] |
| Enable/disable noise in the inner-product layer (default = false). More... | |
| double | sigma_init [getset] |
| Specifies the initialization value for the sigma weight and sigma bias used when 'enable_noise' = true. More... | |
| FillerParameter | weight_filler [getset] |
| The filler for the weights. More... | |
| FillerParameter | bias_filler [getset] |
| The filler for the bias. More... | |
Additional Inherited Members | |
Public Types inherited from MyCaffe.param.LayerParameterBase | |
| enum | LABEL_TYPE { NONE , SINGLE , MULTIPLE , ONLY_ONE } |
| Defines the label type. More... | |
Specifies the parameters for the MultiHeadAttentionInterpLayer (Interpretable Multi-Head Attention Layer).
The Multi-Headed Attention layer learns long-term relationships across different time-steps. This version of the layer is modified to enhance explainability. On this modification, the 'values' signal is shared across all heads - the additive aggregation is employed across all heads. According to the paper by Lim et al., each head can learn different temporal patterns, while attending to a common set of input features which can be interpreted as a simple ensemble over attention weights into a combined matrix, which compared to the original multi-head attention matrix, yields an increased representation capacity in an efficient way.
Definition at line 27 of file MultiHeadAttentionInterpParameter.cs.
| MyCaffe.param.tft.MultiHeadAttentionInterpParameter.MultiHeadAttentionInterpParameter | ( | ) |
Constructor for the parameter.
Definition at line 40 of file MultiHeadAttentionInterpParameter.cs.
|
virtual |
Creates a new copy of this instance of the parameter.
Implements MyCaffe.param.LayerParameterBase.
Definition at line 174 of file MultiHeadAttentionInterpParameter.cs.
|
virtual |
Copy on parameter to another.
| src | Specifies the parameter to copy. |
Implements MyCaffe.param.LayerParameterBase.
Definition at line 152 of file MultiHeadAttentionInterpParameter.cs.
|
static |
Parses the parameter from a RawProto.
| rp | Specifies the RawProto to parse. |
Definition at line 217 of file MultiHeadAttentionInterpParameter.cs.
| override object MyCaffe.param.tft.MultiHeadAttentionInterpParameter.Load | ( | System.IO.BinaryReader | br, |
| bool | bNewInstance = true |
||
| ) |
Load the parameter from a binary reader.
| br | Specifies the binary reader. |
| bNewInstance | When true a new instance is created (the default), otherwise the existing instance is loaded from the binary reader. |
Definition at line 140 of file MultiHeadAttentionInterpParameter.cs.
|
virtual |
Convert the parameter into a RawProto.
| strName | Specifies the name to associate with the RawProto. |
Implements MyCaffe.basecode.BaseParameter.
Definition at line 186 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
The filler for the bias.
Definition at line 133 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Specifies the state size corresponding to both the input and output sizes.
Definition at line 78 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Enable/disable noise in the inner-product layer (default = false).
When enabled, noise is only used during the training phase.
Definition at line 101 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Specifies to enable self attention (one input, default = true).
Definition at line 48 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Specifies the number of future steps
Definition at line 68 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Specifies number of attention heads used in the multi-attention.
Definition at line 88 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Specifies the number of historical steps
Definition at line 58 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
Specifies the initialization value for the sigma weight and sigma bias used when 'enable_noise' = true.
Definition at line 111 of file MultiHeadAttentionInterpParameter.cs.
|
getset |
The filler for the weights.
Definition at line 122 of file MultiHeadAttentionInterpParameter.cs.