MyCaffe  1.12.2.41
Deep learning software for Windows C# programmers.
MyCaffe.param.gpt.CausalSelfAttentionParameter Class Reference

Specifies the parameters for the CausalSelfAttentionLayer.

Inheritance diagram for MyCaffe.param.gpt.CausalSelfAttentionParameter (image not shown); classes in the diagram:
MyCaffe.param.LayerParameterBase, MyCaffe.basecode.BaseParameter, MyCaffe.basecode.IBinaryPersist

Public Member Functions

 CausalSelfAttentionParameter ()
 Constructor for the parameter.
 
override object Load (System.IO.BinaryReader br, bool bNewInstance=true)
 Load the parameter from a binary reader.
 
override void Copy (LayerParameterBase src)
 Copy one parameter to another.
 
override LayerParameterBase Clone ()
 Creates a new copy of this instance of the parameter.
 
override RawProto ToProto (string strName)
 Convert the parameter into a RawProto.
 
- Public Member Functions inherited from MyCaffe.param.LayerParameterBase
 LayerParameterBase ()
 Constructor for the parameter.
 
virtual string PrepareRunModelInputs ()
 This method gives derived classes a chance to specify the model inputs required by the run model.
 
virtual void PrepareRunModel (LayerParameter p)
 This method gives derived classes a chance to prepare the layer for a run model.
 
void Save (BinaryWriter bw)
 Save this parameter to a binary writer.
 
abstract object Load (BinaryReader br, bool bNewInstance=true)
 Load the parameter from a binary reader.
 
- Public Member Functions inherited from MyCaffe.basecode.BaseParameter
 BaseParameter ()
 Constructor for the parameter.
 
virtual bool Compare (BaseParameter p)
 Compare this parameter to another parameter.
 

Static Public Member Functions

static CausalSelfAttentionParameter FromProto (RawProto rp)
 Parses the parameter from a RawProto.
 
- Static Public Member Functions inherited from MyCaffe.basecode.BaseParameter
static double ParseDouble (string strVal)
 Parse a double value, first using the US culture when the native decimal separator is '.', then using the native culture, and finally falling back to the US culture. The fallback handles prototypes that use '.' as the decimal separator but are parsed under a culture that does not.
 
static bool TryParse (string strVal, out double df)
 Try to parse a double value, first using the US culture when the native decimal separator is '.', then using the native culture, and finally falling back to the US culture. The fallback handles prototypes that use '.' as the decimal separator but are parsed under a culture that does not.
 
static float ParseFloat (string strVal)
 Parse a float value, first using the US culture when the native decimal separator is '.', then using the native culture, and finally falling back to the US culture. The fallback handles prototypes that use '.' as the decimal separator but are parsed under a culture that does not.
 
static bool TryParse (string strVal, out float f)
 Try to parse a float value, first using the US culture when the native decimal separator is '.', then using the native culture, and finally falling back to the US culture. The fallback handles prototypes that use '.' as the decimal separator but are parsed under a culture that does not.
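
The culture fallback described for the parse helpers above can be sketched as follows. This is a minimal illustration of the documented order of attempts, not the MyCaffe implementation: the class name CultureFallbackSketch, the method TryParseDouble and its explicit native culture argument are hypothetical (added so the behavior can be exercised without changing the thread culture), and CultureInfo.InvariantCulture stands in for the US culture because both use '.' as the decimal separator.

```csharp
using System;
using System.Globalization;

// Hypothetical sketch of a culture-fallback parse; not the MyCaffe source.
public static class CultureFallbackSketch
{
    // Stand-in for the US culture ('.' decimal separator); an assumption of this sketch.
    static readonly CultureInfo s_dotCulture = CultureInfo.InvariantCulture;

    // NumberStyles.Float is used so that a thousands separator is never
    // silently accepted in place of a decimal separator.
    public static bool TryParseDouble(string strVal, CultureInfo native, out double df)
    {
        // 1. If the native culture already uses '.', try the '.' culture first.
        if (native.NumberFormat.NumberDecimalSeparator == "." &&
            double.TryParse(strVal, NumberStyles.Float, s_dotCulture, out df))
            return true;

        // 2. Try the native culture (for example ',' as the decimal separator).
        if (double.TryParse(strVal, NumberStyles.Float, native, out df))
            return true;

        // 3. Last resort: the '.' culture, for prototypes written with '.'
        //    but parsed on a machine whose culture uses another separator.
        return double.TryParse(strVal, NumberStyles.Float, s_dotCulture, out df);
    }
}
```

Under these assumptions, a value written as 0.1 and a value written as 0,1 both parse successfully when the native culture uses ',' as its decimal separator: the first through step 3 and the second through step 2.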
 

Properties

uint layers [get, set]
 The number of layers (transformer blocks) used.
 
uint heads [get, set]
 The number of heads used.
 
uint embed [get, set]
 Specifies the size of the embedding.
 
uint block_size [get, set]
 Specifies the size of the block.
 
double attn_dropout [get, set]
 Specifies the dropout probability used on the attention weights.
 
double resid_dropout [get, set]
 Specifies the dropout probability used on the residual weights.
 

Additional Inherited Members

- Public Types inherited from MyCaffe.param.LayerParameterBase
enum  LABEL_TYPE { NONE , SINGLE , MULTIPLE , ONLY_ONE }
 Defines the label type.
 

Detailed Description

Specifies the parameters for the CausalSelfAttentionLayer.

Definition at line 15 of file CausalSelfAttentionParameter.cs.

Constructor & Destructor Documentation

◆ CausalSelfAttentionParameter()

MyCaffe.param.gpt.CausalSelfAttentionParameter.CausalSelfAttentionParameter ( )

Constructor for the parameter.

Definition at line 25 of file CausalSelfAttentionParameter.cs.

Member Function Documentation

◆ Clone()

override LayerParameterBase MyCaffe.param.gpt.CausalSelfAttentionParameter.Clone ( )
virtual

Creates a new copy of this instance of the parameter.

Returns
A new instance of this parameter is returned.

Implements MyCaffe.param.LayerParameterBase.

Definition at line 111 of file CausalSelfAttentionParameter.cs.

◆ Copy()

override void MyCaffe.param.gpt.CausalSelfAttentionParameter.Copy ( LayerParameterBase  src)
virtual

Copy one parameter to another.

Parameters
src    Specifies the parameter to copy.

Implements MyCaffe.param.LayerParameterBase.

Definition at line 98 of file CausalSelfAttentionParameter.cs.
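
As an illustration of how Clone() and Copy() differ in use, the sketch below relies only on the signatures listed on this page. The class CloneCopyExample and its method names are hypothetical, and the code assumes the MyCaffe assembly is referenced by the project.

```csharp
using MyCaffe.param.gpt;

// Hypothetical helper; requires a reference to the MyCaffe assembly.
public static class CloneCopyExample
{
    // Clone() is declared to return LayerParameterBase, so the result
    // is cast back to the concrete parameter type.
    public static CausalSelfAttentionParameter Duplicate(CausalSelfAttentionParameter src)
    {
        return (CausalSelfAttentionParameter)src.Clone();
    }

    // Copy() writes the settings of 'src' into an already existing instance.
    public static void Overwrite(CausalSelfAttentionParameter dst, CausalSelfAttentionParameter src)
    {
        dst.Copy(src);
    }
}
```

Clone() suits cases where an independent instance is wanted, while Copy() suits cases where other code already holds a reference to the destination instance.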

◆ FromProto()

static CausalSelfAttentionParameter MyCaffe.param.gpt.CausalSelfAttentionParameter.FromProto ( RawProto  rp)
static

Parses the parameter from a RawProto.

Parameters
rp    Specifies the RawProto to parse.
Returns
A new instance of the parameter is returned.

Definition at line 142 of file CausalSelfAttentionParameter.cs.

◆ Load()

override object MyCaffe.param.gpt.CausalSelfAttentionParameter.Load ( System.IO.BinaryReader  br, bool  bNewInstance = true )

Load the parameter from a binary reader.

Parameters
br              Specifies the binary reader.
bNewInstance    When true a new instance is created (the default), otherwise the existing instance is loaded from the binary reader.
Returns
Returns an instance of the parameter.

Definition at line 86 of file CausalSelfAttentionParameter.cs.
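
A binary round trip through the inherited Save() and this Load() override might look like the sketch below. It assumes that Load() reads exactly what Save() writes, which this page implies but does not state; the class BinaryRoundTripExample is hypothetical, and the MyCaffe assembly must be referenced.

```csharp
using System.IO;
using MyCaffe.param.gpt;

// Hypothetical helper; requires a reference to the MyCaffe assembly.
public static class BinaryRoundTripExample
{
    public static CausalSelfAttentionParameter RoundTrip(CausalSelfAttentionParameter p)
    {
        using (MemoryStream ms = new MemoryStream())
        {
            // Write the parameter into the in-memory stream.
            BinaryWriter bw = new BinaryWriter(ms);
            p.Save(bw);
            bw.Flush();

            // Rewind and read it back; bNewInstance = true requests a new instance.
            ms.Position = 0;
            BinaryReader br = new BinaryReader(ms);
            object loaded = new CausalSelfAttentionParameter().Load(br, true);
            return (CausalSelfAttentionParameter)loaded;
        }
    }
}
```

Because Load() is an instance method, a temporary instance is created only to invoke it; with bNewInstance set to true, the returned object is the one to keep.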

◆ ToProto()

override RawProto MyCaffe.param.gpt.CausalSelfAttentionParameter.ToProto ( string  strName)
virtual

Convert the parameter into a RawProto.

Parameters
strName    Specifies the name to associate with the RawProto.
Returns
The new RawProto is returned.

Implements MyCaffe.basecode.BaseParameter.

Definition at line 123 of file CausalSelfAttentionParameter.cs.
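
ToProto() and FromProto() are natural counterparts, so a round trip can be sketched as below. The proto name "causal_self_attention_param" is an assumption chosen for illustration, RawProto is assumed to live in the MyCaffe.basecode namespace, and the class ProtoRoundTripExample is hypothetical. The MyCaffe assembly must be referenced.

```csharp
using MyCaffe.basecode;
using MyCaffe.param.gpt;

// Hypothetical helper; requires a reference to the MyCaffe assembly.
public static class ProtoRoundTripExample
{
    public static CausalSelfAttentionParameter RoundTrip(CausalSelfAttentionParameter p)
    {
        // The name given to the RawProto is an illustrative assumption.
        RawProto proto = p.ToProto("causal_self_attention_param");

        // FromProto is static and returns a new parameter instance.
        return CausalSelfAttentionParameter.FromProto(proto);
    }
}
```

If the round trip preserves every property, the inherited Compare() method would be one way to check the result against the original, assuming it compares parameter contents.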

Property Documentation

◆ attn_dropout

double MyCaffe.param.gpt.CausalSelfAttentionParameter.attn_dropout
get, set

Specifies the dropout probability used on the attention weights.

Definition at line 70 of file CausalSelfAttentionParameter.cs.

◆ block_size

uint MyCaffe.param.gpt.CausalSelfAttentionParameter.block_size
get, set

Specifies the size of the block.

Definition at line 61 of file CausalSelfAttentionParameter.cs.

◆ embed

uint MyCaffe.param.gpt.CausalSelfAttentionParameter.embed
get, set

Specifies the size of the embedding.

Definition at line 52 of file CausalSelfAttentionParameter.cs.

◆ heads

uint MyCaffe.param.gpt.CausalSelfAttentionParameter.heads
get, set

The number of heads used.

Definition at line 43 of file CausalSelfAttentionParameter.cs.

◆ layers

uint MyCaffe.param.gpt.CausalSelfAttentionParameter.layers
get, set

The number of layers (transformer blocks) used.

Definition at line 33 of file CausalSelfAttentionParameter.cs.

◆ resid_dropout

double MyCaffe.param.gpt.CausalSelfAttentionParameter.resid_dropout
get, set

Specifies the dropout probability used on the residual weights.

Definition at line 79 of file CausalSelfAttentionParameter.cs.
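
The properties documented above could be set as in the following sketch. All numeric values are illustrative assumptions, not library defaults, and the class AttentionConfigExample is hypothetical. The HeadSize helper reflects the usual multi-head attention convention that the embedding is split evenly across heads; whether MyCaffe enforces this is not stated on this page. The MyCaffe assembly must be referenced.

```csharp
using System;
using MyCaffe.param.gpt;

// Hypothetical helper; requires a reference to the MyCaffe assembly.
public static class AttentionConfigExample
{
    // Builds a parameter using illustrative (assumed) values.
    public static CausalSelfAttentionParameter Create()
    {
        CausalSelfAttentionParameter p = new CausalSelfAttentionParameter();
        p.layers = 6;          // number of transformer blocks
        p.heads = 6;           // number of attention heads
        p.embed = 192;         // embedding size
        p.block_size = 128;    // block size
        p.attn_dropout = 0.1;  // dropout on the attention weights
        p.resid_dropout = 0.1; // dropout on the residual weights
        return p;
    }

    // Per-head width under the common convention embed / heads.
    // The divisibility check is an assumption of this sketch.
    public static uint HeadSize(CausalSelfAttentionParameter p)
    {
        if (p.heads == 0 || p.embed % p.heads != 0)
            throw new ArgumentException("embed should be a positive multiple of heads.");
        return p.embed / p.heads;
    }
}
```

With the values above, each head would work on 192 / 6 = 32 embedding dimensions.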


The documentation for this class was generated from the following file: