INDEX
Explanations
expressions of belief and trust
New Auto-Interp
Negative Logits
EndContext
-0.53
Reminds
-0.47
enterOuterAlt
-0.44
quec
-0.43
relative
-0.43
catel
-0.43
Relative
-0.43
cella
-0.42
relative
-0.42
Rit
-0.41
POSITIVE LOGITS
wholeheartedly
0.57
implicitly
0.56
strongly
0.55
fuertemente
0.51
firmly
0.51
firme
0.50
Strongly
0.48
Strongly
0.45
뀜
0.44
featureID
0.43
Activations Density 0.185%