INDEX
Explanations
requests or commands for action
Negative Logits
NCP         -0.52
PDS         -0.48
mountain    -0.45
GSC         -0.45
Mountain    -0.44
Component   -0.43
ICM         -0.43
Karak       -0.43
talon       -0.43
imedes      -0.42
Positive Logits
please      0.92
Please      0.90
please      0.88
Please      0.86
PLEASE      0.77
Bitte       0.76
Bitte       0.75
PLEASE      0.73
bitte       0.67
Kindly      0.64
Activations Density 0.052%