Explanations
references to game components and services
Negative Logits
Token              Logit
pre                -0.64
ho                 -0.62
bo                 -0.60
(no token shown)   -0.56
ung                -0.55
front              -0.54
ré                 -0.53
ser                -0.52
critical           -0.52
r                  -0.52
Positive Logits
Token              Logit
feroit             1.04
auroit             1.00
myſelf             0.99
,:);               0.95
ainfi              0.93
iconque            0.92
mxArray            0.90
Roskov             0.89
avoient            0.89
particuliers       0.87
Activations Density: 0.036%