INDEX
Explanations
phrases indicating contrast or alternatives
New Auto-Interp
Negative Logits
Portail
-0.85
"));
-0.76
'))
-0.68
GAO
-0.67
firebaseConfig
-0.66
}`
-0.65
-0.65
ögon
-0.64
"));
-0.63
visor
-0.63
POSITIVE LOGITS
Instead
1.12
Instead
1.05
instead
1.01
instead
0.92
Rather
0.86
rather
0.82
Rather
0.78
uttosto
0.76
Statt
0.72
statt
0.68
Activations Density 0.148%