Explanations
phrases related to specific conditions or restrictions
Negative Logits
gridx        -0.46
adicionais   -0.45
referenties  -0.44
Empty        -0.44
ետ           -0.43
quedado      -0.43
地方         -0.43
topper       -0.43
vuoto        -0.42
parte        -0.42
Positive Logits
confines     1.30
bounds       1.02
framework    0.99
boundaries   0.92
scope        0.91
limits       0.89
purview      0.85
walls        0.81
phạm         0.80
Within       0.77
Activations Density 0.177%