Explanations
phrases related to movement or physical proximity
the word "around."
Negative Logits
BT            -0.68
qua           -0.68
tatt          -0.62
Wr            -0.61
idy           -0.58
BILL          -0.57
Shock         -0.57
Pl            -0.56
Architects    -0.56
603           -0.55
Positive Logits
abouts        0.87
ciating       0.86
ouver         0.78
rocal         0.75
corners       0.74
apons         0.72
..........    0.72
ruciating     0.72
atform        0.70
ento          0.68
Activations Density 0.040%