Explanations
prepositions or prepositional phrases related to relationships between objects or actions
prepositions and phrases related to locations or conditions within contexts
Negative Logits
çͰ    -0.94
NES    -0.83
é¾įåĸļ士    -0.81
ãĤ«    -0.76
Ô    -0.75
ENTS    -0.74
iHUD    -0.73
IFE    -0.73
BUG    -0.72
00007    -0.72
Positive Logits
various    0.95
different    0.90
specific    0.82
selectively    0.80
varying    0.75
afar    0.73
themselves    0.70
outed    0.70
other    0.69
either    0.68
Activations Density 0.504%