INDEX
Explanations
phrases emphasizing possession or existence in relation to people, places, or events
New Auto-Interp
Negative Logits
asl
-0.19
odash
-0.14
Fant
-0.14
anno
-0.13
icia
-0.13
734
-0.13
ius
-0.13
rouw
-0.13
began
-0.13
ãĥ³ãĥĢ
-0.13
POSITIVE LOGITS
long
0.21
lain
0.20
weather
0.19
stood
0.19
served
0.19
lain
0.19
hosted
0.18
existed
0.17
held
0.17
fascinated
0.17
Activations Density 0.085%