Explanations
references to individuals and their affiliations or actions in a narrative context
Negative Logits
ibar        -0.18
abar        -0.17
acci        -0.17
£           -0.16
ãģłãģ£ãģ¦   -0.16
jerne       -0.16
pell        -0.15
rella       -0.15
allery      -0.15
Nim         -0.15
Positive Logits
Pending     0.16
Pending     0.14
338         0.14
eam         0.14
Photograph  0.14
ıcı         0.14
iglia       0.13
ÙĪÙĩ        0.13
prop        0.13
âĸ¼         0.13
Activations Density: 0.069%