INDEX
Explanations
references to political entities and conflicts
proper nouns related to countries, political entities, and organizations
New Auto-Interp
Negative Logits
partName
-0.65
âĢº
-0.64
Sym
-0.64
ãĥ£
-0.61
\":
-0.59
<<
-0.58
pmwiki
-0.54
rencies
-0.53
aj
-0.52
Morty
-0.51
POSITIVE LOGITS
embassy
0.64
artney
0.60
ledged
0.59
erent
0.57
usalem
0.57
himself
0.57
agine
0.56
abroad
0.56
ensis
0.56
inges
0.54
Activations Density 0.733%