INDEX
Explanations
proper nouns related to characters in movies or other fictional media
the word "in" and its frequent occurrences in various contexts
Negative Logits
jri            -0.73
$$$$           -0.73
incial         -0.73
ratulations    -0.72
isan           -0.71
llor           -0.70
clust          -0.68
incumb         -0.66
landlords      -0.65
ollah          -0.65
Positive Logits
verted          1.03
Episode         1.02
disguise        1.00
animate         0.99
lieu            0.98
Ghostbusters    0.96
Mortal          0.96
flashbacks      0.93
Fantastic       0.92
spite           0.88
Activations Density: 0.168%