INDEX
Explanations
references to specific names or locations
names and proper nouns
New Auto-Interp
Negative Logits
Canaver
-0.32
¶
-0.29
galitarian
-0.28
estern
-0.27
osate
-0.27
âĺħ
-0.27
Nib
-0.26
Examination
-0.25
Scientology
-0.25
Esper
-0.25
POSITIVE LOGITS
)."
0.44
'."
0.43
]."
0.42
.).
0.40
.'"
0.40
.")
0.39
anwhile
0.33
").
0.33
};
0.32
').
0.32
Activations Density 7.136%