INDEX
Explanations
terminology related to academic research and analysis methods
preceding "is" or "are"
academic and research papers
the beginning of documents or major text sections.
Negative Logits
(                 -0.58
("                -0.54
—                 -0.52
('                -0.52
(“                -0.46
(‘                -0.45
—                 -0.45
vanguardia        -0.43
(`                -0.42
znamen            -0.42
Positive Logits
itſelf            1.16
itself            1.01
myſelf            1.00
himſelf           0.87
itself            0.85
ⓧ                 0.84
purpoſe           0.81
KURZBESCHREIBUNG  0.80
poffe             0.79
Majefty           0.79
Activations Density 0.525%