INDEX
Explanations
numbers or position-related words
the indefinite article "a" and its variants in various contexts
New Auto-Interp
Negative Logits
âĢİ
-0.81
Links
-0.77
letters
-0.76
forth
-0.72
reports
-0.72
evidence
-0.71
FontSize
-0.71
Sources
-0.71
Reports
-0.69
Contents
-0.66
POSITIVE LOGITS
rouse
1.10
bunch
1.06
lot
1.02
person
1.01
uras
1.00
particular
0.97
multitude
0.88
dime
0.87
stranger
0.87
single
0.86
Activations Density 0.681%