INDEX
Explanations
numeric values with decimal points
articles and the usage of the word "a" in various contexts
Negative Logits
Token      Logit
evidence   -0.66
Jagu       -0.62
Vaugh      -0.62
grounds    -0.57
eyes       -0.56
iments     -0.56
Eag        -0.56
ographs    -0.56
acebook    -0.56
Edit       -0.56

Positive Logits
Token      Logit
lot         1.00
bunch       0.90
couple      0.82
handful     0.78
few         0.76
huge        0.76
glimpse     0.75
uras        0.73
whopping    0.73
ird         0.72

Activations Density 0.558%