INDEX
Explanations
numerical data such as values and percentages
instances of the word "the" and their associating context in numerical data
New Auto-Interp
Negative Logits
igans
-0.74
oko
-0.68
someday
-0.63
endif
-0.62
ouch
-0.61
deserves
-0.61
deserve
-0.61
masc
-0.60
abilities
-0.60
ende
-0.60
POSITIVE LOGITS
entirety
1.14
entire
1.13
same
1.11
span
0.99
preceding
0.96
period
0.94
contiguous
0.92
shortest
0.91
total
0.90
lowest
0.90
Activations Density 0.265%