INDEX
Explanations
dates and times written in the format of numbers with periods between them
punctuation marks, specifically periods
New Auto-Interp
Negative Logits
sidel
-0.78
stre
-0.67
£ı
-0.65
shroud
-0.64
idence
-0.63
uracy
-0.63
comparisons
-0.62
deceptive
-0.62
trouble
-0.62
association
-0.61
POSITIVE LOGITS
000
1.04
05
0.99
07
0.97
06
0.96
09
0.95
5
0.94
500
0.94
04
0.94
08
0.91
00
0.91
Activations Density 0.122%