INDEX
Explanations
dates expressed in a specific format (e.g., "16:9" or "16:10")
references to numerical values, particularly those involving the number 16
New Auto-Interp
Negative Logits
olicy
-0.96
tremend
-0.88
isode
-0.83
atre
-0.83
uppet
-0.82
enhagen
-0.78
etsk
-0.76
andise
-0.75
pherd
-0.75
rint
-0.75
POSITIVE LOGITS
384
1.29
6666
1.20
th
0.92
05
0.87
07
0.86
650
0.85
06
0.84
03
0.83
teen
0.83
09
0.82
Activations Density 0.038%