INDEX
Explanations
times, dates, and proper nouns
timestamps and numerical data within the text
Negative Logits
emale        -0.67
umblr        -0.66
minecraft    -0.63
Pref         -0.61
urn          -0.61
ONT          -0.60
ogn          -0.60
deck         -0.59
uliffe       -0.59
rul          -0.58
Positive Logits
partName     0.83
isode        0.74
":""},{"     0.72
UPDATE       0.72
·            0.69
IPM          0.67
³³³          0.65
REPORT       0.65
Countdown    0.64
CONTIN       0.63
Activations Density 0.209%