INDEX
Explanations
years represented in a specific format
years or numerical references in the context of historical content
New Auto-Interp
Negative Logits
aminer
-0.78
backfield
-0.76
millenn
-0.73
nomine
-0.70
stuffed
-0.68
igent
-0.67
cram
-0.67
istor
-0.66
shaved
-0.66
whipped
-0.65
POSITIVE LOGITS
âĸĪâĸĪ
1.11
49
1.10
19
1.10
61
1.08
05
1.01
08
1.00
39
1.00
04
0.99
37
0.99
58
0.98
Activations Density 0.016%