Explanations
proper nouns or names
references to specific dates and notable individuals or events
Negative Logits
vanity        -0.68
notations     -0.68
Grimoire      -0.64
Bundesliga    -0.59
Blizzard      -0.57
Tuc           -0.57
Discord       -0.56
Brewer        -0.56
Witcher       -0.55
Inquisition   -0.54
Positive Logits
..."          0.86
,...          0.83
UNCLASSIFIED  0.80
Replay        0.78
.''           0.78
.]            0.76
}}            0.76
'';           0.75
.</           0.71
>>\           0.70
Activations Density 0.906%