INDEX
Explanations
mentions of the word "note" or its variations
references to notable events or items
New Auto-Interp
Negative Logits
£ı
-0.85
Ͻ
-0.73
ruary
-0.71
Kate
-0.71
dimension
-0.69
ailability
-0.61
hips
-0.60
disapp
-0.60
antha
-0.59
dred
-0.59
POSITIVE LOGITS
OPLE
1.12
tsky
1.02
lete
0.99
ote
0.98
otes
0.95
chnology
0.95
chn
0.90
atro
0.82
opsis
0.80
ito
0.79
Activations Density 0.011%