INDEX
Explanations
verbs in the past tense
references to actions related to performance or achievement
New Auto-Interp
Negative Logits
Miko
-0.76
Koch
-0.66
Ducks
-0.65
Kot
-0.64
Pacific
-0.64
Nanto
-0.60
Stars
-0.60
karma
-0.59
Warm
-0.59
PAC
-0.58
POSITIVE LOGITS
©¶æ¥µ
0.80
*.
0.73
icz
0.71
().
0.70
NetMessage
0.69
accustomed
0.68
tical
0.68
theirs
0.66
attest
0.64
entimes
0.64
Activations Density 0.212%