INDEX
Explanations
proper nouns
proper nouns, particularly names of individuals
New Auto-Interp
Negative Logits
è¦
-0.61
æĺ¯
-0.55
Entered
-0.54
plateau
-0.54
¥µ
-0.54
EPA
-0.53
``(
-0.53
âĶľâĶĢâĶĢ
-0.53
":"","
-0.53
states
-0.53
POSITIVE LOGITS
alive
0.89
overboard
0.82
's
0.81
onto
0.79
accountable
0.77
ineligible
0.77
hostage
0.75
ieri
0.73
away
0.71
into
0.71
Activations Density 0.386%