INDEX
Explanations
names of a person or brand "Strong"
references to a specific individual associated with a significant event
New Auto-Interp
Negative Logits
externalToEVAOnly
-0.83
mits
-0.74
MIA
-0.72
ktop
-0.71
UTE
-0.67
RECT
-0.66
opus
-0.63
McCann
-0.63
oreal
-0.62
çĦ
-0.62
POSITIVE LOGITS
strong
1.16
enger
1.01
itudinal
0.92
est
0.92
nesses
0.88
er
0.85
fast
0.82
Weak
0.80
itude
0.79
GoldMagikarp
0.76
Activations Density 0.008%