INDEX
Explanations
references to strength and resilience
strength and courage
New Auto-Interp
Negative Logits
UrlResolution
-0.44
bağlantılar
-0.35
sociedade
-0.34
География
-0.34
leuke
-0.33
anticipo
-0.33
iodía
-0.33
AllowAnonymous
-0.32
gebirge
-0.32
Coef
-0.31
POSITIVE LOGITS
strength
0.96
Strength
0.88
Strength
0.83
strength
0.82
STRENGTH
0.75
strong
0.69
strong
0.68
Strong
0.66
courage
0.65
STRONG
0.64
Activations Density 0.014%