INDEX
Explanations
phrases related to strong actions or impactful events
phrases related to social and moral issues
New Auto-Interp
Negative Logits
Oaks
-0.63
Gideon
-0.63
200000
-0.61
Robbie
-0.60
iage
-0.58
LH
-0.58
ilan
-0.57
ertility
-0.56
Kro
-0.56
Liberty
-0.55
POSITIVE LOGITS
»
2.00
âĢ
1.84
''
1.79
âĢ
1.68
ãĢį
1.67
ãĢ
1.61
''.
1.55
[/
1.53
¨
1.47
</
1.46
Activations Density 0.688%