INDEX
Explanations
words related to openness or lack of restriction
references to "open-ended" concepts or themes
New Auto-Interp
Negative Logits
Tsarnaev
-0.74
Tycoon
-0.74
Brach
-0.71
Mons
-0.69
UD
-0.68
Rouge
-0.66
Ń·
-0.66
Lumpur
-0.65
Khan
-0.65
Lowry
-0.64
POSITIVE LOGITS
ended
1.24
minded
1.22
source
1.22
eyed
1.13
facing
1.09
bodied
1.07
office
1.02
air
1.02
access
1.01
skinned
1.01
Activations Density 0.026%