INDEX
Explanations
words related to being free or without something (e.g., "free", "less")
terms related to freedom or absence of restrictions
Negative Logits
skilled    -0.75
INC        -0.74
foreseen   -0.74
NEWS       -0.70
ional      -0.65
part       -0.64
along      -0.63
achu       -0.63
shows      -0.62
skill      -0.61
Positive Logits
zing        1.10
zers        0.95
ze          0.91
lihood      0.90
zes         0.87
zer         0.84
zones       0.78
mentality   0.77
zone        0.76
zhen        0.76
Activations Density 0.099%