INDEX
Explanations
words related to bodily sensations or states
words and phrases associated with physical sensations and emotional states
New Auto-Interp
Negative Logits
Protector
-0.78
Bahrain
-0.74
Reply
-0.70
Owners
-0.69
Legions
-0.68
Republic
-0.68
Accountability
-0.65
Reference
-0.64
Reconstruction
-0.64
Bain
-0.64
POSITIVE LOGITS
swe
1.08
ety
1.07
eper
0.92
sweater
0.89
orthy
0.87
pee
0.84
itte
0.83
rette
0.83
cer
0.82
daq
0.80
Activations Density 0.007%