INDEX
Explanations
the word "Off" with the strongest activation levels
words related to the concept of "off" or "offering."
New Auto-Interp
Negative Logits
izabeth
-0.70
ridor
-0.64
growth
-0.63
dehydration
-0.62
skelet
-0.62
ascript
-0.61
federation
-0.61
omething
-0.60
enment
-0.60
ATHER
-0.60
POSITIVE LOGITS
Off
3.65
Off
2.44
OFF
2.42
off
1.88
OFF
1.80
off
1.56
offs
1.36
Offer
1.33
Away
1.27
Down
1.23
Activations Density 0.006%