INDEX
Explanations
mentions of an action related to removing or kicking something or someone out
occurrences of the word "Out" and its variations
New Auto-Interp
Negative Logits
arsen
-0.87
avorite
-0.73
interstitial
-0.69
OTT
-0.65
ione
-0.63
vre
-0.63
=-=-=-=-=-=-=-=-
-0.61
turnover
-0.58
misunder
-0.57
Metallic
-0.56
POSITIVE LOGITS
doors
1.20
rage
1.08
dated
1.07
breaks
1.03
fitted
1.01
casts
1.00
landish
0.99
fits
0.98
come
0.97
raged
0.97
Activations Density 0.049%