INDEX
Explanations
words or phrases related to exclusion or being excluded
terms related to exclusion and being excluded from groups or activities
New Auto-Interp
Negative Logits
oÄŁ
-0.77
stead
-0.77
Hum
-0.74
nai
-0.70
des
-0.69
Found
-0.68
rious
-0.67
Herald
-0.67
DAY
-0.66
ache
-0.65
POSITIVE LOGITS
ively
0.92
Territories
0.69
egu
0.63
prejud
0.63
spoilers
0.62
Shinra
0.62
naire
0.61
ij士
0.61
uary
0.61
atively
0.60
Activations Density 0.020%