INDEX
Explanations
descriptions of things as "strange"
occurrences of the word "strange" and its variations
New Auto-Interp
Negative Logits
cussion
-0.82
payers
-0.78
bern
-0.78
inders
-0.78
igate
-0.75
vation
-0.75
agles
-0.73
chen
-0.72
amaru
-0.72
apers
-0.71
POSITIVE LOGITS
occurrences
0.97
ness
0.87
coincidence
0.86
ly
0.86
worldly
0.83
twists
0.82
twist
0.79
assortment
0.76
aber
0.73
nesses
0.73
Activations Density 0.041%