Explanations
words related to worn items or personal belongings
words related to negative physical effects or behaviors
Negative Logits
Token          Logit
Kills          -0.79
GMT            -0.72
Reborn         -0.72
Participant    -0.66
Solitaire      -0.66
MENTS          -0.65
Daylight       -0.64
MENT           -0.64
:]             -0.64
Abbey          -0.63
Positive Logits
Token          Logit
aring          0.88
asting         0.87
ipping         0.86
alog           0.86
orting         0.82
oldown         0.82
itored         0.81
istance        0.81
ivery          0.80
ility          0.80
Activations Density: 0.113%