INDEX
Explanations
mentions of actions or activities related to playing
New Auto-Interp
Negative Logits
Beir
-0.75
fty
-0.66
thening
-0.66
imb
-0.66
inki
-0.64
ortium
-0.62
pora
-0.61
ageddon
-0.59
mediately
-0.57
identally
-0.56
POSITIVE LOGITS
wright
1.19
ername
1.13
lists
0.91
GROUND
0.88
testing
0.87
testers
0.85
Piano
0.84
plays
0.83
wr
0.81
Solitaire
0.79
Activations Density 0.476%