INDEX
Explanations
phrases related to giving advice or sharing instructions in casual or gaming contexts
New Auto-Interp
Negative Logits
irm
-0.74
rounder
-0.69
irmed
-0.62
chio
-0.61
thia
-0.61
tty
-0.61
olly
-0.60
ellow
-0.59
anus
-0.58
asts
-0.57
POSITIVE LOGITS
absurdity
0.80
lessness
0.78
brink
0.78
liest
0.77
where
0.76
extent
0.75
verge
0.73
ophys
0.71
exhaustion
0.70
points
0.70
Activations Density 0.028%