INDEX
Explanations
references to physical objects used for drinking, such as mugs
references to physical objects like mugs and jugs, particularly in a contextual or metaphorical sense
New Auto-Interp
Negative Logits
×Ļ×
-0.75
cision
-0.67
edient
-0.66
hovah
-0.65
IGH
-0.63
secondary
-0.63
ipher
-0.63
Virgin
-0.63
judicial
-0.62
thodox
-0.62
POSITIVE LOGITS
mug
1.32
gers
1.00
shots
0.93
ging
0.90
atures
0.90
Mug
0.88
shot
0.84
ger
0.83
ged
0.80
ograph
0.79
Activations Density 0.007%