Explanations
beverages, specifically highlighting tea and coffee
phrases that include references to beverages
Negative Logits
abilities   -0.91
cms         -0.77
iets        -0.75
nw          -0.75
aucuses     -0.74
arily       -0.71
hra         -0.70
ahime       -0.68
tremend     -0.68
çīĪ         -0.67
Positive Logits
glass       0.81
cloth       0.81
disinfect   0.78
whisky      0.77
freshly     0.75
corn        0.74
ice         0.73
Scotch      0.72
plastic     0.72
silver      0.72
Activations Density: 0.143%