INDEX
Explanations
phrases indicating what to expect or look for in a given context
phrases and expressions related to anticipation or expectations
New Auto-Interp
Negative Logits
trak
-0.68
mone
-0.65
inactive
-0.63
moot
-0.62
Fever
-0.62
DOT
-0.61
jury
-0.60
rid
-0.59
tur
-0.56
imilar
-0.56
POSITIVE LOGITS
ãĤ´
0.80
ACA
0.76
beforehand
0.73
ETS
0.72
versus
0.72
entails
0.72
?:
0.71
çIJ
0.71
depends
0.68
?,
0.67
Activations Density 0.255%