INDEX
Explanations
phrases indicating to monitor or pay attention to something
references to surveillance or keeping track of something
New Auto-Interp
Negative Logits
̶
-0.86
©¶æ¥µ
-0.85
hement
-0.74
itle
-0.73
phabet
-0.71
ayers
-0.69
sadd
-0.69
eno
-0.68
gebra
-0.68
¢
-0.67
POSITIVE LOGITS
closely
1.03
developments
0.96
lest
0.90
suspicious
0.84
incoming
0.82
trends
0.81
fluctuations
0.72
whereabouts
0.71
periphery
0.71
carefully
0.70
Activations Density 0.178%