INDEX
Explanations
terms indicating close observation or monitoring
references to careful observation or scrutiny
Negative Logits
Token       Logit
ICAN        -0.83
Bucket      -0.72
Chaser      -0.72
ule         -0.66
Lightning   -0.66
Memories    -0.65
æĹ          -0.65
Mania       -0.65
Bronze      -0.64
IRO         -0.64
Positive Logits
Token       Logit
aligned     0.95
resembles   0.90
closely     0.86
scrutin     0.83
resemble    0.82
enough      0.82
wired       0.81
minded      0.80
resembled   0.80
correlated  0.79
Activations Density: 0.009%