INDEX
Explanations
phrases that imply a specific piece of information is being revealed or communicated
the word "indicate" and its variations, signifying evidence or suggestion
New Auto-Interp
Negative Logits
fare
-0.78
venge
-0.75
ÄŁ
-0.72
@#&
-0.71
ctors
-0.70
tal
-0.69
pping
-0.69
vas
-0.69
rit
-0.69
iling
-0.69
POSITIVE LOGITS
indications
1.04
indicates
0.90
indication
0.90
signs
0.86
indicated
0.85
hints
0.81
indicating
0.80
indicate
0.79
intervals
0.78
Signs
0.74
Activations Density 0.025%