INDEX
Explanations
studies or reports revealing statistical findings or survey results
phrases indicating research findings or survey results
Negative Logits
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³  -0.75
ãĤ§  -0.65
via  -0.64
ï¸  -0.63
ä½ľ  -0.62
vard  -0.61
ãĤ¶  -0.61
argon  -0.60
ãĥ¤  -0.60
gur  -0.60
Positive Logits
Orb  0.64
discrepancies  0.63
anomalies  0.62
evidence  0.61
¿½  0.61
irregularities  0.59
citing  0.57
findings  0.56
agi  0.56
that  0.56
Activations Density 0.220%