INDEX
Explanations
mentions of specific ideas or concepts stated with emphasis or certainty
New Auto-Interp
Negative Logits
izont
-0.78
ãĥ³ãĤ¸
-0.76
izens
-0.75
greg
-0.74
sts
-0.69
ãĥ¼ãĥĨ
-0.69
ãĥ¯ãĥ³
-0.69
ulner
-0.69
imaru
-0.68
apolis
-0.67
POSITIVE LOGITS
happens
1.37
happened
1.21
occurs
1.15
translates
1.04
happen
1.00
proves
0.94
occurred
0.94
applies
0.91
coincides
0.90
settles
0.90
Activations Density 0.069%