INDEX
Explanations
phrases expressing timely intentions or future events
New Auto-Interp
Negative Logits
myſelf
-1.09
Theſe
-1.00
themſelves
-0.98
itſelf
-0.95
himſelf
-0.95
avoient
-0.92
Efq
-0.92
Sucesor
-0.92
Beſ
-0.91
auroit
-0.90
POSITIVE LOGITS
vision
0.65
Vision
0.65
Vision
0.63
0.63
vision
0.62
insights
0.58
points
0.56
الحياه
0.56
insights
0.55
Insights
0.53
Activations Density 0.147%