INDEX
Explanations
phrases associated with disagreement or differing opinions
New Auto-Interp
Negative Logits
Fla
-0.64
APTER
-0.62
Amph
-0.58
understandably
-0.56
Tent
-0.55
Raqqa
-0.54
Fedora
-0.53
Ramos
-0.53
Cage
-0.53
Blade
-0.53
POSITIVE LOGITS
abouts
1.18
depending
1.05
Else
1.00
versa
1.00
thereof
1.00
altogether
0.94
phans
0.90
outright
0.88
å§«
0.87
alternatively
0.85
Activations Density 1.951%