INDEX
Explanations
phrases related to comparison and observation, especially in different contexts like books, personal experiences, or conflicts
commas in the text
New Auto-Interp
Negative Logits
¬¼
-0.74
orse
-0.73
iculty
-0.66
endment
-0.66
ore
-0.65
ft
-0.64
orn
-0.64
prise
-0.63
=#
-0.61
Reply
-0.61
POSITIVE LOGITS
namely
1.31
viz
1.00
including
0.98
plus
0.84
albeit
0.81
totaling
0.78
excluding
0.77
respectively
0.77
excluding
0.76
notably
0.73
Activations Density 0.442%