INDEX
Explanations
general information or details from news articles
occurrences of the word "More."
New Auto-Interp
Negative Logits
keeping
-0.75
2024
-0.70
liest
-0.68
Fram
-0.67
atan
-0.67
RL
-0.65
Enlarge
-0.65
itudes
-0.65
RN
-0.64
same
-0.62
POSITIVE LOGITS
ado
1.16
Than
1.10
than
1.04
importantly
1.03
mature
0.79
than
0.76
important
0.75
sophisticated
0.75
extensive
0.74
stringent
0.74
Activations Density 0.035%