INDEX
Explanations
numerical values indicating a specific amount or limit
mentions of the number "200" and related numerical representations or contexts
New Auto-Interp
Negative Logits
streak
-0.77
pronounced
-0.67
melanch
-0.67
interpretation
-0.66
Stre
-0.65
Laz
-0.64
commentary
-0.64
opinion
-0.62
Gle
-0.60
interpretations
-0.59
POSITIVE LOGITS
200
3.35
400
2.24
300
2.23
600
2.14
100
2.11
500
2.10
800
1.96
700
1.88
201
1.87
250
1.84
Activations Density 0.021%