INDEX
Explanations
descriptions of features and characteristics of various items or products
New Auto-Interp
Negative Logits
Transcript
-0.80
Facts
-0.71
Written
-0.70
1947
-0.70
documents
-0.67
Names
-0.67
Announce
-0.67
Aad
-0.65
Newsp
-0.65
Politics
-0.64
POSITIVE LOGITS
manageable
0.89
comfortably
0.88
noticeably
0.88
heavier
0.87
finer
0.86
overpowered
0.85
smoother
0.85
bottleneck
0.85
overpower
0.83
cramped
0.81
Activations Density 0.960%