INDEX
Explanations
adjectives describing intensity or importance
Negative Logits
éŃĶ             -0.79
Recall          -0.78
ãĥİ             -0.78
Surely          -0.73
CLAIM           -0.72
âī¡             -0.71
ãĤ®             -0.69
UPDATE          -0.69
Flickr          -0.68
UPDATE          -0.68
Positive Logits
[               0.85
holistic        0.83
mathemat        0.79
intangible      0.77
transitional    0.75
qualitative     0.75
punitive        0.74
layered         0.72
deterrent       0.72
gradual         0.71
Activations Density: 0.303%