Explanations
phrases related to different levels of achievement or measurement
references to varying levels of proficiency or performance
Negative Logits
vous        -0.81
rontal      -0.74
ãĥĹ         -0.66
Terra       -0.65
Arg         -0.65
Phot        -0.64
Andromeda   -0.63
lez         -0.63
retina      -0.62
Wil         -0.62
Positive Logits
level       1.03
level       0.99
levels      0.93
Level       0.90
levels      0.80
Level       0.77
afety       0.72
Levels      0.72
rise        0.72
ropri       0.69
Activations Density: 0.030%