INDEX
Explanations
phrases related to being ranked or rated
instances of ranking or scoring systems
New Auto-Interp
Negative Logits
Tanz
-0.64
Materials
-0.58
endish
-0.57
ETHOD
-0.56
Disabled
-0.55
imar
-0.55
endon
-0.55
REM
-0.54
Howell
-0.53
IPS
-0.53
POSITIVE LOGITS
plates
0.99
crunch
0.92
one
0.91
one
0.89
Ones
0.89
thirteen
0.88
eleven
0.87
less
0.87
two
0.87
eight
0.86
Activations Density 0.053%