INDEX
Explanations
references to screens or display-related concepts
New Auto-Interp
Negative Logits
География
-0.57
reservations
-0.57
(@"
-0.57
Портал
-0.56
:])
-0.55
BoxFit
-0.53
Meksiku
-0.53
experimenta
-0.53
umgekehrt
-0.52
zaw
-0.52
POSITIVE LOGITS
screen
0.96
screen
0.94
Belief
0.85
screens
0.82
SCREEN
0.81
creen
0.79
Belief
0.79
belief
0.78
excuse
0.78
Screens
0.75
Activations Density 0.169%