INDEX
Explanations
instances of being trapped or caught in various situations
caught or trapped
New Auto-Interp
Negative Logits
kasarigan
-0.57
nakalista
-0.53
complexContent
-0.51
nahilalakip
-0.50
Infórmanos
-0.49
surla
-0.49
للمعارف
-0.49
новниш
-0.49
SpringRunner
-0.47
$__
-0.47
POSITIVE LOGITS
stuck
0.56
trapped
0.56
Flucht
0.54
Lost
0.52
stuck
0.46
escaped
0.46
Escape
0.46
ESCAPE
0.46
escape
0.45
caught
0.45
Activations Density 0.014%