Explanations
occurrences of the word "left" followed by a number
instances of the word "left" indicating abandonment or lack of support
Negative Logits
andise        -0.69
alez          -0.69
%]            -0.68
insula        -0.65
CoC           -0.64
Temperature   -0.62
alist         -0.62
acity         -0.61
Wan           -0.61
uracy         -0.61
Positive Logits
overs         1.09
undone        0.93
wing          0.90
untreated     0.86
ward          0.81
wich          0.81
fing          0.75
handed        0.75
Dise          0.73
handed        0.71
Activations Density: 0.029%