INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
ICLE
-0.67
HOUSE
-0.64
ittal
-0.60
TRUE
-0.59
disposed
-0.58
burnt
-0.58
awi
-0.58
aron
-0.58
motion
-0.58
furnished
-0.58
POSITIVE LOGITS
76561
0.78
Alz
0.70
achus
0.69
quote
0.68
Chev
0.67
chev
0.67
omething
0.67
python
0.66
akespe
0.66
Quote
0.65
Activations Density 0.000%
No Known Activations
This feature has no known activations.