INDEX
Explanations
labels or titles within a structured document or webpage
Negative Logits
Tennis     -0.72
Fusion     -0.69
tennis     -0.68
clan       -0.68
Crimson    -0.67
jet        -0.66
Cyber      -0.65
jets       -0.64
cra        -0.64
upgrade    -0.62
Positive Logits
Label        3.08
label        2.66
title        1.42
Title        1.20
Label        1.19
ppelin       1.14
Button       1.10
Text         1.08
lab          1.07
displayText  0.97
Activations Density: 0.031%