INDEX
Explanations
phrases or sentences where some entity is identified as a specific person or group
instances of the word "identified" indicating recognition or classification
New Auto-Interp
Negative Logits
olt
-0.70
goodbye
-0.67
prem
-0.66
rip
-0.65
touch
-0.64
uu
-0.62
Heaven
-0.62
pow
-0.62
heaven
-0.61
rain
-0.61
POSITIVE LOGITS
identified
3.38
identifies
2.05
identified
2.00
identify
1.93
identifiable
1.85
identifying
1.80
detected
1.58
identification
1.57
Identified
1.56
described
1.48
Activations Density 0.014%