INDEX
Explanations
symbols and characters, possibly related to formatted text or specific identifiers
New Auto-Interp
Negative Logits
hubby
-0.19
(
-0.17
OTO
-0.16
;-
-0.16
:-
-0.15
--
-0.14
:-)
-0.14
(--
-0.14
youngster
-0.14
;-
-0.14
POSITIVE LOGITS
“[
0.22
Yale
0.21
students
0.19
sophomore
0.19
student
0.19
Cornell
0.19
campus
0.18
undergraduate
0.18
Student
0.18
Students
0.18
Activations Density 0.027%