Linguistic profile automated characterisation in pluripotential clinical high-risk mental state (CHARMS) conditions: methodology of a multicentre observational study

Luca Magnani; Luca Carmisciano; Felice dell’Orletta; Ornella Bettinardi; Silvia Chiesa; Massimiliano Imbesi; Giuliano Limonta; Elisa Montagna; Ilaria Turone; Dario Martinasso; Andrea Aguglia; Gianluca Serafini; Mario Amore; Andrea Amerio; Alessandra Costanza; Francesca Sibilla; Pietro Calcagno; Sara Patti; Gabriella Molino; Andrea Escelsior; Alice Trabucco; Lisa Marzano; Dominique Brunato; Andrea Amelio Ravelli; Marco Cappucciati; Roberta Fiocchi; Gisella Guerzoni; Davide Maravita; Fabio Macchetti; Elisa Mori; Chiara Anna Paglia; Federica Roscigno; Antonio Saginario

doi:10.1136/bmjopen-2022-066642

Article Text

PDF

PDF +
Supplementary
Material

Mental health

Protocol

Linguistic profile automated characterisation in pluripotential clinical high-risk mental state (CHARMS) conditions: methodology of a multicentre observational study

http://orcid.org/0000-0002-3620-2021Luca Magnani1,2,
Luca Carmisciano3,
Felice dell’Orletta4,
Ornella Bettinardi5,
Silvia Chiesa5,
Massimiliano Imbesi5,
Giuliano Limonta5,
Elisa Montagna1,2,
Ilaria Turone1,2,
Dario Martinasso1,2,
Andrea Aguglia1,2,
Gianluca Serafini1,2,
Mario Amore1,2,
Andrea Amerio1,2,
http://orcid.org/0000-0001-6387-6462Alessandra Costanza6,7,
Francesca Sibilla1,2,
Pietro Calcagno1,2,
Sara Patti8,
Gabriella Molino8,
Andrea Escelsior1,2,
Alice Trabucco1,2,
http://orcid.org/0000-0001-9735-3512Lisa Marzano9,
Dominique Brunato4,
Andrea Amelio Ravelli4,
Marco Cappucciati5,10,
Roberta Fiocchi5,
Gisella Guerzoni5,
Davide Maravita5,
Fabio Macchetti5,
Elisa Mori5,
Chiara Anna Paglia5,
Federica Roscigno5,
Antonio Saginario5
LNG-PSY Study Investigators

¹IRCCS Ospedale Policlinico San Martino, Genoa, Italy
²Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), Section of Psychiatry, University of Genoa, Genoa, Italy
³Department of Health Sciences (DISSAL), Section of Biostatistics, University of Genoa, Genoa, Italy
⁴Italian Natural Language Processing Lab, Institute of Computational Linguistics "Antonio Zampolli", CNR di Pisa, Pisa, Italy
⁵Department of Mental Health and Pathological Addictions, Piacenza Local Authority, Piacenza, Italy
⁶Department of Psychiatry, Faculty of Medicine, University of Geneva (UNIGE), Geneva, Switzerland
⁷Department of Psychiatry, Service of Adult Psychiatry (SPA), University Hospital of Geneva (HUG), Geneva, Switzerland
⁸Department of Mental Health and Pathological Addictions, Genoa Local Authority, Genoa, Liguria, Italy
⁹Departement of Psychology, School of Science and Technology, Middlesex University, London, UK
¹⁰Early Psychosis: Interventions and Clinical-Detection (EPIC) Lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology & Neuroscience, Kings College London, London, UK

Correspondence to Dr Luca Magnani; magnani1991{at}gmail.com

Abstract

Introduction Language is usually considered the social vehicle of thought in intersubjective communications. However, the relationship between language and high-order cognition seems to evade this canonical and unidirectional description (ie, the notion of language as a simple means of thought communication). In recent years, clinical high at-risk mental state (CHARMS) criteria (evolved from the Ultra-High-Risk paradigm) and the introduction of the Clinical Staging system have been proposed to address the dynamicity of early psychopathology. At the same time, natural language processing (NLP) techniques have greatly evolved and have been successfully applied to investigate different neuropsychiatric conditions. The combination of at-risk mental state paradigm, clinical staging system and automated NLP methods, the latter applied on spoken language transcripts, could represent a useful and convenient approach to the problem of early psychopathological distress within a transdiagnostic risk paradigm.

Methods and analysis Help-seeking young people presenting psychological distress (CHARMS+/− and Clinical Stage 1a or 1b; target sample size for both groups n=90) will be assessed through several psychometric tools and multiple speech analyses during an observational period of 1-year, in the context of an Italian multicentric study. Subjects will be enrolled in different contexts: Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), Section of Psychiatry, University of Genoa—IRCCS Ospedale Policlinico San Martino, Genoa, Italy; Mental Health Department—territorial mental services (ASL 3—Genoa), Genoa, Italy; and Mental Health Department—territorial mental services (AUSL—Piacenza), Piacenza, Italy. The conversion rate to full-blown psychopathology (CS 2) will be evaluated over 2 years of clinical observation, to further confirm the predictive and discriminative value of CHARMS criteria and to verify the possibility of enriching them with several linguistic features, derived from a fine-grained automated linguistic analysis of speech.

Ethics and dissemination The methodology described in this study adheres to ethical principles as formulated in the Declaration of Helsinki and is compatible with International Conference on Harmonization (ICH)-good clinical practice. The research protocol was reviewed and approved by two different ethics committees (CER Liguria approval code: 591/2020—id.10993; Comitato Etico dell’Area Vasta Emilia Nord approval code: 2022/0071963). Participants will provide their written informed consent prior to study enrolment and parental consent will be needed in the case of participants aged less than 18 years old. Experimental results will be carefully shared through publication in peer-reviewed journals, to ensure proper data reproducibility.

Trial registration number DOI:10.17605/OSF.IO/BQZTN.

Adult psychiatry
Child & adolescent psychiatry
Health informatics
EPIDEMIOLOGY
Protocols & guidelines

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

https://doi.org/10.1136/bmjopen-2022-066642

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

Using validated diagnostic criteria, the study aims to improve the characterisation of early psychopathology with the results of a fine-grained analysis of language, hoping to define proper linguistic biomarkers.
The selection of relevant linguistic features is performed through a data-driven approach, without predefined cut-offs correlated to pathological significance.
Spoken-language data can be highly variable, and a key challenge concerns the optimisation of such a phase of data acquisition to allow the extraction of relevant information during the subsequent phase of textual processing.
The retention of participants for the entire duration of the observational period is a further challenge.
The assessment of conversion to full-blown psychopathology during the second year of prolonged observation may represent a methodological issue.

Introduction

Language, thought and human beings

Language is usually considered the social vehicle of thought in the context of intersubjective communications. This canonical interpretation of the thought-language relationship implicitly entails the priority of the first term. Therefore, in common medical practice, the verbalisation of delirious contents during an acute psychotic episode is notoriously considered as the manifestation of an underlying thought disorder.1 On the other hand, interspecies cognitive studies revealed the existence of a fundamental gap between the expression of linguistic and non-linguistic contents.2–4 The relationship between language and high-order cognition seems to evade the canonical unidirectional description and the common idea of a thought primacy. In fact, it is at least plausible that language acquisition exhibits a critical role for human cognitive development5: a subsequent deficit in cognitive functions has been experimentally linked to a primary insufficient development of linguistic skills.6–8 Moreover, according to Tattersall’s theory,9 10 language should be regarded as the fact that permitted the transition across different cognitive phenotypes during the evolution of human species. In philosophy, many authors highlighted the deep constitutive character of language for humans.11 12

Language and phenomenal experience

As well known, consciousness definition is still an ongoing issue. Among other things, it could be simplistically represented as an active and fundamental background that synthetise experienced phenomena within a spatiotemporal schema. This said, the linguistic apparatus, analysable through advanced natural language processing (NLP) techniques, can be considered as something that shapes the product of this primary synthesis to further reduce the phenomenal complexity and to allow the emergence of a unique and well-defined subjective experience. Therefore, the conscious phenomenon offers itself as a synthetic elaboration of an unrefined experience, following a double logic (first transcendental and then formal-linguistic). In this context, language stops being exclusively a means of communication for predefined thoughts, thus becoming a refined tool of interaction with the experienced world, regardless of what it can be said about abstract thought function. In some recent interpretations this complex problem has been transposed within the theoretical framework of predictive brain13 14 to show that ‘verbal cues (even if self-generated) can act as highly flexible (and metabolically cheap) contexts (set of priors), to generate a predictive signal helping the system to process an input that is otherwise too weak or noisy”.15

Psychopathology and language: floating on fluid psychopathological substrates

Most relevant psychopathological conditions appear to originate during early life stages.16 17 At the same time, the boundaries between previously distinct psychopathological disorders seem to weaken.18–20 The inadequacy of classical nosography has become progressively more evident, especially when it is applied to the dynamicity of early psychopathological phenomena.21 22 A classification based on strictly defined diagnostic categories appears inadequate when considering the complex phenomenon of comorbidities23 24 and symptomatological overlaps, frequently expressed during early stages of mental illness.25–29 To address the complex world of early psychopathology the so-called Ultra-High-Risk paradigm has been proposed over the past decades,30 originally developed to individuate schizophrenia prodromes.31 More recently, the concept of at-risk mental state has been expanded to detect conditions of ‘trans-diagnostic risk’.32–35 Coherently, Hartmann and colleagues36 proposed a new methodology based on the application of clinical high at-risk mental state (CHARMS) criteria and enriched with the introduction of a clinical staging system37 (table 1). Specifically, the detection of the CHARMS criteria allows to identify adolescents and young adults (aged up to 25 years old) expressing a psychopathological condition of transdiagnostic risk, which corresponds to stage 1b in the context of the clinical staging system (CS 1b). Coherently with the transdiagnostic approach, the risk is referred to a generic ‘exit syndrome’, that is, a first episode of full-blown psychopathology, defined through the overcoming of some psychometric thresholds, as well as through the verification of DSM-5 (Diagnostic and Statistical Manual of Mental Disorders - Fifth Edition) criteria for a specific disorder. Within the wide group of CHARMS+ / CS 1b subjects, some subcategories of ‘attenuated syndromes’ have been proposed by original authors (see table 2): the psychosis trait vulnerability group; the bipolar trait vulnerability group; the attenuated psychotic symptoms group; the attenuated (hypo)manic symptoms group; the moderate (attenuated) depression group; the attenuated borderline personality group; the brief limited intermittent psychotic symptom group. An ongoing observational study has been structured starting from this latter proposal, aimed to verify the predictive power of some novel risk category defined through the application of CHARMS criteria.38 Preliminary results have recently been published.39 To conceive our proposal and to first verify data reproducibility, we initially acquired the methodology described by Hartmann and colleagues.36 However, we chose to further enrich the experimental apparatus to gather different information from a fine-grained analysis of the linguistic profile, derived from subjects’ speech. This analysis will be conducted on textual data, revised by researchers, after a first automated transcription of the audio records, directly acquired during experimental assessment. In fact, language, as previously reinterpreted, can represent a further pathoplastic/pathogenetic factor, favouring the pathological crystallisation of phenomenal data.

View this table:

Table 1

Clinical staging system

View this table:

Table 2

CHARMS subgroups of risk or early clinical phenotypes

Language production disturbances in psychosis and schizophrenia have been investigated since some early works promoted by Harrow and Quinlan40 and Andreasen and Grove,41 the sudden development of innovative methods of automated linguistic analysis promoted further investigations (for a comprehensive review see Corcoran et al 2020,42 increasingly oriented towards the early stages of the disease. In fact, it seems that language alterations may soon represent valid and practical biomarkers to perform a multilayered assessment of psychotic risk and to offer more tailored interventions. The hope concerns the possibility of extending this approach within the abovementioned transdiagnostic risk paradigm.

NLP techniques and their application in neuropsychiatric conditions

As reported by Voleti and colleagues,43 it is possible to identify different levels of complexity in linguistic analysis. For each level, several features have been proposed as potentially relevant in association with different neuropsychiatric conditions.

Lexical level

This level allows to extract features that account for the diversity and richness of lexicon used in a text. At this level, the following metrics are usually computed: (a) the type/token ratio (TTR), a standard index of lexical variety; (b) the Moving Average type-token ratio,44 considered as an ‘advanced’ TTR as it calculates lexical variety of a sample using a moving window that estimates TTRs for each successive window of fixed length. Moving Average TTR, Brunét Index, Honoré’s Statistic, part-of-speech (POS) tagging, aim to quantify lexical diversity and density. These parameters have been mainly studied either for risk stratification or for the diagnosis of morbid conditions of purely neurological relevance.45–49 These variables are quick and easy to assess. However, this simplicity is reflected in a reduced capacity of providing relevant information.

Morpho-syntactic level

This level allows to extract information from the POS-tagging step of linguistic annotation. In particular, the following variables are usually computed (a) the percentage distribution of morpho-syntactic categories (both functional and lexical); (b) the ‘lexical density’ index (ie, the proportion between functional words over the total number of words in a text).

Syntactic level

At this level whole propositions are examined to analyse the way in which words are organised in sentences and sentences in speeches. Considering the work of Mota and colleagues,50 the researchers extrapolated objective parameters of language measurement, useful for quantifying the alterations characteristically found in specific morbid states. The set of verifiable syntactic features covers a wide range of properties which can be further grouped. For instance, features related to the parse tree structure (eg, maximum parse tree depth, average length of dependency links), to the use of specific syntactic relations (eg, use of coordination and subordination) and to canonicity effects (eg, relative order of subject and object with respect to the verb).

Semantic analysis

The analysis of linguistic expressions in relation to the meaning they acquire in speech. Among related NLP methods, one of the first developed is the Latent Semantic Analysis (LSA),51 today carried out through the application of specific Machine Learning algorithms, exploiting artificial neural networks word2vec or GloVe.52 53 These tools can probabilistically define, starting from the analysis of large textual corpora, the semantic content of individual words and develop a specific vocabulary. More recently, further algorithms have been developed that can operate similarly at the level of entire propositions (eg, sent2vec, InferSent, Universal Sentence Encoder - USE). One of the first studies carried out in this area aimed to measure, by applying LSA, the semantic coherence of the language of patients suffering from Formal Thought Disorder (FTD) of different severity.54 Furthermore, through these methods the importance of semantic and pragmatic alterations in schizophrenia was confirmed.55 In recent years, the characteristics of language potentially predictive for the onset of psychosis in clinically defined high-risk subjects (clinical high-risk, CHR) were also investigated.56 57 Similarly, Rezaii and colleagues58 defined a so-called digital phenotype useful for quantifying the risk of psychosis onset in CHR subjects. According to Morgan and colleagues,59 different NLP measures may provide complementary information, being potentially associable to distinct aspects of mental disorders.

Aims and objectives

According to our theoretical speculations and inspired by previous works proposed by Hartmann and colleagues,36 39 we designed a multicentre observational study with the following objectives:

Primary objective

To investigate the alterations of multiple spoken language variables in subjects at pluripotential risk (CHARMS+ and Clinical Stage 1b) of developing a range of full-blown psychopathological disorders by estimating the associations between those variables and the conversion to full-blown psychopathological disorder in reference to a second group (CHARMS− and CS 1a), internally defined after the exclusion of the presence of a full-blown disorder (CS 2), as well as of a transdiagnostic risk condition (CHARMS+ and CS 1b).
To prospectively confirm the predictive and discriminant validity of the CHARMS criteria in a sample of CHARMS+ and CS 1b subjects and in a second group (CHARMS− and CS 1a).

Secondary objectives

To develop a prediction model for the probability of conversion to full-blown disease (Clinical Stage 2), using CHARMS criteria, CHARMS subgroups, linguistic features and a data-driven subset of language markers.
To evaluate the predictive and discriminative capabilities of such model.

Crucially, the conversion to full-blown conditions (ie, Clinical stage 2) is referred to a set of ‘exit-syndromes’, in line with the original methodology (ie, psychotic disorder, bipolar disorder, depressive disorder and borderline personality disorder).

Methods and analysis

Participants and setting

We designed a longitudinal follow-up study with an observational period of 2 years. The expected sample size is n=180: 90 subjects who meet CHARMS criteria (Group A: CHARMS+; CS 1b) and 90 controls (Group B: CHARMS−; CS 1a). Potential participants are help-seeking people aged 14–25 who are referred to local mental health service. Subjects will be enrolled in different contexts:

Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), Section of Psychiatry, University of Genoa—IRCCS Ospedale Policlinico San Martino, Genoa, Italy;
Mental Health Department—territorial mental services (ASL 3 – Genoa), Genoa, Italy;
Mental Health Department—territorial mental services (AUSL – Piacenza), Piacenza, Italy.

Group A (CHARMS+ and CS 1b): attenuated syndrome with moderate/subthreshold symptoms.

Inclusion criteria are: (1) the verified presence of a pluripotential ‘at-risk’ mental state—CHARMS+ and Clinical Stage 1b attenuated syndrome; (2) age between 14 and 25 Years; (3) native speakers (Italian) with a good understanding of spoken and written Italian language; (4) the ability to give informed consent by participants themselves and/or by parental authority holders in the case of a minor.

Exclusion criteria are: (1) a documented history of intellectual disability or a diagnosis of autism spectrum disorder; (2) current or previous diagnosed full-blown mental disorder—Clinical Stage ≥2; (3) the presence of relevant neurological disorders (including brain trauma, epilepsy, stroke, cerebral palsy); (4) a mental condition directly and exclusively induced by substances or by organic causes (in accordance with DSM-V criteria); (5) the identification through psychic examination of delusional persecutory thoughts, such that the experimental audio recording may be a source of discomfort for the enrolled subject.

Group B (CHARMS− and CS 1a): mild or non-specific symptoms.

Inclusion criteria are: (1) the exclusion of a pluripotential ‘at-risk’ mental state—CHARMS+ and Clinical Stage 1b attenuated syndrome; (2) age between 14 and 25 Years; (3) native speakers (Italian) with a good understanding of spoken and written Italian language; (4) the ability to give informed consent by participants themselves and/or by parental authority holders in case of a minor.

Exclusion criteria are: (1) a documented history of intellectual disability or a diagnosis of autism spectrum disorder; (2) current or previous diagnosed full-blown mental disorder—Clinical Stage≥2; (3) the presence of relevant neurological disorders (including brain trauma, epilepsy, stroke, cerebral palsy); (4) a mental condition directly and exclusively induced by substances or by organic causes (in accordance with DSM-V criteria).

Procedure and data acquisition

Baseline interview A—psychometric measures

For each enrolled subject, the necessary informed consent is acquired (for participants aged <18 years, parental consent will also be obtained). The baseline examination will be conducted by preselected and pretrained research team members and will involve a general neuropsychiatric evaluation and a structured recording of medical/family history. The following psychometric scales will be administered to finally verify inclusion/exclusion criteria and subjects eligibility:

**Comprehensive Assessment of At-Risk Mental States38—semi-structured interview, seven different psychopathological domains rated according to a global rating score (0–6), a frequency score (0–6) and a substance use score (0–2). Italian validation.60
**Structured Clinical Interview for DSM-5 (SCID-5)61 and SCID-5 Personality Disorders (SCID-5_PD)62—semi-structured interviews for establishing clinical diagnoses (gold standard) based on DSM-5. Regarding the SCID-5_PD section, only the modules concerning borderline personality disorder and schizotypal personality disorder are considered for the purposes of this study.
**Social and Occupational Functioning Scale (SOFAS)63—observer-rated (0–100) scale assessing the social and occupational functioning.
Global Functioning Scale: Social and Role (GFS/GFR)64 65 (clinician rated)—both deriving from Global Assessment of Functioning, GFS assesses (from 1—extreme dysfunction to 10—superior functioning) quantity and quality of social relationship, while GFR assesses (same scoring system) subject’s performance in different contexts (school, work or home).
Quick Inventory of Depressive Symptomatology-Clinicians rated (QIDS-C)66—clinician-rated 16-items questionnaire that assess the severity of depressive symptoms during the previous week.
**Young Mania Rating Scale67—11 clinician-rated items that assess (gold-standard)68 severity of manic symptomology over the previous 48 hours. Italian version.69
Depression Anxiety Stress Scale 21 (DASS-21)70—self-report scale (short version of the 42-item DASS) to assess three domains (seven items for each domain) of negative affectivity referred to the past weeks. When the scale is administered in children and adolescents, only one (general) score is defined.71
Bipolar Spectrum Diagnostic Scale (BSDS)72—self-rating narrative-based scale which assesses the entire bipolar spectrum, including subthreshold states of bipolar illness.73
**Personality inventory for DSM-5, brief version74—self-rating screening75 76 tools (25 items) for assessing in adult and adolescents five maladaptive personality trait-dimensions, described according to the alternative model of personality disorder.
Davos Assessment of Cognitive Biases Scale (DACOBS)77 78—self-report scale with 42 items to assess the presence of possible cognitive biases, cognitive limitations and avoidance behaviours.
Munich Cronotype Questionnaire (MCTQ)79—a self-report tool to assess information on sleep referred to work and work-free days and to quantitatively obtain a chronotype related to sleep intervals.
**Insomnia Severity Index80 81—self-rating (seven items) instrument to assess night-time and daytime symptoms of insomnia in adults and adolescents.82 Italian version.83

** Italian version available.

As indicated, some of these psychometric tools (ie, QIDS-C, SOFAS, GFS/GFR, DASS-21, BSDS, DACOBS and MCTQ) have not been officially validated in Italian, thus a preliminary internal translation was realised to reproduce as much as possible a certain methodology36 and to eventually perform an internal validation of the abovementioned psychometric tools.

If a participant will exceed the threshold for a full-threshold disorder (Clinical Stage ≥2), then he/she will be excluded from the study and committed to the mental health service. On the contrary, if the subject will meet the CHARMS criteria (CHARMS+ and CS 1b) or falls below the CHARMS threshold (CHARMS− and CS 1a), then he/she will be included in the research programme (unless additional exclusion criteria are met).

Through the acquisition of psychometric variables and the application of CHARMS criteria it is also possible to verify for each subject in Group A the subgroup of risk, as proposed by Hartmann et al36 (table 2).

Baseline interview B—speech recording

The baseline assessment includes a second part to be carried out after a few days (T0-b), according to a shared agenda and by the same research team members.

At T0-b, subjects of both groups will be first evaluated through the Montreal Cognitive Assessment scale.84 85 Besides, the spoken language of enrolled subjects will be audio-recorded. Participants will describe four sequences of vignettes, picturing four logically linked events. In each sequence human individuals engaged in simple actions within contexts of daily life are represented. Two sequences were specifically created to be affectively neutral; a third one should be more emotionally salient; finally, a fourth sequence should express a less intuitive logic of transition between the single depicted moments. Then, they will be asked to answer four predefined questions, each related to a particular detail of each picture in a sequence. The free speech of each participant will also be recorded, eventually elicited with some questions, formulated according to narrative interview’s recommendations (ie, phenomenological inquiry paradigm86

The recording sessions will last 30–45 min. Data will be acquired using the same recording device and using a free software (Auphonic), with the following settings:

Format: CAF/WAV (PCM).
Sample rate: 44 khz or 48 khz.
Channel: MONO.
Depth: 16 bits.

Time series

The first year of observation includes three phases of data acquisition for each participant.

During each phase, all relevant information about participants will be registered/updated. Linguistic data will be recorded at Tn-b, following the same methodology of acquisition described for T0-b. Data acquired at each time point are summed in table 3.

View this table:

Table 3

Gantt chart

Crucially, at each different phase of the first year, subjects’ conversion to full-blown disorder (Clinical Stage 2) will be verified. Each participant will be also periodically evaluated by mental health specialists (not directly involved in the study), who will eventually provide him/her with any appropriate therapeutic intervention. During the second year of observation subjects will be specifically assessed for conversion to Clinical Stage 2 any time a significative worsening of psychological status will be reported from the abovementioned standard periodical evaluation performed by external mental health teams.

Linguistic data processing and elaboration

At each phase of data acquisition, audio reports will be automated transcribed verbatim under the supervision of dedicated researchers. A first database of anonymous raw transcripts will be produced. In such a form, transcripts will be shared with our partner institution (Computational Linguistic Institute ‘Antonio Zampolli’, National Research Council (CNR), Pisa, Italy) to perform textual data processing. Starting from the transcripts, linguistic analyses will be performed along different levels. Raw text and (morpho-)syntactic analysis will be automatically carried out by means of Profiling-UD,87 a multilingual web-based tool that provides a comprehensive assessment of language use. The tool performs a two-stage process: linguistic annotation (carried out by UDPipe)88 according to Universal Dependencies (UD) formalism89 and linguistic profiling. The annotated texts will be used as input to the further step, performed by the linguistic profiling component that defines the rules to extract and quantify the formal properties. The final output of the process is a vector-like representation that can comprises more than 120 linguistic features: (1) shallow features, for example, average length and counts of words and sentences, (2) morpho-syntactic features, for example, POS tagging distributions and inflectional properties of verbs or (3) more complex features obtained from syntactic parsing of the sentences, such as the use of subordination. The set of features from Profiling-UD has been derived from the literature on linguistic complexity, language acquisition and neurolinguistics, and have been successfully applied in a wide range of tasks and scenarios: from the automatic tracking of developmental patterns in child language acquisition90 91 and the evolution of written language competence in school learners,92 93 to the prediction of behavioural and cognitive impairments based on the detection of relevant linguistic markers from clinical tests.46 94

Furthermore, semantic representations of each transcript will be computed for both single sentence and the whole session level. To this end, we will rely on state-of-the-art neural network architecture, for example, Transformers models,95 which have shown massive improvements in NLP. Particularly, Natural Language Understanding models based on this technology, such as those of the BERT family96 have defined new states of the art in many tasks (eg, GLUE collection of benchmark tasks). The main advantage of these recent models over previous methodologies is that the embedding of a word is not fixed but computed for every occurrence based on its lexical contour; they can also applied in pathological contexts97. We plan to exploit a pretrained BERT model for the Italian language that has been trained on a huge corpus of more than 13 billions of words, that is, ‘bert-base-italian-cased’ to encode semantic information of words and sentences. Following Corcoran et al,42 we plan to analyse the coherence in the flow of subject speech by computing the semantic closeness of contiguous sentences (ie, the cosine distance between the embedding representations of the sentences).

Duration of the study

The recruitment kick-off is scheduled for July 2022; the database lock will be carried out according to the achievement of a sufficient sample size. The minimum expected duration of the observational period for a non-dropped out participant is 2 years. Preliminary data referred to each participant will be analysed at the end of recruitment phase. After this first passage data could possibly undergo a correction process due to the potential delayed conversion to full-blown disease (Clinical Stage 2) during the second year of observation.

Patient and public involvement

Research questions and outcomes were defined to address the complexity of early psychopathology and to better correspond to help-seeking young people needs, frequently expressed in real-world mental health settings. The richness of patients’ symptoms descriptions and their expressive urgency guided the development of the study design, prompting us to focus our attention on the characteristics of spoken language. At the end of the period of data acquisition, the results of the experimental investigations could help inform patients’ primary advisers, potentially optimising the care offer. Furthermore, during the individual assessment, the exceeding of predefined psychopathological thresholds (conversion to full-blown disease—Clinical Stage 2) will be communicated to the patient’s advisers to immediately adopt appropriate therapeutic measures.

Statistical analyses

Estimated sample size and statistical power

As reported by Hartmann and colleagues,36 literature'-based expectations of the 1-year transition rate in the CHARMS+ and CHARMS− groups are, respectively, 20% and 3%. To detect such a 6.7-fold increase as significant with 90% power, 5% significance level and 20% drop-out rate, a total of 180 subjects are required. Hence, we defined an expected sample size of n=180 (n=90 group A and n=90 group B).

Power calculations were performed by simulation with R software V.4.0.2. All the supporting material is available at a publicly repository.

Statistical analyses post data acquisition

Primary analysis will determine whether the rate of patient’s conversion to full-blown disease (Clinical Stage 2) in CHARMS+ patient’s group differs from the rate in CHARMS− group. Pearson’s χ² test will be performed on the 2×2 contingency table of patient’s group (CHARMS+ or CHARMS−) and the occurrence of the Stage 2 conversion over a fixed follow-up time (which is one or 2 years of observation in preliminary and final analyses, respectively).

In the presence of heterogeneity in drop-out rates between the two patient’s groups the main analysis will be performed both in the complete case data set and under the assumption of the conversion rate to 1 (most conservative approach) for all drop-out patients.

In the presence of significative imbalances of patient’s characteristics between the two patient’s groups a multivariable logistical regression analysis will be used to adjust for potential confounders. Both raw and adjusted analysis will be reported with OR, 95% CIs and p values. A sensitivity analysis considering the conversion event as a time dependent outcome will be preplanned to make full usage of all available follow-up periods and make each day of observation contribute to the final conversion rate estimation.

In the case of difference between the rate of Stage 2 conversion in CHARMS+ and CHARMS− patients three classification analyses will be performed to detect: (1) the clinical alterations, (2) the CHARMS subgroups and (3) the spoken language features most associated to the Stage 2 conversion regardless the CHARMS group. Within these three analyses variable selection procedures might be applied, such as: bidirectional stepwise, ridge, lasso or elastic net to identify the best combination of predictors to identify Stage 2 conversion. Receiver Operating Characteristic curve, the area below such curve and metrics derived from the 2×2 prediction-observation confusion matrix (such as sensitivity and specificity, positive and negative predictive values) will be used to estimate model’s predictive and discriminative capabilities. Internal validation procedures such as k-fold cross validation may also be required to improve model generalisability. If needed, external information source such as real-world prevalence rates may be used to contextualise model performance.

Due to the high number of tested, there is a high risk of labelling some false, spurious, associations as significative. Therefore, we define a three-step strategy aimed at mitigating this risk. First, we prospectively list a set of characteristics (N=41, online supplemental material) that will be tested for interaction with CHARMS group in Stage 2 conversion rate. Second, the p values to test these prespecified characteristic associations to the risk of Stage 2 conversion will be presented using Bonferroni correction for multiple comparisons based on the number of variables in the list (ie, 41, regardless of any data collection issues that may emerge during the study execution). Third, due to the data driven nature of the audio processing approach, other unplanned analyses on characteristics yet to be defined are expected to be performed; such analyses will be explicitly labelled as exploratory, and the reader will be acknowledged in the result presentation (and trough this protocol) to carefully look at the findings merely as ‘hypothesis generating’.

Supplemental material

[bmjopen-2022-066642supp001.pdf]

All analyses will be performed with R software (or equivalent statistical software) and uploaded in a public repository to guarantee the transparency and replicability of any finding.

Ethics and dissemination

The methodology described in this study adheres to ethical principles as formulated in the Declaration of Helsinki and is compatible with International Conference on Harmonization (ICH)-good clinical practice. The research protocol was reviewed and approved by two different Ethics Committees (CER Liguria approval code: 591/2020 – id.10993; Comitato Etico dell’Area Vasta Emilia Nord approval code: 2022/0071963). Participants will provide their written informed consent prior to study enrolment and parental consent will be needed in the case of participants aged less than 18 years old. Experimental results will be carefully shared through publication in peer-reviewed journals, to ensure proper data reproducibility.

Ethics statements

Patient consent for publication

Acknowledgments

This work was developed within the framework of the DINOGMI Department of Excellence of MIUR 2018-2022 (Law 232/2016).

References

↵
1. Bleuler E
. Dementia praecox, oder Gruppe der Schizophrenien. 1911.
↵
1. Hauser MD
. The evolution of communication. Cambridge, MA: MIT Press, 1996: 760.
↵
1. Penn DC,
2. Holyoak KJ,
3. Povinelli DJ
. Darwin’s mistake: explaining the discontinuity between human and nonhuman minds. Behav Brain Sci 2008;31:109–30. doi:10.1017/S0140525X08003543
OpenUrl CrossRef PubMed
↵
1. Tomasello M
. Origins of human communication. Cambridge, MA: MIT Press, 2010: 408.
↵
1. Vouloumanos A,
2. Waxman SR
. Listen up! speech is for thinking during infancy. Trends Cogn Sci 2014;18:642–6. doi:10.1016/j.tics.2014.10.001
OpenUrl CrossRef PubMed
↵
1. Eigsti I-M,
2. de Marchena AB,
3. Schuh JM, et al
. Language acquisition in autism spectrum disorders: a developmental review. Res Autism Spectr Disord 2011;5:681–91. doi:10.1016/j.rasd.2010.09.001
OpenUrl
↵
1. Schaller S,
2. Sacks O
. A man without words. Berkeley: University of California Press, 1991: 204.
↵
1. Humphries T,
2. Kushalnagar P,
3. Mathur G, et al
. Ensuring language acquisition for deaf children: what linguists can do. Language 2014;90:e31–52. doi:10.1353/lan.2014.0036
OpenUrl
↵
1. Tattersall I
. The world from beginnings to 4000 BCE. Oxford: Oxford University Press, 2008.
↵
1. Tattersall I
. An evolutionary context for the emergence of language. Language Sciences 2014;46:199–206. doi:10.1016/j.langsci.2014.06.011
OpenUrl
↵
1. Heidegger M,
2. Krell DF
. Basic writings: from being and time (1927) to the task of thinking; 1964.
↵
1. Cassirer E
. An essay on man an introduction to a philosophy of human culture. Yale University Press, 1944.
↵
1. Rao RPN,
2. Ballard DH
. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 1999;2:79–87. doi:10.1038/4580
OpenUrl CrossRef PubMed Web of Science
↵
1. Helmholtz H
. Helmholtz’s treatise on physiological optics. Dover Publications, 1962.
↵
1. Lupyan G,
2. Clark A
. Words and the world: predictive coding and the language-perception-cognition interface. Curr Dir Psychol Sci 2015;24:279–84.
OpenUrl CrossRef
↵
1. Kessler RC,
2. Berglund P,
3. Demler O, et al
. Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the National comorbidity survey replication. Arch Gen Psychiatry 2005;62:593. doi:10.1001/archpsyc.62.6.593
OpenUrl CrossRef PubMed Web of Science
↵
1. Jones PB
. Adult mental health disorders and their age at onset. Br J Psychiatry 2013;202:s5–10. doi:10.1192/bjp.bp.112.119164
OpenUrl Abstract/FREE Full Text
↵
1. Potash JB
. Carving chaos: genetics and the classification of mood and psychotic syndromes. Harv Rev Psychiatry 2006;14:47–63. doi:10.1080/10673220600655780
OpenUrl CrossRef PubMed Web of Science
↵
1. Potash JB,
2. Bienvenu OJ
. Neuropsychiatric disorders: shared genetics of bipolar disorder and schizophrenia. Nat Rev Neurol 2009;5:299–300. doi:10.1038/nrneurol.2009.71
OpenUrl PubMed
↵
1. Ivleva EI,
2. Morris DW,
3. Moates AF, et al
. Genetics and intermediate phenotypes of the schizophrenia--bipolar disorder boundary. Neurosci Biobehav Rev 2010;34:897–921. doi:10.1016/j.neubiorev.2009.11.022
OpenUrl CrossRef PubMed Web of Science
↵
1. McGorry PD,
2. Hickie IB,
3. Yung AR, et al
. Clinical staging of psychiatric disorders: a heuristic framework for choosing earlier, safer and more effective interventions. Aust N Z J Psychiatry 2006;40:616–22. doi:10.1080/j.1440-1614.2006.01860.x
OpenUrl CrossRef PubMed Web of Science
↵
1. McGorry P
. Transition to adulthood: the critical period for pre-emptive, disease-modifying care for schizophrenia and related disorders. Schizophr Bull 2011;37:524–30. doi:10.1093/schbul/sbr027
OpenUrl CrossRef PubMed Web of Science
↵
1. Kessler RC,
2. Birnbaum H,
3. Demler O, et al
. The prevalence and correlates of nonaffective psychosis in the national comorbidity survey replication (NCS-R). Biol Psychiatry 2005;58:668–76. doi:10.1016/j.biopsych.2005.04.034
OpenUrl CrossRef PubMed Web of Science
↵
1. Kessler RC,
2. Ormel J,
3. Petukhova M, et al
. Development of lifetime comorbidity in the world health organization world mental health surveys. Arch Gen Psychiatry 2011;68:90–100. doi:10.1001/archgenpsychiatry.2010.180
OpenUrl CrossRef PubMed Web of Science
↵
1. Merikangas KR,
2. Herrell R,
3. Swendsen J, et al
. Specificity of bipolar spectrum conditions in the comorbidity of mood and substance use disorders. Arch Gen Psychiatry 2008;65:47. doi:10.1001/archgenpsychiatry.2007.18
OpenUrl CrossRef PubMed Web of Science
↵
1. Merikangas KR,
2. He J-P,
3. Burstein M, et al
. Lifetime prevalence of mental disorders in U.S. adolescents: results from the National comorbidity survey replication -- adolescent supplement (NCS-A). J Am Acad Child Adolesc Psychiatry 2010;49:980–9. doi:10.1016/j.jaac.2010.05.017
OpenUrl CrossRef PubMed Web of Science
↵
1. Merikangas KR,
2. Cui L,
3. Kattan G, et al
. Mania with and without depression in a community sample of US adolescents. Arch Gen Psychiatry 2012;69:943–51. doi:10.1001/archgenpsychiatry.2012.38
OpenUrl CrossRef PubMed
↵
1. Murray GK,
2. Jones PB
. Psychotic symptoms in young people without psychotic illness: mechanisms and meaning. Br J Psychiatry 2012;201:4–6. doi:10.1192/bjp.bp.111.107789
OpenUrl Abstract/FREE Full Text
↵
1. Ormel J,
2. Raven D,
3. van Oort F, et al
. Mental health in Dutch adolescents: a trails report on prevalence, severity, age of onset, continuity and co-morbidity of DSM disorders. Psychol Med 2015;45:345–60. doi:10.1017/S0033291714001469
OpenUrl CrossRef PubMed
↵
1. Yung AR,
2. McGorry PD
. The prodromal phase of first-episode psychosis: past and current conceptualizations. Schizophr Bull 1996;22:353–70. doi:10.1093/schbul/22.2.353
OpenUrl CrossRef PubMed Web of Science
↵
1. Malla AK,
2. Norman RMG
. Prodromal symptoms in schizophrenia. Br J Psychiatry 1994;164:487–93. doi:10.1192/bjp.164.4.487
OpenUrl Abstract/FREE Full Text
↵
1. McGorry PD,
2. Mei C
. Ultra-high-risk paradigm: lessons learnt and new directions. Evid Based Ment Health 2018;21:131–3. doi:10.1136/ebmental-2018-300061
OpenUrl Abstract/FREE Full Text
↵
1. Lin A,
2. Wood SJ,
3. Nelson B, et al
. Outcomes of nontransitioned cases in a sample at ultra-high risk for psychosis. Am J Psychiatry 2015;172:249–58. doi:10.1176/appi.ajp.2014.13030418
OpenUrl CrossRef PubMed
↵
1. Rutigliano G,
2. Valmaggia L,
3. Landi P, et al
. Persistence or recurrence of non-psychotic comorbid mental disorders associated with 6-year poor functional outcomes in patients at ultra high risk for psychosis. J Affect Disord 2016;203:101–10. doi:10.1016/j.jad.2016.05.053
OpenUrl
↵
1. Beck K,
2. Andreou C,
3. Studerus E, et al
. Clinical and functional long-term outcome of patients at clinical high risk (CHR) for psychosis without transition to psychosis: a systematic review. Schizophr Res 2019;210:39–47. doi:10.1016/j.schres.2018.12.047
OpenUrl
↵
1. Hartmann JA,
2. Nelson B,
3. Spooner R, et al
. Broad clinical high-risk mental state (charms): methodology of a cohort study validating criteria for pluripotent risk. Early Interv Psychiatry 2019;13:379–86. doi:10.1111/eip.12483
OpenUrl CrossRef
↵
1. Shah JL,
2. Scott J,
3. McGorry PD, et al
. Transdiagnostic clinical staging in youth mental health: a first international consensus statement. World Psychiatry 2020;19:233–42. doi:10.1002/wps.20745
OpenUrl CrossRef PubMed
↵
1. Yung AR,
2. Yuen HP,
3. McGorry PD, et al
. Mapping the onset of psychosis: the comprehensive assessment of at-risk mental states. Aust N Z J Psychiatry 2005;39:964–71. doi:10.1080/j.1440-1614.2005.01714.x
OpenUrl CrossRef PubMed Web of Science
↵
1. Hartmann JA,
2. McGorry PD,
3. Destree L, et al
. Pluripotential risk and clinical staging: theoretical considerations and preliminary data from a transdiagnostic risk identification approach. Front Psychiatry 2020;11:553578. doi:10.3389/fpsyt.2020.553578
OpenUrl CrossRef
↵
1. Harrow M,
2. Quinlan D
. Is disordered thinking unique to schizophrenia? Arch Gen Psychiatry 1977;34:15–21. doi:10.1001/archpsyc.1977.01770130017001
OpenUrl CrossRef PubMed Web of Science
↵
1. Andreasen NC,
2. Grove WM
. Thought, language, and communication in schizophrenia: diagnosis and prognosis. Schizophr Bull 1986;12:348–59. doi:10.1093/schbul/12.3.348
OpenUrl CrossRef PubMed Web of Science
↵
1. Corcoran CM,
2. Mittal VA,
3. Bearden CE, et al
. Language as a biomarker for psychosis: a natural language processing approach. Schizophr Res 2020;226:158–66. doi:10.1016/j.schres.2020.04.032
OpenUrl CrossRef PubMed
↵
1. Voleti R,
2. Liss JM,
3. Berisha V
. A review of automated speech and language features for assessment of cognitive and thought disorders. IEEE J Sel Top Signal Process 2020;14:282–98. doi:10.1109/jstsp.2019.2952087
OpenUrl CrossRef PubMed
↵
1. Covington MA,
2. McFall JD
. Cutting the gordian knot: the moving-average type–token ratio (MATTR). J Quant Linguist 2010;17:94–100. doi:10.1080/09296171003643098
OpenUrl CrossRef
↵
1. Asgari M,
2. Kaye J,
3. Dodge H
. Predicting mild cognitive impairment from spontaneous spoken utterances. Alzheimers Dement (N Y) 2017;3:219–28. doi:10.1016/j.trci.2017.01.006
OpenUrl
↵
1. Roark B,
2. Mitchell M,
3. Hollingshead K
. Syntactic complexity measures for detecting mild cognitive impairment - ACL anthology. BioNLP 2007:1–8. doi:10.3115/1572392.1572394
↵
1. Bucks RS,
2. Singh S,
3. Cuerden JM, et al
. Analysis of spontaneous, conversational speech in dementia of Alzheimer type: evaluation of an objective technique for analysing lexical performance. Aphasiology 2000;14:71–91. doi:10.1080/026870300401603
OpenUrl CrossRef
↵
1. Fraser KC,
2. Meltzer JA,
3. Rudzicz F
. Linguistic features identify Alzheimer’s disease in narrative speech. J Alzheimers Dis 2016;49:407–22. doi:10.3233/JAD-150520
OpenUrl
↵
1. Sadeghian R,
2. Schaffer JD,
3. Zahorian SA
. Speech processing approach for diagnosing dementia in an early stage. INTERSPEECH 2017; ISCA, 2017:2705–9 doi:10.21437/Interspeech.2017-1712
↵
1. Mota NB,
2. Vasconcelos NAP,
3. Lemos N, et al
. Speech graphs provide a quantitative measure of thought disorder in psychosis. PLoS One 2012;7:e34928. doi:10.1371/journal.pone.0034928
↵
1. Landauer TK,
2. Foltz PW,
3. Laham D
. An introduction to latent semantic analysis. Discourse Processes 1998;25:259–84. doi:10.1080/01638539809545028
OpenUrl CrossRef
↵
1. Mikolov T,
2. Chen K,
3. Corrado G, et al
. Efficient estimation of word representations in vector space. ICLR, 2013.
↵
1. Pennington J,
2. Socher R,
3. Manning CD
. Glove: global vectors for word representation. EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference; 2014:1532–43 doi:10.3115/v1/D14-1162
↵
1. Elvevåg B,
2. Foltz PW,
3. Weinberger DR, et al
. Quantifying incoherence in speech: an automated methodology and novel application to schizophrenia. Schizophr Res 2007;93:304–16. doi:10.1016/j.schres.2007.03.001
OpenUrl CrossRef PubMed Web of Science
↵
1. Covington MA,
2. He C,
3. Brown C, et al
. Schizophrenia and the structure of language: the linguist’s view. Schizophr Res 2005;77:85–98. doi:10.1016/j.schres.2005.01.016
OpenUrl CrossRef PubMed Web of Science
↵
1. Bedi G,
2. Carrillo F,
3. Cecchi GA, et al
. Automated analysis of free speech predicts psychosis onset in high-risk youths. NPJ Schizophr 2015;1:15030. doi:10.1038/npjschz.2015.30
OpenUrl PubMed
↵
1. Corcoran CM,
2. Carrillo F,
3. Fernández-Slezak D, et al
. Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 2018;17:67–75. doi:10.1002/wps.20491
OpenUrl CrossRef
↵
1. Rezaii N,
2. Walker E,
3. Wolff P
. A machine learning approach to predicting psychosis using semantic density and latent content analysis. NPJ Schizophr 2019;5:9. doi:10.1038/s41537-019-0077-9
↵
1. Morgan SE,
2. Diederen K,
3. Vértes PE, et al
. Natural language processing markers in first episode psychosis and people at clinical high-risk. Transl Psychiatry 2021;11:630. doi:10.1038/s41398-021-01722-y
↵
1. Pelizza L,
2. Paterlini F,
3. Azzali S, et al
. The approved Italian version of the comprehensive assessment of at-risk mental states (CAARMS-ITA): field test and psychometric features. Early Interv Psychiatry 2019;13:810–7. doi:10.1111/eip.12669
OpenUrl
↵
1. First MB,
2. Williams JB,
3. Karg RS, et al
. Structured clinical interview for DSM-5—research version (SCID-5 for DSM-5, research version; SCID-5-RV) [Preprint at]. Arlington, VA: American Psychiatric Association, 2015: 1–94.
↵
1. First MB,
2. Williams JBW,
3. Benjamin LS, et al
. Structured clinical interview for DSM-5 personality disorders [Preprint at]. 2016.
↵
1. Goldman HH,
2. Skodol AE,
3. Lave TR
. Revising axis V for DSM-IV: a review of measures of social functioning. Am J Psychiatry 1992;149:1148–56. doi:10.1176/ajp.149.9.1148
OpenUrl CrossRef PubMed Web of Science
↵
1. Auther A,
2. Smith C,
3. Cornblatt B
. Global functioning: social scale (GF: social) [Preprint at]. 2006.
↵
1. Niendam TA,
2. Bearden CE,
3. Johnson JK, et al
. Global functioning: role scale (GF: role) [Preprint at]. 2006.
↵
1. Rush AJ,
2. Trivedi MH,
3. Ibrahim HM, et al
. The 16-item quick inventory of depressive symptomatology (QIDS), clinician rating (QIDS-C), and self-report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biol Psychiatry 2003;54:573–83. doi:10.1016/s0006-3223(02)01866-8
OpenUrl CrossRef PubMed Web of Science
↵
1. Young RC,
2. Biggs JT,
3. Ziegler VE, et al
. A rating scale for mania: reliability, validity and sensitivity. Br J Psychiatry 1978;133:429–35. doi:10.1192/bjp.133.5.429
OpenUrl Abstract/FREE Full Text
↵
1. Lam RW,
2. Michalaak EE,
3. Swinson RP
. Assessment scales in depression, mania, and anxiety. Taylor & Francis, 2005. doi:10.4324/9780203308356
↵
1. Palma A,
2. Pancheri P
. Scale di valutazione e di misura dei sintomi psichiatrici. In: Masson Italia M, ed. Trattato Italiano di Psichiatria. 1999.
↵
1. Lovibond SH,
2. Lovibond PF
. Manual for the depression anxiety stress scales; 1995.
↵
1. Patrick J,
2. Dyck M,
3. Bramston P
. Depression anxiety stress scale: is it valid for children and adolescents? J Clin Psychol 2010;66:996–1007. doi:10.1002/jclp.20696
OpenUrl CrossRef PubMed
↵
1. Nassir Ghaemi S,
2. Miller CJ,
3. Berv DA, et al
. Sensitivity and specificity of a new bipolar spectrum diagnostic scale. J Affect Disord 2005;84:273–7. doi:10.1016/S0165-0327(03)00196-4
OpenUrl CrossRef PubMed
↵
1. Baldassano CF
. Assessment tools for screening and monitoring bipolar disorder. Bipolar Disord 2005;7 Suppl 1:8–15. doi:10.1111/j.1399-5618.2005.00189.x
OpenUrl
↵
1. Krueger RF,
2. Derringer J,
3. Markon KE, et al
. Initial construction of a maladaptive personality trait model and inventory for DSM-5. Psychol Med 2012;42:1879–90. doi:10.1017/S0033291711002674
OpenUrl CrossRef PubMed
↵
1. Anderson JL,
2. Sellbom M,
3. Salekin RT
. Utility of the personality inventory for DSM-5-brief form (PID-5-BF) in the measurement of maladaptive personality and psychopathology. Assessment 2018;25:596–607. doi:10.1177/1073191116676889
OpenUrl
↵
1. Fossati A,
2. Somma A,
3. Borroni S, et al
. A head-to-head comparison of the personality inventory for DSM-5 (PID-5) with the personality diagnostic questionnaire-4 (PDQ-4) in predicting the general level of personality pathology among community dwelling subjects. J Pers Disord 2016;30:82–94. doi:10.1521/pedi_2015_29_184
OpenUrl
↵
1. van der Gaag M,
2. Schütz C,
3. Ten Napel A, et al
. Development of the davos assessment of cognitive biases scale (DACOBS). Schizophr Res 2013;144:63–71. doi:10.1016/j.schres.2012.12.010
OpenUrl CrossRef PubMed
↵
1. Bastiaens T,
2. Claes L,
3. Smits D, et al
. The cognitive biases questionnaire for psychosis (CBQ-P) and the davos assessment of cognitive biases (DACOBS): validation in a flemish sample of psychotic patients and healthy controls. Schizophr Res 2013;147:310–4. doi:10.1016/j.schres.2013.04.037
OpenUrl
↵
1. Roenneberg T,
2. Wirz-Justice A,
3. Merrow M
. Life between clocks: daily temporal patterns of human chronotypes. J Biol Rhythms 2003;18:80–90. doi:10.1177/0748730402239679
OpenUrl CrossRef PubMed Web of Science
↵
1. Bastien CH,
2. Vallières A,
3. Morin CM
. Validation of the insomnia severity index as an outcome measure for insomnia research. Sleep Med 2001;2:297–307. doi:10.1016/s1389-9457(00)00065-4
OpenUrl CrossRef PubMed Web of Science
↵
1. Morin CM,
2. Belleville G,
3. Bélanger L, et al
. The insomnia severity index: psychometric indicators to detect insomnia cases and evaluate treatment response. Sleep 2011;34:601–8. doi:10.1093/sleep/34.5.601
OpenUrl CrossRef PubMed Web of Science
↵
1. Chung KF,
2. Kan KKK,
3. Yeung WF
. Assessing insomnia in adolescents: comparison of insomnia severity index, Athens insomnia scale and sleep quality index. Sleep Med 2011;12:463–70. doi:10.1016/j.sleep.2010.09.019
OpenUrl CrossRef PubMed Web of Science
↵
1. Castronovo V,
2. Galbiati A,
3. Marelli S, et al
. Validation study of the Italian version of the insomnia severity index (ISI). Neurol Sci 2016;37:1517–24. doi:10.1007/s10072-016-2620-z
OpenUrl CrossRef
↵
1. Nasreddine ZS,
2. Phillips NA,
3. Bédirian V, et al
. The Montreal cognitive assessment, MoCA: a brief screening tool for mild cognitive impairment. J Am Geriatr Soc 2005;53:695–9. doi:10.1111/j.1532-5415.2005.53221.x
OpenUrl CrossRef PubMed Web of Science
↵
1. Pirrotta F,
2. Timpano F,
3. Bonanno L, et al
. Italian validation of montreal cognitive assessment. Eur J Psychol Assess 2015;31:131–7. doi:10.1027/1015-5759/a000217
OpenUrl
↵
1. Ben-David S,
2. Birnbaum ML,
3. Eilenberg ME, et al
. The subjective experience of youths at clinically high risk of psychosis: a qualitative study. Psychiatr Serv 2014;65:1499–501. doi:10.1176/appi.ps.201300527
OpenUrl CrossRef PubMed
↵
1. Brunato D,
2. Cimino A,
3. Dell’Orletta F, et al
. Profiling-UD: a tool for linguistic profiling of texts. In: ACL Anthology. 2020: 7145–51.
↵
1. Straka M,
2. Hajič J,
3. Straková J
. UDPipe: trainable pipeline for processing CoNLL-U files performing tokenization, morphological analysis, POS tagging and parsing - ACL anthology. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). 2016: 4290–7.
↵
1. Nivre J,
2. de Marneffe M-C,
3. Ginter F, et al
. Universal dependencies v1: a multilingual treebank collection; 2016.
↵
1. Lu X
. Automatic measurement of syntactic complexity in child language acquisition. IJCL 2009;14:3–28. doi:10.1075/ijcl.14.1.02lu
OpenUrl
↵
1. Lubetich S,
2. Sagae K
. Data-driven measurement of child language development with simple syntactic templates - ACL anthology. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2014: 2151–60.
↵
1. Weiss Z,
2. Meurers D
. Analyzing linguistic complexity and accuracy in academic language development of German across elementary and secondary school. In: ACL 2019 - Innovative Use of NLP for Building Educational Applications, BEA 2019 - Proceedings of the 14th Workshop. 2019: 380–93. doi:10.18653/v1/W19-4440
↵
1. Miaschi A,
2. Brunato D,
3. Dell’Orletta F, et al
. What makes my model perplexed? A linguistic investigation on neural language models perplexity [Online]. Proceedings of Deep Learning Inside Out (DeeLIO): The 2nd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures 2021:40–7.
↵
1. Prud’hommeaux E. T,
2. Roark B,
3. Black L. M, et al
. Classification of atypical language in autism. Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics 2011:88–96.
↵
1. Vaswani A,
2. Shazeer N,
3. Parmer N, et al
. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS’17). Red Hook, NY, USA: Curran Associates Inc, 2017: 6000–10. Available: https://papers.nips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html
↵
1. Devlin J,
2. Chang M-W,
3. Lee K, et al
. Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North 2019:4171–86. doi:10.18653/v1/N19-1423
↵
1. Costanza A,
2. Amerio A,
3. Aguglia A, et al
. “Hard to say, hard to understand, hard to live”: possible associations between neurologic language impairments and suicide risk. Brain Sci 2021;11:1594. doi:10.3390/brainsci11121594

Supplementary materials

Supplementary Data

This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Data supplement 1

Footnotes

Twitter @lisa_marzano
Collaborators The LNG-PSY Study Investigators: Alessandra Costanza (Department of Psychiatry, Faculty of Medicine, University of Geneva (UNIGE), Geneva, Switzerland). Francesca Sibilla, Pietro Calcagno (Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), Section of Psychiatry, University of Genoa, Genoa, Italy. IRCCS Ospedale Policlinico San Martino, Genoa, Italy). Sara Patti, Gabriella Molino (Department of Mental Health and Pathological Addictions, Genoa Local Health Authority, Genoa, Italy). Andrea Escelsior (Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), Section of Psychiatry, University of Genoa, Genoa, Italy. IRCCS Ospedale Policlinico San Martino, Genoa, Italy). Alice Trabucco (Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health (DINOGMI), Section of Psychiatry, University of Genoa, Genoa, Italy). Lisa Marzano (Department of Psychology, School of Science and Technology, Middlesex University, London, UK). Dominique Brunato, Andrea Amelio Ravelli (Italian Natural Language Processing Lab, Institute of Computational Linguistics 'Antonio Zampolli' (ILC-CNR), Pisa, Italy). Marco Cappucciati (Department of Mental Health and Pathological Addictions, Piacenza Local Health Authority, Piacenza, Italy. Early Psychosis: Interventions and Clinical-detection (EPIC) lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK). Roberta Fiocchi, Gisella Guerzoni, Davide Maravita, Fabio Macchetti, Elisa Mori, Chiara Anna Paglia, Federica Roscigno, Antonio Saginario (Department of Mental Health and Pathological Addictions, Piacenza Local Health Authority, Piacenza, Italy).
Contributors The authors have all contributed to the manuscript equally. LMag and AAm conceptualised and designed the study. LMag, EMon, IT, AT and DMart wrote the first draft of the protocol. OB, SC, MI and GL carried out a first methodological integration to optimise the experimental design. GM, SP, PC, FS, MC, RF, GG, DMara, FM, EMor, CAP, FR and AS further corrected the design to adapt it to different experimental settings, included in the multicentric context. LC conceptualised the planning of statistical analyses. FD, DB and AAR from Italian Natural Language Processing Lab provided a detailed project for linguistic data extraction and analysis. MA, GS, LMag, LMar, AC, AAg and AE carefully revised and approved the final version of the manuscript. Furthermore, a special thanks goes to the patient’s primary advisers for their contribution in reporting enrollable subjects.
Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests None declared.
Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

[1] ↵
Bleuler E
. Dementia praecox, oder Gruppe der Schizophrenien. 1911.

[2] Bleuler E

[3] ↵
Hauser MD
. The evolution of communication. Cambridge, MA: MIT Press, 1996: 760.

[4] Hauser MD

[5] ↵
Penn DC,
Holyoak KJ,
Povinelli DJ
. Darwin’s mistake: explaining the discontinuity between human and nonhuman minds. Behav Brain Sci 2008;31:109–30. doi:10.1017/S0140525X08003543
OpenUrl CrossRef PubMed

[6] Penn DC,

[7] Holyoak KJ,

[8] Povinelli DJ

[9] ↵
Tomasello M
. Origins of human communication. Cambridge, MA: MIT Press, 2010: 408.

[10] Tomasello M

[11] ↵
Vouloumanos A,
Waxman SR
. Listen up! speech is for thinking during infancy. Trends Cogn Sci 2014;18:642–6. doi:10.1016/j.tics.2014.10.001
OpenUrl CrossRef PubMed

[12] Vouloumanos A,

[13] Waxman SR

[14] ↵
Eigsti I-M,
de Marchena AB,
Schuh JM, et al
. Language acquisition in autism spectrum disorders: a developmental review. Res Autism Spectr Disord 2011;5:681–91. doi:10.1016/j.rasd.2010.09.001
OpenUrl

[15] Eigsti I-M,

[16] de Marchena AB,

[17] Schuh JM, et al

[18] ↵
Schaller S,
Sacks O
. A man without words. Berkeley: University of California Press, 1991: 204.

[19] Schaller S,

[20] Sacks O

[21] ↵
Humphries T,
Kushalnagar P,
Mathur G, et al
. Ensuring language acquisition for deaf children: what linguists can do. Language 2014;90:e31–52. doi:10.1353/lan.2014.0036
OpenUrl

[22] Humphries T,

[23] Kushalnagar P,

[24] Mathur G, et al

[25] ↵
Tattersall I
. The world from beginnings to 4000 BCE. Oxford: Oxford University Press, 2008.

[26] Tattersall I

[27] ↵
Tattersall I
. An evolutionary context for the emergence of language. Language Sciences 2014;46:199–206. doi:10.1016/j.langsci.2014.06.011
OpenUrl

[28] Tattersall I

[29] ↵
Heidegger M,
Krell DF
. Basic writings: from being and time (1927) to the task of thinking; 1964.

[30] Heidegger M,

[31] Krell DF

[32] ↵
Cassirer E
. An essay on man an introduction to a philosophy of human culture. Yale University Press, 1944.

[33] Cassirer E

[34] ↵
Rao RPN,
Ballard DH
. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 1999;2:79–87. doi:10.1038/4580
OpenUrl CrossRef PubMed Web of Science

[35] Rao RPN,

[36] Ballard DH

[37] ↵
Helmholtz H
. Helmholtz’s treatise on physiological optics. Dover Publications, 1962.

[38] Helmholtz H

[39] ↵
Lupyan G,
Clark A
. Words and the world: predictive coding and the language-perception-cognition interface. Curr Dir Psychol Sci 2015;24:279–84.
OpenUrl CrossRef

[40] Lupyan G,

[41] Clark A

[42] ↵
Kessler RC,
Berglund P,
Demler O, et al
. Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the National comorbidity survey replication. Arch Gen Psychiatry 2005;62:593. doi:10.1001/archpsyc.62.6.593
OpenUrl CrossRef PubMed Web of Science

[43] Kessler RC,

[44] Berglund P,

[45] Demler O, et al

[46] ↵
Jones PB
. Adult mental health disorders and their age at onset. Br J Psychiatry 2013;202:s5–10. doi:10.1192/bjp.bp.112.119164
OpenUrl Abstract/FREE Full Text

[47] Jones PB

[48] ↵
Potash JB
. Carving chaos: genetics and the classification of mood and psychotic syndromes. Harv Rev Psychiatry 2006;14:47–63. doi:10.1080/10673220600655780
OpenUrl CrossRef PubMed Web of Science

[49] Potash JB

[50] ↵
Potash JB,
Bienvenu OJ
. Neuropsychiatric disorders: shared genetics of bipolar disorder and schizophrenia. Nat Rev Neurol 2009;5:299–300. doi:10.1038/nrneurol.2009.71
OpenUrl PubMed

[51] Potash JB,

[52] Bienvenu OJ

[53] ↵
Ivleva EI,
Morris DW,
Moates AF, et al
. Genetics and intermediate phenotypes of the schizophrenia--bipolar disorder boundary. Neurosci Biobehav Rev 2010;34:897–921. doi:10.1016/j.neubiorev.2009.11.022
OpenUrl CrossRef PubMed Web of Science

[54] Ivleva EI,

[55] Morris DW,

[56] Moates AF, et al

[57] ↵
McGorry PD,
Hickie IB,
Yung AR, et al
. Clinical staging of psychiatric disorders: a heuristic framework for choosing earlier, safer and more effective interventions. Aust N Z J Psychiatry 2006;40:616–22. doi:10.1080/j.1440-1614.2006.01860.x
OpenUrl CrossRef PubMed Web of Science

[58] McGorry PD,

[59] Hickie IB,

[60] Yung AR, et al

[61] ↵
McGorry P
. Transition to adulthood: the critical period for pre-emptive, disease-modifying care for schizophrenia and related disorders. Schizophr Bull 2011;37:524–30. doi:10.1093/schbul/sbr027
OpenUrl CrossRef PubMed Web of Science

[62] McGorry P

[63] ↵
Kessler RC,
Birnbaum H,
Demler O, et al
. The prevalence and correlates of nonaffective psychosis in the national comorbidity survey replication (NCS-R). Biol Psychiatry 2005;58:668–76. doi:10.1016/j.biopsych.2005.04.034
OpenUrl CrossRef PubMed Web of Science

[64] Kessler RC,

[65] Birnbaum H,

[66] Demler O, et al

[67] ↵
Kessler RC,
Ormel J,
Petukhova M, et al
. Development of lifetime comorbidity in the world health organization world mental health surveys. Arch Gen Psychiatry 2011;68:90–100. doi:10.1001/archgenpsychiatry.2010.180
OpenUrl CrossRef PubMed Web of Science

[68] Kessler RC,

[69] Ormel J,

[70] Petukhova M, et al

[71] ↵
Merikangas KR,
Herrell R,
Swendsen J, et al
. Specificity of bipolar spectrum conditions in the comorbidity of mood and substance use disorders. Arch Gen Psychiatry 2008;65:47. doi:10.1001/archgenpsychiatry.2007.18
OpenUrl CrossRef PubMed Web of Science

[72] Merikangas KR,

[73] Herrell R,

[74] Swendsen J, et al

[75] ↵
Merikangas KR,
He J-P,
Burstein M, et al
. Lifetime prevalence of mental disorders in U.S. adolescents: results from the National comorbidity survey replication -- adolescent supplement (NCS-A). J Am Acad Child Adolesc Psychiatry 2010;49:980–9. doi:10.1016/j.jaac.2010.05.017
OpenUrl CrossRef PubMed Web of Science

[76] Merikangas KR,

[77] He J-P,

[78] Burstein M, et al

[79] ↵
Merikangas KR,
Cui L,
Kattan G, et al
. Mania with and without depression in a community sample of US adolescents. Arch Gen Psychiatry 2012;69:943–51. doi:10.1001/archgenpsychiatry.2012.38
OpenUrl CrossRef PubMed

[80] Merikangas KR,

[81] Cui L,

[82] Kattan G, et al

[83] ↵
Murray GK,
Jones PB
. Psychotic symptoms in young people without psychotic illness: mechanisms and meaning. Br J Psychiatry 2012;201:4–6. doi:10.1192/bjp.bp.111.107789
OpenUrl Abstract/FREE Full Text

[84] Murray GK,

[85] Jones PB

[86] ↵
Ormel J,
Raven D,
van Oort F, et al
. Mental health in Dutch adolescents: a trails report on prevalence, severity, age of onset, continuity and co-morbidity of DSM disorders. Psychol Med 2015;45:345–60. doi:10.1017/S0033291714001469
OpenUrl CrossRef PubMed

[87] Ormel J,

[88] Raven D,

[89] van Oort F, et al

[90] ↵
Yung AR,
McGorry PD
. The prodromal phase of first-episode psychosis: past and current conceptualizations. Schizophr Bull 1996;22:353–70. doi:10.1093/schbul/22.2.353
OpenUrl CrossRef PubMed Web of Science

[91] Yung AR,

[92] McGorry PD

[93] ↵
Malla AK,
Norman RMG
. Prodromal symptoms in schizophrenia. Br J Psychiatry 1994;164:487–93. doi:10.1192/bjp.164.4.487
OpenUrl Abstract/FREE Full Text

[94] Malla AK,

[95] Norman RMG

[96] ↵
McGorry PD,
Mei C
. Ultra-high-risk paradigm: lessons learnt and new directions. Evid Based Ment Health 2018;21:131–3. doi:10.1136/ebmental-2018-300061
OpenUrl Abstract/FREE Full Text

[97] McGorry PD,

[98] Mei C

[99] ↵
Lin A,
Wood SJ,
Nelson B, et al
. Outcomes of nontransitioned cases in a sample at ultra-high risk for psychosis. Am J Psychiatry 2015;172:249–58. doi:10.1176/appi.ajp.2014.13030418
OpenUrl CrossRef PubMed

[100] Lin A,

[101] Wood SJ,

[102] Nelson B, et al

[103] ↵
Rutigliano G,
Valmaggia L,
Landi P, et al
. Persistence or recurrence of non-psychotic comorbid mental disorders associated with 6-year poor functional outcomes in patients at ultra high risk for psychosis. J Affect Disord 2016;203:101–10. doi:10.1016/j.jad.2016.05.053
OpenUrl

[104] Rutigliano G,

[105] Valmaggia L,

[106] Landi P, et al

[107] ↵
Beck K,
Andreou C,
Studerus E, et al
. Clinical and functional long-term outcome of patients at clinical high risk (CHR) for psychosis without transition to psychosis: a systematic review. Schizophr Res 2019;210:39–47. doi:10.1016/j.schres.2018.12.047
OpenUrl

[108] Beck K,

[109] Andreou C,

[110] Studerus E, et al

[111] ↵
Hartmann JA,
Nelson B,
Spooner R, et al
. Broad clinical high-risk mental state (charms): methodology of a cohort study validating criteria for pluripotent risk. Early Interv Psychiatry 2019;13:379–86. doi:10.1111/eip.12483
OpenUrl CrossRef

[112] Hartmann JA,

[113] Nelson B,

[114] Spooner R, et al

[115] ↵
Shah JL,
Scott J,
McGorry PD, et al
. Transdiagnostic clinical staging in youth mental health: a first international consensus statement. World Psychiatry 2020;19:233–42. doi:10.1002/wps.20745
OpenUrl CrossRef PubMed

[116] Shah JL,

[117] Scott J,

[118] McGorry PD, et al

[119] ↵
Yung AR,
Yuen HP,
McGorry PD, et al
. Mapping the onset of psychosis: the comprehensive assessment of at-risk mental states. Aust N Z J Psychiatry 2005;39:964–71. doi:10.1080/j.1440-1614.2005.01714.x
OpenUrl CrossRef PubMed Web of Science

[120] Yung AR,

[121] Yuen HP,

[122] McGorry PD, et al

[123] ↵
Hartmann JA,
McGorry PD,
Destree L, et al
. Pluripotential risk and clinical staging: theoretical considerations and preliminary data from a transdiagnostic risk identification approach. Front Psychiatry 2020;11:553578. doi:10.3389/fpsyt.2020.553578
OpenUrl CrossRef

[124] Hartmann JA,

[125] McGorry PD,

[126] Destree L, et al

[127] ↵
Harrow M,
Quinlan D
. Is disordered thinking unique to schizophrenia? Arch Gen Psychiatry 1977;34:15–21. doi:10.1001/archpsyc.1977.01770130017001
OpenUrl CrossRef PubMed Web of Science

[128] Harrow M,

[129] Quinlan D

[130] ↵
Andreasen NC,
Grove WM
. Thought, language, and communication in schizophrenia: diagnosis and prognosis. Schizophr Bull 1986;12:348–59. doi:10.1093/schbul/12.3.348
OpenUrl CrossRef PubMed Web of Science

[131] Andreasen NC,

[132] Grove WM

[133] ↵
Corcoran CM,
Mittal VA,
Bearden CE, et al
. Language as a biomarker for psychosis: a natural language processing approach. Schizophr Res 2020;226:158–66. doi:10.1016/j.schres.2020.04.032
OpenUrl CrossRef PubMed

[134] Corcoran CM,

[135] Mittal VA,

[136] Bearden CE, et al

[137] ↵
Voleti R,
Liss JM,
Berisha V
. A review of automated speech and language features for assessment of cognitive and thought disorders. IEEE J Sel Top Signal Process 2020;14:282–98. doi:10.1109/jstsp.2019.2952087
OpenUrl CrossRef PubMed

[138] Voleti R,

[139] Liss JM,

[140] Berisha V

[141] ↵
Covington MA,
McFall JD
. Cutting the gordian knot: the moving-average type–token ratio (MATTR). J Quant Linguist 2010;17:94–100. doi:10.1080/09296171003643098
OpenUrl CrossRef

[142] Covington MA,

[143] McFall JD

[144] ↵
Asgari M,
Kaye J,
Dodge H
. Predicting mild cognitive impairment from spontaneous spoken utterances. Alzheimers Dement (N Y) 2017;3:219–28. doi:10.1016/j.trci.2017.01.006
OpenUrl

[145] Asgari M,

[146] Kaye J,

[147] Dodge H

[148] ↵
Roark B,
Mitchell M,
Hollingshead K
. Syntactic complexity measures for detecting mild cognitive impairment - ACL anthology. BioNLP 2007:1–8. doi:10.3115/1572392.1572394

[149] Roark B,

[150] Mitchell M,

[151] Hollingshead K

[152] ↵
Bucks RS,
Singh S,
Cuerden JM, et al
. Analysis of spontaneous, conversational speech in dementia of Alzheimer type: evaluation of an objective technique for analysing lexical performance. Aphasiology 2000;14:71–91. doi:10.1080/026870300401603
OpenUrl CrossRef

[153] Bucks RS,

[154] Singh S,

[155] Cuerden JM, et al

[156] ↵
Fraser KC,
Meltzer JA,
Rudzicz F
. Linguistic features identify Alzheimer’s disease in narrative speech. J Alzheimers Dis 2016;49:407–22. doi:10.3233/JAD-150520
OpenUrl

[157] Fraser KC,

[158] Meltzer JA,

[159] Rudzicz F

[160] ↵
Sadeghian R,
Schaffer JD,
Zahorian SA
. Speech processing approach for diagnosing dementia in an early stage. INTERSPEECH 2017; ISCA, 2017:2705–9 doi:10.21437/Interspeech.2017-1712

[161] Sadeghian R,

[162] Schaffer JD,

[163] Zahorian SA

[164] ↵
Mota NB,
Vasconcelos NAP,
Lemos N, et al
. Speech graphs provide a quantitative measure of thought disorder in psychosis. PLoS One 2012;7:e34928. doi:10.1371/journal.pone.0034928

[165] Mota NB,

[166] Vasconcelos NAP,

[167] Lemos N, et al

[168] ↵
Landauer TK,
Foltz PW,
Laham D
. An introduction to latent semantic analysis. Discourse Processes 1998;25:259–84. doi:10.1080/01638539809545028
OpenUrl CrossRef

[169] Landauer TK,

[170] Foltz PW,

[171] Laham D

[172] ↵
Mikolov T,
Chen K,
Corrado G, et al
. Efficient estimation of word representations in vector space. ICLR, 2013.

[173] Mikolov T,

[174] Chen K,

[175] Corrado G, et al

[176] ↵
Pennington J,
Socher R,
Manning CD
. Glove: global vectors for word representation. EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference; 2014:1532–43 doi:10.3115/v1/D14-1162

[177] Pennington J,

[178] Socher R,

[179] Manning CD

[180] ↵
Elvevåg B,
Foltz PW,
Weinberger DR, et al
. Quantifying incoherence in speech: an automated methodology and novel application to schizophrenia. Schizophr Res 2007;93:304–16. doi:10.1016/j.schres.2007.03.001
OpenUrl CrossRef PubMed Web of Science

[181] Elvevåg B,

[182] Foltz PW,

[183] Weinberger DR, et al

[184] ↵
Covington MA,
He C,
Brown C, et al
. Schizophrenia and the structure of language: the linguist’s view. Schizophr Res 2005;77:85–98. doi:10.1016/j.schres.2005.01.016
OpenUrl CrossRef PubMed Web of Science

[185] Covington MA,

[186] He C,

[187] Brown C, et al

[188] ↵
Bedi G,
Carrillo F,
Cecchi GA, et al
. Automated analysis of free speech predicts psychosis onset in high-risk youths. NPJ Schizophr 2015;1:15030. doi:10.1038/npjschz.2015.30
OpenUrl PubMed

[189] Bedi G,

[190] Carrillo F,

[191] Cecchi GA, et al

[192] ↵
Corcoran CM,
Carrillo F,
Fernández-Slezak D, et al
. Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 2018;17:67–75. doi:10.1002/wps.20491
OpenUrl CrossRef

[193] Corcoran CM,

[194] Carrillo F,

[195] Fernández-Slezak D, et al

[196] ↵
Rezaii N,
Walker E,
Wolff P
. A machine learning approach to predicting psychosis using semantic density and latent content analysis. NPJ Schizophr 2019;5:9. doi:10.1038/s41537-019-0077-9

[197] Rezaii N,

[198] Walker E,

[199] Wolff P

[200] ↵
Morgan SE,
Diederen K,
Vértes PE, et al
. Natural language processing markers in first episode psychosis and people at clinical high-risk. Transl Psychiatry 2021;11:630. doi:10.1038/s41398-021-01722-y

[201] Morgan SE,

[202] Diederen K,

[203] Vértes PE, et al

[204] ↵
Pelizza L,
Paterlini F,
Azzali S, et al
. The approved Italian version of the comprehensive assessment of at-risk mental states (CAARMS-ITA): field test and psychometric features. Early Interv Psychiatry 2019;13:810–7. doi:10.1111/eip.12669
OpenUrl

[205] Pelizza L,

[206] Paterlini F,

[207] Azzali S, et al

[208] ↵
First MB,
Williams JB,
Karg RS, et al
. Structured clinical interview for DSM-5—research version (SCID-5 for DSM-5, research version; SCID-5-RV) [Preprint at]. Arlington, VA: American Psychiatric Association, 2015: 1–94.

[209] First MB,

[210] Williams JB,

[211] Karg RS, et al

[212] ↵
First MB,
Williams JBW,
Benjamin LS, et al
. Structured clinical interview for DSM-5 personality disorders [Preprint at]. 2016.

[213] First MB,

[214] Williams JBW,

[215] Benjamin LS, et al

[216] ↵
Goldman HH,
Skodol AE,
Lave TR
. Revising axis V for DSM-IV: a review of measures of social functioning. Am J Psychiatry 1992;149:1148–56. doi:10.1176/ajp.149.9.1148
OpenUrl CrossRef PubMed Web of Science

[217] Goldman HH,

[218] Skodol AE,

[219] Lave TR

[220] ↵
Auther A,
Smith C,
Cornblatt B
. Global functioning: social scale (GF: social) [Preprint at]. 2006.

[221] Auther A,

[222] Smith C,

[223] Cornblatt B

[224] ↵
Niendam TA,
Bearden CE,
Johnson JK, et al
. Global functioning: role scale (GF: role) [Preprint at]. 2006.

[225] Niendam TA,

[226] Bearden CE,

[227] Johnson JK, et al

[228] ↵
Rush AJ,
Trivedi MH,
Ibrahim HM, et al
. The 16-item quick inventory of depressive symptomatology (QIDS), clinician rating (QIDS-C), and self-report (QIDS-SR): a psychometric evaluation in patients with chronic major depression. Biol Psychiatry 2003;54:573–83. doi:10.1016/s0006-3223(02)01866-8
OpenUrl CrossRef PubMed Web of Science

[229] Rush AJ,

[230] Trivedi MH,

[231] Ibrahim HM, et al

[232] ↵
Young RC,
Biggs JT,
Ziegler VE, et al
. A rating scale for mania: reliability, validity and sensitivity. Br J Psychiatry 1978;133:429–35. doi:10.1192/bjp.133.5.429
OpenUrl Abstract/FREE Full Text

[233] Young RC,

[234] Biggs JT,

[235] Ziegler VE, et al

[236] ↵
Lam RW,
Michalaak EE,
Swinson RP
. Assessment scales in depression, mania, and anxiety. Taylor & Francis, 2005. doi:10.4324/9780203308356

[237] Lam RW,

[238] Michalaak EE,

[239] Swinson RP

[240] ↵
Palma A,
Pancheri P
. Scale di valutazione e di misura dei sintomi psichiatrici. In: Masson Italia M, ed. Trattato Italiano di Psichiatria. 1999.

[241] Palma A,

[242] Pancheri P

[243] ↵
Lovibond SH,
Lovibond PF
. Manual for the depression anxiety stress scales; 1995.

[244] Lovibond SH,

[245] Lovibond PF

[246] ↵
Patrick J,
Dyck M,
Bramston P
. Depression anxiety stress scale: is it valid for children and adolescents? J Clin Psychol 2010;66:996–1007. doi:10.1002/jclp.20696
OpenUrl CrossRef PubMed

[247] Patrick J,

[248] Dyck M,

[249] Bramston P

[250] ↵
Nassir Ghaemi S,
Miller CJ,
Berv DA, et al
. Sensitivity and specificity of a new bipolar spectrum diagnostic scale. J Affect Disord 2005;84:273–7. doi:10.1016/S0165-0327(03)00196-4
OpenUrl CrossRef PubMed

[251] Nassir Ghaemi S,

[252] Miller CJ,

[253] Berv DA, et al

[254] ↵
Baldassano CF
. Assessment tools for screening and monitoring bipolar disorder. Bipolar Disord 2005;7 Suppl 1:8–15. doi:10.1111/j.1399-5618.2005.00189.x
OpenUrl

[255] Baldassano CF

[256] ↵
Krueger RF,
Derringer J,
Markon KE, et al
. Initial construction of a maladaptive personality trait model and inventory for DSM-5. Psychol Med 2012;42:1879–90. doi:10.1017/S0033291711002674
OpenUrl CrossRef PubMed

[257] Krueger RF,

[258] Derringer J,

[259] Markon KE, et al

[260] ↵
Anderson JL,
Sellbom M,
Salekin RT
. Utility of the personality inventory for DSM-5-brief form (PID-5-BF) in the measurement of maladaptive personality and psychopathology. Assessment 2018;25:596–607. doi:10.1177/1073191116676889
OpenUrl

[261] Anderson JL,

[262] Sellbom M,

[263] Salekin RT

[264] ↵
Fossati A,
Somma A,
Borroni S, et al
. A head-to-head comparison of the personality inventory for DSM-5 (PID-5) with the personality diagnostic questionnaire-4 (PDQ-4) in predicting the general level of personality pathology among community dwelling subjects. J Pers Disord 2016;30:82–94. doi:10.1521/pedi_2015_29_184
OpenUrl

[265] Fossati A,

[266] Somma A,

[267] Borroni S, et al

[268] ↵
van der Gaag M,
Schütz C,
Ten Napel A, et al
. Development of the davos assessment of cognitive biases scale (DACOBS). Schizophr Res 2013;144:63–71. doi:10.1016/j.schres.2012.12.010
OpenUrl CrossRef PubMed

[269] van der Gaag M,

[270] Schütz C,

[271] Ten Napel A, et al

[272] ↵
Bastiaens T,
Claes L,
Smits D, et al
. The cognitive biases questionnaire for psychosis (CBQ-P) and the davos assessment of cognitive biases (DACOBS): validation in a flemish sample of psychotic patients and healthy controls. Schizophr Res 2013;147:310–4. doi:10.1016/j.schres.2013.04.037
OpenUrl

[273] Bastiaens T,

[274] Claes L,

[275] Smits D, et al

[276] ↵
Roenneberg T,
Wirz-Justice A,
Merrow M
. Life between clocks: daily temporal patterns of human chronotypes. J Biol Rhythms 2003;18:80–90. doi:10.1177/0748730402239679
OpenUrl CrossRef PubMed Web of Science

[277] Roenneberg T,

[278] Wirz-Justice A,

[279] Merrow M

[280] ↵
Bastien CH,
Vallières A,
Morin CM
. Validation of the insomnia severity index as an outcome measure for insomnia research. Sleep Med 2001;2:297–307. doi:10.1016/s1389-9457(00)00065-4
OpenUrl CrossRef PubMed Web of Science

[281] Bastien CH,

[282] Vallières A,

[283] Morin CM

[284] ↵
Morin CM,
Belleville G,
Bélanger L, et al
. The insomnia severity index: psychometric indicators to detect insomnia cases and evaluate treatment response. Sleep 2011;34:601–8. doi:10.1093/sleep/34.5.601
OpenUrl CrossRef PubMed Web of Science

[285] Morin CM,

[286] Belleville G,

[287] Bélanger L, et al

[288] ↵
Chung KF,
Kan KKK,
Yeung WF
. Assessing insomnia in adolescents: comparison of insomnia severity index, Athens insomnia scale and sleep quality index. Sleep Med 2011;12:463–70. doi:10.1016/j.sleep.2010.09.019
OpenUrl CrossRef PubMed Web of Science

[289] Chung KF,

[290] Kan KKK,

[291] Yeung WF

[292] ↵
Castronovo V,
Galbiati A,
Marelli S, et al
. Validation study of the Italian version of the insomnia severity index (ISI). Neurol Sci 2016;37:1517–24. doi:10.1007/s10072-016-2620-z
OpenUrl CrossRef

[293] Castronovo V,

[294] Galbiati A,

[295] Marelli S, et al

[296] ↵
Nasreddine ZS,
Phillips NA,
Bédirian V, et al
. The Montreal cognitive assessment, MoCA: a brief screening tool for mild cognitive impairment. J Am Geriatr Soc 2005;53:695–9. doi:10.1111/j.1532-5415.2005.53221.x
OpenUrl CrossRef PubMed Web of Science

[297] Nasreddine ZS,

[298] Phillips NA,

[299] Bédirian V, et al

[300] ↵
Pirrotta F,
Timpano F,
Bonanno L, et al
. Italian validation of montreal cognitive assessment. Eur J Psychol Assess 2015;31:131–7. doi:10.1027/1015-5759/a000217
OpenUrl

[301] Pirrotta F,

[302] Timpano F,

[303] Bonanno L, et al

[304] ↵
Ben-David S,
Birnbaum ML,
Eilenberg ME, et al
. The subjective experience of youths at clinically high risk of psychosis: a qualitative study. Psychiatr Serv 2014;65:1499–501. doi:10.1176/appi.ps.201300527
OpenUrl CrossRef PubMed

[305] Ben-David S,

[306] Birnbaum ML,

[307] Eilenberg ME, et al

[308] ↵
Brunato D,
Cimino A,
Dell’Orletta F, et al
. Profiling-UD: a tool for linguistic profiling of texts. In: ACL Anthology. 2020: 7145–51.

[309] Brunato D,

[310] Cimino A,

[311] Dell’Orletta F, et al

[312] ↵
Straka M,
Hajič J,
Straková J
. UDPipe: trainable pipeline for processing CoNLL-U files performing tokenization, morphological analysis, POS tagging and parsing - ACL anthology. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). 2016: 4290–7.

[313] Straka M,

[314] Hajič J,

[315] Straková J

[316] ↵
Nivre J,
de Marneffe M-C,
Ginter F, et al
. Universal dependencies v1: a multilingual treebank collection; 2016.

[317] Nivre J,

[318] de Marneffe M-C,

[319] Ginter F, et al

[320] ↵
Lu X
. Automatic measurement of syntactic complexity in child language acquisition. IJCL 2009;14:3–28. doi:10.1075/ijcl.14.1.02lu
OpenUrl

[321] Lu X

[322] ↵
Lubetich S,
Sagae K
. Data-driven measurement of child language development with simple syntactic templates - ACL anthology. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2014: 2151–60.

[323] Lubetich S,

[324] Sagae K

[325] ↵
Weiss Z,
Meurers D
. Analyzing linguistic complexity and accuracy in academic language development of German across elementary and secondary school. In: ACL 2019 - Innovative Use of NLP for Building Educational Applications, BEA 2019 - Proceedings of the 14th Workshop. 2019: 380–93. doi:10.18653/v1/W19-4440

[326] Weiss Z,

[327] Meurers D

[328] ↵
Miaschi A,
Brunato D,
Dell’Orletta F, et al
. What makes my model perplexed? A linguistic investigation on neural language models perplexity [Online]. Proceedings of Deep Learning Inside Out (DeeLIO): The 2nd Workshop on Knowledge Extraction and Integration for Deep Learning Architectures 2021:40–7.

[329] Miaschi A,

[330] Brunato D,

[331] Dell’Orletta F, et al

[332] ↵
Prud’hommeaux E. T,
Roark B,
Black L. M, et al
. Classification of atypical language in autism. Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics 2011:88–96.

[333] Prud’hommeaux E. T,

[334] Roark B,

[335] Black L. M, et al

[336] ↵
Vaswani A,
Shazeer N,
Parmer N, et al
. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS’17). Red Hook, NY, USA: Curran Associates Inc, 2017: 6000–10. Available: https://papers.nips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html

[337] Vaswani A,

[338] Shazeer N,

[339] Parmer N, et al

[340] ↵
Devlin J,
Chang M-W,
Lee K, et al
. Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North 2019:4171–86. doi:10.18653/v1/N19-1423

[341] Devlin J,

[342] Chang M-W,

[343] Lee K, et al

[344] ↵
Costanza A,
Amerio A,
Aguglia A, et al
. “Hard to say, hard to understand, hard to live”: possible associations between neurologic language impairments and suicide risk. Brain Sci 2021;11:1594. doi:10.3390/brainsci11121594

[345] Costanza A,

[346] Amerio A,

[347] Aguglia A, et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Strengths and limitations of this study

Introduction

Language, thought and human beings

Language and phenomenal experience

Psychopathology and language: floating on fluid psychopathological substrates

NLP techniques and their application in neuropsychiatric conditions

Lexical level

Morpho-syntactic level

Syntactic level

Semantic analysis

Aims and objectives

Primary objective

Secondary objectives

Methods and analysis

Participants and setting

Procedure and data acquisition

Baseline interview A—psychometric measures

Baseline interview B—speech recording

Time series

Linguistic data processing and elaboration

Duration of the study

Patient and public involvement

Statistical analyses

Estimated sample size and statistical power

Statistical analyses post data acquisition

Supplemental material

Ethics and dissemination

Ethics statements

Patient consent for publication

Acknowledgments

References

Supplementary materials

Supplementary Data

Footnotes

Read the full text or download the PDF:

Log in using your username and password