<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "https://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="eng" dtd-version="1.1">
	<front>
		<journal-meta>
			<journal-id journal-id-type="eissn">2211-7253</journal-id>
			<journal-id journal-id-type="publisher-id">DuJAL</journal-id>
			<journal-title-group>
				<journal-title>Dutch Journal of Applied Linguistics</journal-title>
			</journal-title-group>
			<issn publication-format="print">2211-7245</issn>
			<issn publication-format="online">2211-7253</issn>
			<publisher>
				<publisher-name>Openjournals.nl</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="doi">10.51751/DuJAL15756</article-id>
			<title-group>
				<article-title>Integration, demands and accommodations: An exploration of summative assessment practices in Dutch bilingual secondary education</article-title>
				<alt-title alt-title-type="running-head">Integration, demands and accommodations</alt-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author" corresp="yes">
					<name name-style="western">
						<surname>Mearns</surname>
						<given-names>Tessa</given-names>
					</name>
					<email xlink:href="mailto:t.l.mearns@iclon.leidenuniv.nl">t.l.mearns@iclon.leidenuniv.nl</email>
					<xref ref-type="aff" rid="AFF000001"/>
					<role>conceptualisation, methodology, investigation, validation, writing&#x00A0;&#x2013; original draft, writing&#x00A0;&#x2013; reviewing and editing</role>
				</contrib>
				<contrib contrib-type="author">
					<name name-style="western">
						<surname>van Beuningen</surname>
						<given-names>Catherine</given-names>
					</name>
					<xref ref-type="aff" rid="AFF000002"/>
					<xref ref-type="aff" rid="AFF000003"/>
					<role>conceptualisation, methodology, investigation, formal analysis, writing&#x00A0;&#x2013; original draft, writing&#x00A0;&#x2013; reviewing and editing</role>
				</contrib>
				<contrib contrib-type="author">
					<name name-style="western">
						<surname>Nederlof</surname>
						<given-names>Niels</given-names>
					</name>
					<xref ref-type="aff" rid="AFF000004"/>
					<role>methodology, investigation, data curation</role>
				</contrib>
				<contrib contrib-type="author">
					<name name-style="western">
						<surname>de Jong</surname>
						<given-names>Nivja H.</given-names>
					</name>
					<xref ref-type="aff" rid="AFF000001"/>
					<xref ref-type="aff" rid="AFF000005"/>
					<role>conceptualisation, methodology, investigation, formal analysis, writing&#x00A0;&#x2013; reviewing and editing</role>
				</contrib>
				<aff id="AFF000001">
					<label>1</label>
					<institution-wrap>
						<institution>ICLON Leiden University Graduate School of Teaching</institution>
					</institution-wrap>
				</aff>
				<aff id="AFF000002">
					<label>2</label>
					<institution-wrap>
						<institution>Amsterdam University of Applied Sciences</institution>
					</institution-wrap>
				</aff>
				<aff id="AFF000003">
					<label>3</label>
					<institution-wrap>
						<institution>University of Amsterdam</institution>
					</institution-wrap>
				</aff>
				<aff id="AFF000004">
					<label>4</label>
					<institution-wrap>
						<institution>Dialogic</institution>
					</institution-wrap>
				</aff>
				<aff id="AFF000005">
					<label>5</label>
					<institution-wrap>
						<institution>Leiden University Centre for Linguistics</institution>
					</institution-wrap>
				</aff>
			</contrib-group>
			<pub-date publication-format="online">
				<day>21</day>
				<month>05</month>
				<year>2025</year>
			</pub-date>
			<volume content-type="number">14</volume>
			<volume content-type="year">2025</volume>
			<fpage specific-use="PDF">1</fpage>
			<lpage>29</lpage>
			<history>
				<date date-type="received">
					<day>06</day>
					<month>07</month>
					<year>2023</year>
				</date>
				<date date-type="accepted">
					<day>06</day>
					<month>11</month>
					<year>2024</year>
				</date>
				<date date-type="pub">
					<day>21</day>
					<month>05</month>
					<year>2025</year>
				</date>
			</history>
			<permissions>
				<copyright-statement>Copyright 2025 by the author(s).</copyright-statement>
				<copyright-year>2025</copyright-year>
				<copyright-holder>the author(s)</copyright-holder>
				<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:title="CC BY">
					<license-p>This is an open access article distributed under the terms of the CC BY 4.0 license.</license-p>
				</license>
			</permissions>
			<self-uri content-type="PDF" xlink:href="DuJAL15756_text.pdf"/>
			<abstract>
				<title>Abstract</title>
				<p>In content and language integrated learning (CLIL), the goal is to support learning of both subject content and (subject-related) language, through an integrated approach. Based on the principle of constructive alignment (Biggs, 1996), it is logical to assume that such integration carries through to assessment. This study explores the extent to which this is the case in the context of bilingual secondary education in the Netherlands. Drawing on quantitative, self-report data from a teacher questionnaire, and qualitative analysis of assessment materials, we explore the relative roles of language and content, the use of accommodations and translanguaging as support, and the cognitive and linguistic demands posed by assessments. In line with previous research, it appears that assessment practices do not always align with the integrated goals of CLIL. Implications for practice are discussed, as is the need for more attention for this aspect of CLIL in research and teacher education.</p>
			</abstract>
			<kwd-group kwd-group-type="uncontrolled">
				<title>Keywords</title>
				<kwd>content and language integrated learning (CLIL)</kwd>
				<kwd>assessment</kwd>
				<kwd>summative</kwd>
				<kwd>teaching practice</kwd>
				<kwd>bilingual education</kwd>
				<kwd>accommodations</kwd>
				<kwd>translanguaging</kwd>
				<kwd>cognitive demands</kwd>
				<kwd>linguistic demands</kwd>
				<kwd>integration</kwd>
			</kwd-group>
			<funding-group>
				<funding-statement>Funding for this research was provided by the Meesterschapsteam Moderne Vreemde Talen, part of the programme Meesterschap in de Vakdidactiek from Vakdidactiek Geesteswetenschappen.</funding-statement>
			</funding-group>
			<custom-meta-group>
				<custom-meta>
					<meta-name>Statement of interest</meta-name>
					<meta-value>The authors have no known competing financial interests or personal relationships that could have influenced the work reported in this manuscript.</meta-value>
				</custom-meta>
				<custom-meta>
					<meta-name>Statement of technology use</meta-name>
					<meta-value>No AI-based generative technology was used in the preparation of this manuscript and the execution of the research that the manuscript reports upon.</meta-value>
				</custom-meta>
				<custom-meta>
					<meta-name>Supporting information</meta-name>
					<meta-value>In the supplementary file: Appendix S1: Teacher questionnaire on self-reported assessment practices; Appendix S2: Background questionnaire on assessment bundles; Appendix S3: Coding Scheme for Assessment Bundles</meta-value>
				</custom-meta>
			</custom-meta-group>
		</article-meta>
	</front>
	<body>
		<sec sec-type="heading-1">
			<label>1</label>
			<title>Introduction</title>
			<p>Teachers in content and language integrated learning (CLIL) contexts are faced with a dual teaching task (Coyle et al., 2010; Dalton-Puffer et al., 2014). They have to guide learners in their content learning, and in their development of second language (L2) proficiency, in particular in subject-specific discourses (Coyle &amp; Meyer, 2021). Following the principle of constructive alignment (Biggs, 1996), it seems logical to assume that this dual, integrated teaching and learning goal will carry through to the way in which learning is assessed (Sato, 2023). In other words, if language and content are integrated in CLIL instruction, one could expect them to also be integrated in CLIL assessment.</p>
			<p>As the limited research on summative assessment in CLIL (e.g. Otto, 2017) has shown, however, this assumption may not be so self-evident. Indeed, there appears to be a lack of consensus regarding the extent to which language and content in CLIL should be assessed separately or in combination. While practical publications aimed at CLIL teachers generally agree that the question of content and language integration adds extra complexity to assessment, recommendations as to how to respond to this complexity are not always clear-cut or consistent. As Lo and Fung (2020) rightfully argue, without a clear framework guiding CLIL teachers in the design of assessments, we run the risk of assessing learners in unfair and invalid ways.</p>
			<p>In the Netherlands, which has one of the most well-established and clearly regulated paradigms of bilingual education in Europe (Mearns &amp; de Graaff, 2018), the picture is equally murky. While Dutch secondary education in general is heavily focused on summative assessment (Sluijsmans et al., 2013), the national Quality Standard for Bilingual Education (Nuffic, 2019) makes no mention of how this aspect of CLIL should or could be addressed, and there has been no research published on assessment practices in this setting (Mearns et al., 2023). Therefore, we have a limited picture of how CLIL teachers in the Netherlands approach assessment of language and content. The current, small-scale study therefore aims to map how these teachers shape their assessment practices, in absence of clear guidelines, and as such to spark discussion regarding the ways in which learning is assessed in this and other multilingual education contexts. Moreover, as language plays a crucial role in any kind of learning, irrespective of the content being learned or the language in which the learning takes place in, we hope to open a similar discussion regarding the role of language in assessment in mainstream education.</p>
			<sec sec-type="heading-2">
				<label>1.1</label>
				<title>Separate vs. integrated assessment</title>
				<p>As CLIL encompasses the teaching of both content and language, it follows that valid CLIL assessment will also be dual-focused (Llinares et al., 2012; Mehisto &amp; Ting, 2017; Otto, 2017; Lo &amp; Fung, 2020). What is less clear, however, is whether this means that the content and language taught and learned through CLIL should&#x00A0;&#x2013; and can&#x00A0;&#x2013; be assessed together or separately (Otto, 2019).</p>
				<p>Some practical publications (e.g. Bentley, 2010; Coyle et al., 2010; Dale &amp; Tanner, 2012) aim to help CLIL teachers identify the focus of assessment and to decide whether the emphasis should be on content, language or other related skills. They suggest that distinctions might also be made between formative and summative assessment, whereby content assessment alone is linked to grading, but feedback is provided on language. Other authors have questioned whether the separation of language and content in assessment is desirable considering the integrated goals of CLIL (e.g., Llinares et al., 2012; Mehisto &amp; Ting, 2017; Morton, 2020). They suggest that language and content should be assessed together, as language is an integral aspect of subject content.</p>
				<p>Calls for integrated assessment reflect recent thinking on CLIL from a disciplinary literacies perspective, which highlights that the content, language and cognitive processes involved in a subject are inextricably linked (e.g., Dalton-Puffer et al., 2018; Coyle &amp; Meyer, 2021), or, as Morton (2020) formulates it: &#x201C;Learning a school subject [&#x2026;] means being able to comprehend and produce the types of texts or genres (both oral and written) through which knowledge in the subject is communicated&#x201D; (p.&#x00A0;9). In an integrated model of CLIL assessment (Llinares et al., 2012), this inextricable relationship between content and language is acknowledged at all stages of the learning process. The tasks and subject-specific language used in assessment will reflect those encountered in the classroom, so that learners know what is expected of them and how they will be evaluated.</p>
				<p>Sato (2023) addresses the variety of perspectives on assessment in CLIL by focusing on the purposes of assessment. He discerns three types of CLIL assessment: &#x2018;separate&#x2019;, &#x2018;weakly integrated&#x2019; and &#x2018;strongly integrated&#x2019;. Sato emphasizes that, while separation of language and content (e.g. through standardized language tests and content-focused testing in the first language) can be useful for research purposes, it does not align with the integrated goals of CLIL and is therefore a less logical choice for assessment in teaching practice. Weakly integrated assessments mitigate potential interference from lower L2 proficiency by managing linguistic demands, for example through contingent scaffolding during oral interaction or reducing the amount of active language production required in written tasks. This, Sato suggests, can be useful for diagnostic or formative assessment of content learning. For summative assessment that is aligned with CLIL&#x2019;s integrated goals, however, Sato recommends a strongly integrated approach in which language is regarded as an aspect of the subject content. He argues that, if teaching introduces and supports the learning of subject-specific language as integral to subject learning, it follows that assessment should also be integrated.</p>
			</sec>
			<sec sec-type="heading-2">
				<label>1.2</label>
				<title>Balancing cognitive and linguistic demands</title>
				<p>While Lo and Fung (2020) agree that integrated assessment approaches are the best fit with the dual goal of CLIL, they also point to the fact that &#x201C;CLIL assessments should avoid underestimating students&#x2019; content knowledge due to their inability to express their understanding with appropriate language&#x201D; (p.&#x00A0;1194). Whereas Sato (2023) proposed to mitigate this interference between language proficiency and content-related performance in diagnostic and formative assessment only (i.e. &#x2018;weak integration&#x2019;), others suggest that support or &#x2018;accommodations&#x2019; can also help increase the validity of summative assessment (Coyle et al., 2010; Lo &amp; Lin, 2014; Otto, 2017) by &#x201C;[helping] students access the content in English and better demonstrate what they know&#x201D; (Butler &amp; Stevens, 1997, p.&#x00A0;5). As defined by researchers in non-CLIL multilingual education settings, accommodations can be embedded in the assessment format (e.g. visual aids, text modifications, glossaries, translations) or made available as part of the assessment procedure (e.g. providing explanations on request; access to support materials such as dictionaries or online materials) (De Backer et al., 2019a; Yang, 2020). Accommodations have been identified as both helping learners and strengthening the validity of assessments (De Backer, 2020; Lo &amp; Lin, 2014), although which types of accommodations are most effective appears to vary between learners and situations. De Backer et al. (2019a) therefore recommend the use of a range of accommodation strategies. Examples of accommodations that have been identified as both helpful to learners and beneficial in strengthening the validity of assessments are adaptations of texts to make them more accessible; providing aural input alongside written texts; and providing access to support materials such as glossaries or dictionaries (De Backer et al., 2019a; Lo &amp; Lin, 2014).</p>
				<p>A specific accommodation that can be used to balance the linguistic and cognitive assessment demands, is &#x2018;translanguaging&#x2019;: providing room for learners to use multiple languages in assessment and/or response formats (De Backer et al., 2019a; De Backer et al., 2019b). Translanguaging can support the learning of both content and language (Cenoz, 2017) and has been found to be an effective tool in minimizing interference of language in the assessment of subject content (Serra, 2007). In this sense, L1 support could be a helpful tool in what Sato (2023) terms &#x2018;weakly integrated&#x2019; CLIL assessment where the main focus is on content outcomes. It is important to note, however, that learners do not always perceive choice of language in assessments as helpful, as identified in both CLIL and other L2 content classrooms (De Backer, 2020; H&#x00F6;nig, 2010). As De Backer et al. (2019a) observed, individual factors such as L2 proficiency and personal preference can influence the extent to which learners perceive translanguaging in assessments as useful.</p>
			</sec>
			<sec sec-type="heading-2">
				<label>1.3</label>
				<title>Previous research on CLIL assessment practices</title>
				<p>Research on assessment practices in CLIL contexts is scarce. From the few studies that are available, it seems that CLIL teachers have a preference for separating language and content in assessments, but that they do not always achieve this in practice. H&#x00F6;nig (2010), for example, observed a contrast between teachers&#x2019; conviction that they focused only on content in CLIL assessment, and their actual assessments, which appeared to be influenced by linguistic aspects such as fluency and length of responses. Similarly, Otto and Estrada (2019) observed that content teachers found it difficult to ignore language in assessment, but at the same time did not feel equipped to assess it. These studies highlight how language can be an &#x201C;invisible&#x201D; criterion in CLIL assessment (Llinares et al., 2012, p.&#x00A0;284). Furthermore, the preference among CLIL teachers to take the same approaches to assessment with CLIL groups as they do with non-CLIL groups (Otto, 2017; Reierstam &amp; Sylv&#x00E9;n, 2019), namely with little attention for language, also seems to suggest that CLIL teachers may be insufficiently aware of the role language plays in assessment. According to Lo et al. (2019), an invisible role for language in CLIL assessment is not only a threat to fairness and validity, but can also affect the extent to which these aspects are integrated in CLIL teaching. They noted that a lack of explicit attention for language in assessment can have a washback effect, leading teachers to neglect the linguistic aspects of their subject in classroom teaching.</p>
				<p>Multiple studies from CLIL contexts have shown that lack of L2 proficiency can hinder CLIL learners in the expression and demonstration of their content knowledge and skills, especially in the earlier stages of L2 development. Oattes et al. (2020) observed that learners in their first year of CLIL performed better in a multiple-choice history assessment conducted in Dutch (the main school language) than in a comparable assessment conducted in English (the CLIL language). This effect was not observed among CLIL learners in their third year, suggesting that learners with higher L2 proficiency coped better with the assessment&#x2019;s demands. Likewise, Lo and Fung (2020) found that students&#x2019; performance in CLIL assessments declined with increasing linguistic demands. The latter study furthermore showed that, in CLIL assessments for science and biology in upper secondary education, both the cognitive and linguistic demands were generally high. In light of their findings pertaining to students&#x2019; performance, the authors warn that this situation is problematic as the high linguistic demands may prevent students with lower L2 proficiency from demonstrating their understanding of subject content. This also reflects evidence from non-CLIL contexts where limited mastery of the instructional language was found to negatively influence students&#x2019; performance on content exams (Trenkic &amp; Warmington, 2019).</p>
				<p>Little CLIL research has explored the potential of accommodations to mitigate interference between language and content in assessment. Regarding the presence of accommodations in CLIL assessment, while around two-thirds of the assessment materials analysed by Otto (2017) contained visual or pictorial support to limit language demands, only a quarter used graphic organizers to support language production, and none made use of glossaries or other types of linguistic support. To the best of our knowledge, there are no studies that explore if and how translanguaging can support CLIL assessment. This might be related to the fact that in some CLIL contexts, including the Netherlands, there is a strong conviction among schools and teachers that communication in CLIL should take place exclusively in the CLIL target language (Oattes et al., 2018), based on widespread misconceptions about multilingual language development (van Beuningen &amp; Poli&#x0161;ensk&#x00E1;, 2019). That said, H&#x00F6;nig (2010) observed that, when students were offered a choice of language in which to complete an assessment, they all opted for the language in which they had been taught (English) rather than the main school language (German). This could corroborate Coyle et al.&#x2019;s (2010) argument that it is inconsistent to assess learners in a language other than the one in which they have been taught. Considering the arguments presented above from other, non-CLIL, multilingual contexts, however (e.g. De Backer et al., 2019a), it seems that offering L1 translations or other materials as optional support during content-focused assessments is potentially helpful for some learners.</p>
			</sec>
			<sec sec-type="heading-2">
				<label>1.4</label>
				<title>Current study</title>
				<p>As explored above, based on the principle of constructive alignment (Biggs, 1996) and on the theoretical recommendations for CLIL practice, it could be assumed that (a) the integrated goals of language and content learning would translate into integrated approaches to assessment in CLIL contexts; and (b) scaffolded approaches as used in CLIL teaching would feed into the use of accommodations and translanguaging in assessment, to balance the cognitive and linguistic demands of assessment tasks. From the scarce research carried out in CLIL settings so far, however, it appears that practices in both of these regards are varied and that teachers are not always aware of the interaction between content and language in the way they assess learners&#x2019; performance. With this in mind, the current small-scale study explored ways in which summative assessment is approached in the first three years of Dutch-English bilingual secondary education (BSE) in the Netherlands, with the aim of increasing our understanding of CLIL assessment in this specific context.</p>
				<p>Whereas CLIL is often associated with teaching and learning in non-language subjects, such as biology, history or art, many teachers of English in bilingual education (TEB&#x202F;s) in the Netherlands identify both as language acquisition specialists, and as content specialists in literature and language arts (Dale et al., 2018). This could lead to TEB&#x202F;s and teachers of other subjects (ST&#x202F;s) having different and possibly complementary perspectives on assessment, although we consider them all CLIL teachers. Therefore, although it was not the main goal of the current study to compare the assessment practices of ST&#x202F;s and TEB&#x202F;s, we analysed these groups&#x2019; practices separately.</p>
				<p>The research question guiding this study is: How do ST&#x202F;s and TEB&#x202F;s in bilingual secondary education in the Netherlands approach summative assessment of language and content?</p>
				<p>The answer to this question was sought in relation to the themes discussed in the preceding section, namely the extent of content and language integration; the interplay between cognitive and linguistic demands; and the use of strategies to balance these demands (i.e. accommodations and translanguaging).</p>
			</sec>
		</sec>
		<sec sec-type="heading-1">
			<label>2</label>
			<title>Method</title>
			<p>To answer the research question, summative assessment practices in Dutch-English BSE were explored in both their perceived and operational forms (van den Akker, 2003), combining a broader examination of teachers&#x2019; self-reported practices with in-depth analysis of assessment materials from actual teaching practice. This allowed us to explore and understand assessment practices from teachers&#x2019; perspectives while also gaining a more objective impression of what those practices look like.</p>
			<sec sec-type="heading-2">
				<label>2.1</label>
				<title>Research context</title>
				<p>The research took place in the context of lower secondary (learners aged 12-15) BSE in the Netherlands. BSE (known locally as <italic>tweetalig onderwijs</italic> or <italic>TTO</italic>) has existed since 1989 and has followed a national quality standard and accreditation scheme since 2003. At the time of writing, 134 schools offer BSE, amounting to about 21&#x202F;% of the country&#x2019;s 648 secondary schools (Mearns et al., 2023). BSE schools generally follow the same curriculum as mainstream schools, but offer a significant proportion of that curriculum in English. For pre-vocational (or <italic>vmbo</italic>) learners, the minimum proportion of English-medium teaching is 30&#x202F;% in years 1-2, and for learners in the higher general (or <italic>havo</italic>) and pre-university (or <italic>vwo</italic>) tracks it is 50&#x202F;% in years 1-3. By the end of their third year in BSE, learners are expected to reach A2 (pre-vocational/<italic>vmbo</italic>), B1 (higher general/<italic>havo</italic>) or B2 (pre-university/<italic>vwo</italic>) level English (according to the Common European Framework of Reference for Languages, CEFR). English-medium provision is spread across different subjects, so that learners develop L2 proficiency in a range of disciplines. In the senior years, most subjects revert to teaching in Dutch, in preparation for the centralized final examinations. The English-as-discipline curriculum in the senior years of BSE is enhanced with a content-oriented international examination program, in most cases the International Baccalaureate (IB).</p>
			</sec>
			<sec sec-type="heading-2">
				<label>2.2</label>
				<title>Participants, data collection and data-analysis</title>
				<p>The study combined two stages of data collection: a survey aimed at painting a broad picture of teachers&#x2019; (N = 42) self-reported summative assessment practices, and an in-depth analysis of example assessments (N = 17) submitted by thirteen teachers. The participants, instruments and analyses pertaining to each stage are described below.</p>
				<sec sec-type="heading-3">
					<label>2.2.1</label>
					<title>Stage 1: Self-reported assessment practices</title>
					<sec sec-type="heading-4">
						<label>2.2.1.1</label>
						<title>Participants</title>
						<p>Participants were recruited using a convenience sampling method, via the Network of Dutch Bilingual Schools and the researchers&#x2019; own networks. In total, 45 teachers teaching in the junior years of BSE from 17 different schools filled in the questionnaire completely (an additional five teachers started but did not complete the questionnaire). We excluded three teachers: one German language teacher, one teacher with no CLIL experience, and one teacher of physical education. Of the 42 participants, ten were TEB&#x202F;s. The remaining 32 ST&#x202F;s taught a variety of subjects: history (6); mathematics (5); geography (6); art/music/drama (6); biology (4); physics/chemistry (3); economics (1); religious studies (1). The teachers had between 1 and 37 years&#x2019; teaching experience and between 1 and 30 years&#x2019; experience of teaching in BSE.</p>
					</sec>
					<sec sec-type="heading-4">
						<label>2.2.1.2</label>
						<title>Questionnaire</title>
						<p>An online questionnaire was designed to gather information on (1) teachers&#x2019; summative assessment practices in general, (2) the relative role of language and content in assessments, and (3) how teachers used accommodations and translanguaging in assessment. The questionnaire was published in Formdesk and consisted of two sections. The first section requested background information regarding the teachers and their school context. The second section contained open and multiple-choice questions regarding the three themes specified above, including typical assessment types, integration of content and language in assessment, relative weighting of content and language in grading, language(s) used in assessment, and the availability of accommodations during assessment. This section also contained a matrix in which participants could indicate which aspects of language were addressed in grading and in the feedback they provided. The latter was included in the knowledge that summative assessments can also serve a formative function, so it may be the case that teachers give feedback on language aspects even when they do not grade them. For several questions, a distinction was drawn between summative &#x2018;tests&#x2019; (i.e. &#x201C;an assessment that is completed in class time and measures what a student can do at that moment&#x201D;) and summative &#x2018;assignments&#x2019; (i.e. &#x201C;products that a student or group of students might spend longer on, perhaps over a number of lessons or including time outside of class, such as an essay, short story, project, lab report, presentation, artwork or video&#x201D;), as assessment practices might be different for each method, for example due to time constraints or the availability of help and resources. Therefore, handling all assessment types together might create a less clearly defined picture of current assessment practices than if tests and assignments were approached separately. The questionnaire can be found in Appendix S1.</p>
					</sec>
					<sec sec-type="heading-4">
						<label>2.2.1.3</label>
						<title>Procedure</title>
						<p>Participants received a link to the questionnaire by email or via social media. Before entering the questionnaire, participants were made aware of the fact that participation was voluntary and that their answers would be processed anonymously, and they were asked to give their consent. The final question asked if participants would be willing to share their materials (for the next stage of the data collection) and if so, to share an email address. Otherwise, participants were anonymous. Data were gathered in October-November of 2021. The time needed to fill in the questionnaire was around 15 minutes.</p>
					</sec>
					<sec sec-type="heading-4">
						<label>2.2.1.4</label>
						<title>Analyses</title>
						<p>Quantitative data gathered in the multiple-choice items were analyzed descriptively. Additional explanatory comments and additional responses to multiple-choice questions were categorized and are reported quantitatively. Responses from TEB&#x202F;s and ST&#x202F;s are reported separately.</p>
					</sec>
				</sec>
				<sec sec-type="heading-3">
					<label>2.2.2</label>
					<title>Stage 2: Actual assessment practices</title>
					<sec sec-type="heading-4">
						<label>2.2.2.1</label>
						<title>Participants</title>
						<p>In this stage too, teachers from years 1-3 of BSE were approached via the Network of Dutch Bilingual Schools and the researchers&#x2019; professional networks, and through the final question of the Stage 1 questionnaire. This round of recruitment was separate to the invitation to complete the questionnaire. 34 teachers, including 12 questionnaire respondents, registered their interest in participating and were sent instructions as to how to do so. Thirteen teachers agreed to participate: two TEB&#x202F;s and eleven ST&#x202F;s.</p>
						<p>The spread of subjects can be found in Table&#x00A0;1. Participants had between 2 and 26 years&#x2019; teaching experience, of which between 2 and 25 years in BSE. 6 of the teachers (with between 2 and 25 years&#x2019; experience) had taught in BSE from the start of their career. Two teachers self-reported to be &#x2018;native&#x2019; speakers of English; two had university degrees in English; seven reported having obtained a Cambridge Proficiency in English (CPE) certificate (CEFR level C2); two reported having no formal language certification, but estimated their own level of English at C2 or &#x201C;excellent&#x201D;. All participants reported having received some CLIL training, either as professional development in school or externally, or as part of their initial teacher education.</p>
					</sec>
					<sec sec-type="heading-4">
						<label>2.2.2.2</label>
						<title>Assessment bundles</title>
						<p>Participating teachers were asked to share one or more &#x2018;assessment bundle(s)&#x2019; showing examples of summative assessments they had recently used in a CLIL setting. They were provided with written instructions as to what to submit, including explanations of what was meant by terms such as &#x2018;assessment&#x2019;, &#x2018;summative&#x2019;, &#x2018;test&#x2019; and &#x2018;assignment&#x2019; in the context of this study, and asked to classify their materials as either tests or assignments based on those definitions (see Appendix S2). Requested for each bundle were:</p>
						<list list-type="order">
							<list-item>
								<p>All relevant documents pertaining to a summative assessment recently used in years 1-3 of BSE, i.e. the test or assignment instructions, assessment criteria/answer keys, and anything else the teacher considered relevant.</p>
							</list-item>
							<list-item>
								<p>Three anonymized examples of completed and graded student products, preferably exemplifying strong, average and weaker performance.</p>
							</list-item>
							<list-item>
								<p>A completed questionnaire (Appendix S2) regarding the teacher&#x2019;s background and teaching context, and information about the assessment materials.</p>
							</list-item>
							<list-item>
								<p>A signed consent form.</p>
							</list-item>
						</list>
						<p>In total, the thirteen participating teachers submitted seventeen assessment bundles. Two teachers submitted more than one bundle (T02_TEB: bundles A02-A05; T06_ST: bundles A09-A010). We decided to handle these as separate bundles, as they were unrelated assessments, for different classes, focusing on different content and skills, and displayed different features. It should be borne in mind, however, that the overlap in teachers may introduce some bias in the results. Table&#x00A0;1 shows how the bundles represented a range of subject areas, age-groups and assessment types, although written assessments predominate (14 out of 17). This is in line with the types of assessment teachers most often reported using in the questionnaire. With respect to educational stream, the over-representation of <italic>vwo</italic> (pre-university) assessments in the sample reflects the actual spread of Dutch BSE, which is most common in <italic>vwo</italic> tracks (Mearns et al., 2023). One assessment bundle (A11) did not contain graded examples of student work.</p>
						<table-wrap id="T000001">
							<caption>
								<title>Table&#x00A0;1</title>
								<p>Assessment bundles</p>
							</caption>
							<table frame="void" cellpadding="5">
								<thead>
									<tr>
										<th>
											<p>Bundle</p>
										</th>
										<th>
											<p>Teacher</p>
										</th>
										<th>
											<p>Subject</p>
										</th>
										<th>
											<p>Domain/topic</p>
										</th>
										<th>
											<p>Test/ assignment</p>
										</th>
										<th>
											<p>Assessment type</p>
										</th>
										<th>
											<p>Year (Track)</p>
										</th>
									</tr>
								</thead>
								<tbody>
									<tr>
										<td>
											<p>A01</p>
										</td>
										<td>
											<p>T01_TEB</p>
										</td>
										<td>
											<p>English</p>
										</td>
										<td>
											<p>Creative writing</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (diary entry)</p>
										</td>
										<td>
											<p>2 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A02</p>
										</td>
										<td>
											<p>T02_TEB</p>
										</td>
										<td>
											<p>English</p>
										</td>
										<td>
											<p>Speaking</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Oral (short speech)</p>
										</td>
										<td>
											<p>3 (havo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A03</p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p>Grammar; writing</p>
										</td>
										<td>
											<p>Test</p>
										</td>
										<td>
											<p>Written (gap-fill tasks + application letter)</p>
										</td>
										<td>
											<p>2 (havo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A04</p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p>Creative writing</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (short story)</p>
										</td>
										<td>
											<p>1 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A05</p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p>Literature</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (tasks concerning a novel)</p>
										</td>
										<td>
											<p>1 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A06</p>
										</td>
										<td>
											<p>T03_ST</p>
										</td>
										<td>
											<p>Geography</p>
										</td>
										<td>
											<p>Geology</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Oral (research presentation)</p>
										</td>
										<td>
											<p>2 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A07</p>
										</td>
										<td>
											<p>T04_ST</p>
										</td>
										<td>
											<p>History</p>
										</td>
										<td>
											<p>World War&#x00A0;I propaganda</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (research report)</p>
										</td>
										<td>
											<p>3 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A08</p>
										</td>
										<td>
											<p>T05_ST</p>
										</td>
										<td>
											<p>Geography</p>
										</td>
										<td>
											<p>Wealth differences</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (research report)</p>
										</td>
										<td>
											<p>1 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A09</p>
										</td>
										<td>
											<p>T06_ST</p>
										</td>
										<td>
											<p>History</p>
										</td>
										<td>
											<p>Napoleon</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (research report) or oral (research presentation)&#x00A0;&#x2013; student&#x2019;s choice</p>
										</td>
										<td>
											<p>2 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A10</p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p></p>
										</td>
										<td>
											<p>Cold War</p>
										</td>
										<td>
											<p>Test</p>
										</td>
										<td>
											<p>Written (closed and open questions)</p>
										</td>
										<td>
											<p>3 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A11</p>
										</td>
										<td>
											<p>T07_ST</p>
										</td>
										<td>
											<p>Biology</p>
										</td>
										<td>
											<p>Reproduction</p>
										</td>
										<td>
											<p>Test</p>
										</td>
										<td>
											<p>Written (closed and open questions)</p>
										</td>
										<td>
											<p>2 (havo/ vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A12</p>
										</td>
										<td>
											<p>T08_ST</p>
										</td>
										<td>
											<p>Biology</p>
										</td>
										<td>
											<p>Senses, heredity and evolution</p>
										</td>
										<td>
											<p>Test</p>
										</td>
										<td>
											<p>Written (closed and open questions)</p>
										</td>
										<td>
											<p>3 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A13</p>
											<p>(a/b)</p>
										</td>
										<td>
											<p>T09_ST</p>
										</td>
										<td>
											<p>Science</p>
										</td>
										<td>
											<p>Light</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (research report; A13a) Product/design (spectroscope; A13b)</p>
										</td>
										<td>
											<p>2 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A14</p>
										</td>
										<td>
											<p>T10_ST</p>
										</td>
										<td>
											<p>Biology</p>
										</td>
										<td>
											<p>Food and digestion</p>
										</td>
										<td>
											<p>Test</p>
										</td>
										<td>
											<p>Written (closed and open questions)</p>
										</td>
										<td>
											<p>2 (vmbo/ havo/vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A15</p>
										</td>
										<td>
											<p>T11_ST</p>
										</td>
										<td>
											<p>Maths</p>
										</td>
										<td>
											<p>Statistics</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (research report)</p>
										</td>
										<td>
											<p>2 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A16</p>
										</td>
										<td>
											<p>T12_ST</p>
										</td>
										<td>
											<p>Chemistry</p>
										</td>
										<td>
											<p>Classification of substances, reactions and energy</p>
										</td>
										<td>
											<p>Test</p>
										</td>
										<td>
											<p>Written (open questions)</p>
										</td>
										<td>
											<p>3 (vwo)</p>
										</td>
									</tr>
									<tr>
										<td>
											<p>A17</p>
										</td>
										<td>
											<p>T13_ST</p>
										</td>
										<td>
											<p>Biology</p>
										</td>
										<td>
											<p>Plants</p>
										</td>
										<td>
											<p>Assignment</p>
										</td>
										<td>
											<p>Written (research report)</p>
										</td>
										<td>
											<p>1 (havo/ vwo)</p>
										</td>
									</tr>
								</tbody>
							</table>
						</table-wrap>
					</sec>
					<sec sec-type="heading-4">
						<label>2.2.2.3</label>
						<title>Analyses</title>
						<p>Analysis of the assessment bundles and the accompanying background information provided by the teachers (Appendix S2) was carried out deductively, using a pre-determined set of themes formulated based on insights from research discussed above (Appendix S3). The first two themes coincide with those addressed in the questionnaire, namely: (1) the relative roles of content and language, and (2) the use of accommodations and translanguaging. Additionally, we analyzed (3) the cognitive and linguistic demands of each assessment.</p>
						<p>Focus on content and/or language (theme 1) was identified from teachers&#x2019; responses to the background questionnaire, and the weighting of and attention to content and language in grading and/or feedback, based on assessment criteria and graded student work. Regarding theme 2, we examined teachers&#x2019; responses to the background questionnaire, and any relevant information in assessment instructions and tasks. Finally, to gather insight into the cognitive and linguistic demands (theme 3), we coded the assessments using a framework proposed by Lo and Lin (2014). Within this framework, CLIL assessments can be positioned in a three-by-three matrix, including cognitive (recall, application, analysis) and linguistic levels (vocabulary, sentence, text). At the recall level, students are asked to report or repeat what they have learned. For application, they apply what they have learned to new situations or problems. At the analysis level, they engage in higher-order thinking processes (e.g. synthesizing, evaluating). Along the linguistic dimension, assessment tasks can either require students to understand or produce language on word-level (e.g. subject-specific vocabulary), sentence-level (e.g. sentence patterns commonly used to explain, describe, etc.), and text-level (e.g. a subject-specific text genre).</p>
						<p>Qualitative analysis was carried out by the second author, with samples dual-coded by the first author. Unresolved issues were discussed with the last author.</p>
					</sec>
				</sec>
			</sec>
		</sec>
		<sec sec-type="heading-1">
			<label>3</label>
			<title>Results</title>
			<p>We describe the results for each stage of data collection (questionnaire and assessment bundles) in turn.</p>
			<sec sec-type="heading-2">
				<label>3.1</label>
				<title>Stage 1: Self-reported assessment practices (questionnaire)</title>
				<sec sec-type="heading-3">
					<label>3.1.1</label>
					<title>The relative roles of content and language in assessment</title>
					<p>Responses to the question &#x201C;What is most commonly evaluated? (language, content, or both)&#x201D; are summarized in Table 2. Most TEB&#x202F;s reported that they assess both language and content, either together (30&#x202F;%) or in separate assessments (40&#x202F;%). The majority of ST&#x202F;s stated that they assess only content, although 19&#x202F;% reported assessing both language and content, in most cases in combined assessments.</p>
					<table-wrap id="T000002">
						<caption>
							<title>Table 2</title>
							<p>Focus of assessments according to TEB&#x202F;s and ST&#x202F;s</p>
						</caption>
						<table frame="void" cellpadding="5">
							<thead>
								<tr>
									<th>
										<p></p>
									</th>
									<th colspan="2">
										<p><italic>TEB&#x202F;s (N = 10)</italic></p>
									</th>
									<th colspan="2">
										<p><italic>ST&#x202F;s (N = 32)</italic></p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>
										<p>Language only</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>20</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Content only</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>26</p>
									</td>
									<td>
										<p>81</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Both (total)</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>80</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>19</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>&#x2013; <italic>Separately</italic></p>
									</td>
									<td>
										<p><italic>4</italic></p>
									</td>
									<td>
										<p><italic>40</italic></p>
									</td>
									<td>
										<p><italic>2</italic></p>
									</td>
									<td>
										<p><italic>6</italic></p>
									</td>
								</tr>
								<tr>
									<td>
										<p>&#x2013; <italic>Together</italic></p>
									</td>
									<td>
										<p><italic>3</italic></p>
									</td>
									<td>
										<p><italic>30</italic></p>
									</td>
									<td>
										<p><italic>4</italic></p>
									</td>
									<td>
										<p><italic>13</italic></p>
									</td>
								</tr>
								<tr>
									<td>
										<p>&#x2013; <italic>Not specified</italic></p>
									</td>
									<td>
										<p><italic>1</italic></p>
									</td>
									<td>
										<p><italic>10</italic></p>
									</td>
									<td>
										<p><italic>0</italic></p>
									</td>
									<td>
										<p><italic>0</italic></p>
									</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
					<table-wrap id="T000003">
						<caption>
							<title>Table 3</title>
							<p>Language aspects taken into account when grading tests and assignments</p>
						</caption>
						<table frame="void" cellpadding="5">
							<thead>
								<tr>
									<th>
										<p>Language aspect in grade</p>
									</th>
									<th colspan="4">
										<p>TEB&#x202F;s (N = 10)</p>
									</th>
									<th colspan="4">
										<p>ST&#x202F;s (N = 32)</p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th colspan="2">
										<p><italic>Test</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Assignment</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Test</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Assignment</italic></p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>
										<p>Pronunciation</p>
									</td>
									<td>
										<p>3</p>
									</td>
									<td>
										<p>30</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>50</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>3</p>
									</td>
									<td>
										<p>9</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Spelling</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>80</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>4</p>
									</td>
									<td>
										<p>13</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Vocabulary</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>16</p>
									</td>
									<td>
										<p>4</p>
									</td>
									<td>
										<p>13</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Grammar</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>3</p>
									</td>
									<td>
										<p>4</p>
									</td>
									<td>
										<p>13</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Structure</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>3</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>19</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Clarity</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>100</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>16</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>22</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Register</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>100</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>4</p>
									</td>
									<td>
										<p>13</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Subject-specific language</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>50</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>25</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>28</p>
									</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
					<p>Teachers were asked which aspects of language they take into account when grading and when giving feedback. The results of these questions are summarized in Table 3 and Table&#x00A0;4, respectively.</p>
					<p>With regard to both grading and feedback, TEB&#x202F;s again more often reported paying attention to language than did ST&#x202F;s. The percentage of TEB&#x202F;s indicating that they paid attention to language aspects was between 50&#x202F;% and 100&#x202F;%, with the exception of the aspect of pronunciation in tests (30&#x202F;%). For ST&#x202F;s, the range was lower, namely for grading, between 0&#x202F;% (pronunciation in tests) and 28&#x202F;% (subject-specific language in assignments), and for feedback, between 3&#x202F;% (pronunciation in tests) and 38&#x202F;% (clarity and subject-specific language in assignments). It is worth noting that four ST&#x202F;s who had first indicated that they evaluated only content, did indicate taking one or more language elements into account when grading. Apparently, when confronted with the precise terms for language aspects, more teachers realized they do take them into account.</p>
					<table-wrap id="T000004">
						<caption>
							<title>Table&#x00A0;4</title>
							<p>Language aspects taken into account in feedback on tests and assignments</p>
						</caption>
						<table frame="void" cellpadding="5">
							<thead>
								<tr>
									<th>
										<p>Language aspect in feedback</p>
									</th>
									<th colspan="4">
										<p>TEB&#x202F;s (N = 10)</p>
									</th>
									<th colspan="4">
										<p>ST&#x202F;s (N = 32)</p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th colspan="2">
										<p><italic>Test</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Assignment</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Test</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Assignment</italic></p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>
										<p>Pronunciation</p>
									</td>
									<td>
										<p>3</p>
									</td>
									<td>
										<p>30</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>80</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>3</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>25</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Spelling</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>70</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>70</p>
									</td>
									<td>
										<p>11</p>
									</td>
									<td>
										<p>34</p>
									</td>
									<td>
										<p>11</p>
									</td>
									<td>
										<p>34</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Vocabulary</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>80</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>25</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>31</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Grammar</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>70</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>70</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>25</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>31</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Structure</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>22</p>
									</td>
									<td>
										<p>11</p>
									</td>
									<td>
										<p>34</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Clarity</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>31</p>
									</td>
									<td>
										<p>12</p>
									</td>
									<td>
										<p>38</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Register</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>100</p>
									</td>
									<td>
										<p>4</p>
									</td>
									<td>
										<p>13</p>
									</td>
									<td>
										<p>8</p>
									</td>
									<td>
										<p>25</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Subject-specific language</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>50</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>60</p>
									</td>
									<td>
										<p>11</p>
									</td>
									<td>
										<p>34</p>
									</td>
									<td>
										<p>12</p>
									</td>
									<td>
										<p>38</p>
									</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
					<p>In addition to which aspects were graded, teachers were also asked about the relative emphasis (&#x2018;weighting&#x2019;) placed on content- and language-related aspects of assessment when grading. Table&#x00A0;5 shows the ranges of relative weighting as reported by TEB&#x202F;s and ST&#x202F;s for aspects of language and content, for summative tests and assignments. In line with the previous findings, the ranges for TEB&#x202F;s lean more towards language and those for ST&#x202F;s towards content. This difference is most pronounced in relation to tests.</p>
					<table-wrap id="T000005">
						<caption>
							<title>Table&#x00A0;5</title>
							<p>Ranges of relative weighting of content, language, and other aspects when grading</p>
						</caption>
						<table frame="void" cellpadding="5">
							<thead>
								<tr>
									<th>
										<p>Assessment type</p>
									</th>
									<th>
										<p>Aspect</p>
									</th>
									<th>
										<p>TEB&#x202F;s</p>
									</th>
									<th>
										<p>ST&#x202F;s</p>
									</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>
										<p>Tests</p>
									</td>
									<td>
										<p>Language</p>
									</td>
									<td>
										<p>50-100&#x202F;%</p>
									</td>
									<td>
										<p>0-20&#x202F;%</p>
									</td>
								</tr>
								<tr>
									<td>
										<p></p>
									</td>
									<td>
										<p>Content</p>
									</td>
									<td>
										<p>0-20&#x202F;%</p>
									</td>
									<td>
										<p>80-100&#x202F;%</p>
									</td>
								</tr>
								<tr>
									<td>
										<p></p>
									</td>
									<td>
										<p>Other</p>
									</td>
									<td>
										<p>0-10&#x202F;%*</p>
									</td>
									<td>
										<p>0&#x202F;%</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Assignments</p>
									</td>
									<td>
										<p>Language</p>
									</td>
									<td>
										<p>30-60&#x202F;%</p>
									</td>
									<td>
										<p>0-30&#x202F;%</p>
									</td>
								</tr>
								<tr>
									<td>
										<p></p>
									</td>
									<td>
										<p>Content</p>
									</td>
									<td>
										<p>33-70&#x202F;%</p>
									</td>
									<td>
										<p>60-100&#x202F;%</p>
									</td>
								</tr>
								<tr>
									<td>
										<p></p>
									</td>
									<td>
										<p>Other</p>
									</td>
									<td>
										<p>10-33&#x202F;%*</p>
									</td>
									<td>
										<p>5-20&#x202F;%**</p>
									</td>
								</tr>
							</tbody>
						</table>
						<table-wrap-foot>
							<p>* rationale (test) &amp; creativity (assignment); ** lay-out, structure, on-time, critical thinking (assignments)</p>
						</table-wrap-foot>
					</table-wrap>
				</sec>
				<sec sec-type="heading-3">
					<label>3.1.2</label>
					<title>Accommodations and translanguaging in assessment</title>
					<p>Table 6 displays the percentage of teachers reporting to allow different kinds of accommodations for tests and assignments, for TEB&#x202F;s and ST&#x202F;s separately. Twenty-four teachers (out of a total of 41: one ST never used tests) reported to allow some kind of accommodations during test administration: asking the teacher and using dictionaries are most often allowed. For assignments (n = 41; one ST reported never to give assignments), most (34) teachers report that some type of accommodation is allowed.</p>
					<table-wrap id="T000006">
						<caption>
							<title>Table 6</title>
							<p>Accommodations allowed by TEB&#x202F;s and ST&#x202F;s in tests and assignments</p>
						</caption>
						<table frame="void" cellpadding="5">
							<thead>
								<tr>
									<th>
										<p>Accommodations</p>
									</th>
									<th colspan="4">
										<p>TEB&#x202F;s (N = 10)</p>
									</th>
									<th colspan="4">
										<p>ST&#x202F;s (N = 31)</p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th colspan="2">
										<p><italic>Test</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Assignment</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Test</italic></p>
									</th>
									<th colspan="2">
										<p><italic>Assignment</italic></p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
									<th>
										<p>n</p>
									</th>
									<th>
										<p>%</p>
									</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>
										<p>Dictionaries</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>50</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>90</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>19</p>
									</td>
									<td>
										<p>16</p>
									</td>
									<td>
										<p>52</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Ask the teacher</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>20</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>70</p>
									</td>
									<td>
										<p>13</p>
									</td>
									<td>
										<p>42</p>
									</td>
									<td>
										<p>22</p>
									</td>
									<td>
										<p>71</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Online translator</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>70</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>17</p>
									</td>
									<td>
										<p>55</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Glossaries</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>5</p>
									</td>
									<td>
										<p>50</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>12</p>
									</td>
									<td>
										<p>39</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Personal idiom file</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>6</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>32</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Personal notes</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Ask classmates/parents</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>2</p>
									</td>
									<td>
										<p>6</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Pictures</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>32</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>32</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Translations</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>7</p>
									</td>
									<td>
										<p>23</p>
									</td>
									<td>
										<p>9</p>
									</td>
									<td>
										<p>29</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>Other: Dyslexia support</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>1</p>
									</td>
									<td>
										<p>10</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
									<td>
										<p>0</p>
									</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
					<p>Teachers appear to make little use of translanguaging strategies. Although up to 29&#x202F;% of ST&#x202F;s report providing some support for their assessments using translations, the main language of the instructions and of test and assignment formats was reported to be English. One ST adds that the choice of language(s) may depend on year. Likewise, student answers were also usually only allowed in English: 2 ST&#x202F;s allowed some Dutch, one of whom added this was only allowed in year 1.</p>
				</sec>
			</sec>
			<sec sec-type="heading-2">
				<label>3.2</label>
				<title>Stage 2: Actual assessment practices (assessment bundles)</title>
				<p>We now move on to the results of the qualitative analysis of the assessment bundles submitted by teachers. For an overview of the contents of the bundles, see Table&#x00A0;1 (Method section). As with the questionnaire, the results will be presented under each of the three main analytical themes: roles of content and language, accommodations and translanguaging, and cognitive and linguistic demands.</p>
				<sec sec-type="heading-3">
					<label>3.2.1</label>
					<title>The relative roles of content and language in assessment</title>
					<p>Table&#x00A0;7 summarizes our findings regarding the relative roles of content and language in the assessments, based on three indicators: (1) the focus of each assessment as described in answer to the question, &#x201C;What are the materials aimed at assessing? What did you want students to show they knew/could do?&#x201D;; (2) the relative weighting of content and language in grading, as evidenced in assessment criteria and assigned grades; and (3) the focus of feedback provided by teachers in sample learner products.</p>
					<p>The patterns visible in Table&#x00A0;7 resemble those observed in the questionnaire data. The reported role of language is greater in TEB&#x202F;s&#x2019; assessments than in ST&#x202F;s&#x2019; assessments, whereas for content we see the opposite. In describing the focus of assessments, TEB&#x202F;s mostly mention linguistic knowledge and skills, with the exception of &#x2018;<italic>understanding of literature</italic>&#x2019; as an aim of A05. ST&#x202F;s, on the other hand, describe the focus of their assessments almost exclusively in terms of content knowledge and skills, with no explicit mention of language. Some of the descriptions do refer to linguistic aspects implicitly, by mentioning subject-relevant language functions (e.g., &#x2018;<italic>explain</italic>&#x2019;; A08) or genres (e.g., &#x2018;<italic>writing a lab report</italic>&#x2019;; A17).</p>
					<p>When looking at the weighting of different aspects in assessment criteria, a similar pattern emerges. In TEB&#x202F;s&#x2019; assessments, the weighting of language is 33&#x202F;%-100&#x202F;%, and that of content 10&#x202F;%-30&#x202F;%. In ST&#x202F;s&#x2019; assessments, content weighs 40&#x202F;%-100&#x202F;%, whereas the weight of language is 0&#x202F;%-20&#x202F;%. Other assessment criteria include creativity, text structure or organization, lay-out, referencing, active participation, and presentation skills, and weigh 0&#x202F;%-40&#x202F;% in TEB&#x202F;s&#x2019;, and 0&#x202F;%-53&#x202F;% in ST&#x202F;s&#x2019; assessments.</p>
					<p>To provide more insight into the role of content in TEB&#x202F;s&#x2019; grading and of language in ST&#x202F;s&#x2019; grading, Table&#x00A0;7 also specifies which content-related (for TEB&#x202F;s) and linguistic (for ST&#x202F;s) aspects were actually present in assessment criteria. From the four TEB assessments in which content plays a role in grading (A01, A02, A04, A05), one assessment (A05) clearly focuses explicitly on subject-relevant content (literature). In one other case (A02), the content assessed does not represent cultural or linguistic knowledge that could be considered part of the English-as-discipline curriculum (e.g. &#x2018;<italic>appear[ing] knowledgeable</italic>&#x2019; on any chosen presentation topic). In the remaining two cases, the content aspects assessed relate to students&#x2019; creative writing skills, namely, &#x2018;<italic>the use of supporting details</italic>&#x2019; in a diary entry (A01) and the &#x2018;<italic>plot, character, idea/message</italic>&#x2019; of a short story (A04). When language plays an explicit role in ST&#x202F;s&#x2019; grading (A06, A08, A09, A13, A15, A17), the focus is usually on accuracy of formulations (word choice, grammar, spelling, punctuation). In two cases (A06, A15), it remains unclear which linguistic aspects are taken into account as the relevant criteria are broadly formulated as &#x2018;<italic>use of English</italic>&#x2019; or &#x2018;<italic>use of language</italic>&#x2019;. In one case (A09), students are assessed on using &#x2018;<italic>their own words</italic>&#x2019;.</p>
					<p>Finally, from analysing teachers&#x2019; feedback on student work the first observation that stands out is the absence of feedback in ten of the assessment bundles. In the seven cases where feedback was provided, four assessment bundles included feedback on both content and language (3 times by the same TEB: A02, A04, A05, once by ST: A15), in two instances the feedback only related to content-related aspects (both by ST&#x202F;s: A07, A14), and in the final case, only feedback on language was provided (by TEB: A03).</p>
					<table-wrap id="T000007">
						<caption>
							<title>Table&#x00A0;7</title>
							<p>Relative roles of content and language in assessment bundles</p>
						</caption>
						<table frame="void" cellpadding="5">
							<thead>
								<tr>
									<th>
										<p>Assessment and teacher</p>
									</th>
									<th>
										<p>Subject, domain/topic</p>
									</th>
									<th>
										<p>(1) Focus/goal of assessment (as described by teachers)</p>
									</th>
									<th colspan="2">
										<p>Role of content and language (as apparent in assessment criteria and graded student work)</p>
									</th>
								</tr>
								<tr>
									<th>
										<p></p>
									</th>
									<th>
										<p>Assessment type</p>
									</th>
									<th>
										<p></p>
									</th>
									<th>
										<p><italic>(2) Weighting</italic></p>
									</th>
									<th>
										<p><italic>(3) Feedback</italic></p>
									</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td>
										<p>A01; T01_TEB</p>
									</td>
									<td>
										<p>English; creative writing</p>
										<p></p>
										<p>Diary entry</p>
										<p></p>
									</td>
									<td>
										<p>Creativity, spelling/grammar, word choice, fluidity, organization</p>
									</td>
									<td>
										<p>Language: 60&#x202F;%</p>
										<p><italic>Content:</italic> 20&#x202F;% (development: use of supporting details)</p>
										<p><italic>Other:</italic> 20&#x202F;% (creativity)</p>
									</td>
									<td>
										<p>No feedback (other than completed rubric)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A02; T02_TEB</p>
									</td>
									<td>
										<p>English; speaking</p>
										<p></p>
										<p>Speech</p>
									</td>
									<td>
										<p>Mostly speaking skills, also critical thinking, presenting/organizing information</p>
									</td>
									<td>
										<p>Language: 60&#x202F;%</p>
										<p><italic>Content:</italic> 10&#x202F;% (subject knowledge: was the subject appropriate for the topic/did speaker appear knowledgeable)</p>
										<p><italic>Other:</italic> 30&#x202F;% (presenting skills)</p>
									</td>
									<td>
										<p>Feedback on both language and content</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A03; T02_TEB</p>
									</td>
									<td>
										<p>English; grammar/ writing</p>
										<p></p>
										<p>Gap-fill tasks + application letter</p>
									</td>
									<td>
										<p>Mostly grammar, also vocabulary, writing skills</p>
									</td>
									<td>
										<p>Language: 100&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>Feedback on language</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A04; T02_TEB</p>
									</td>
									<td>
										<p>English; creative writing</p>
										<p></p>
										<p>Short story</p>
									</td>
									<td>
										<p>Mostly creative writing skills; also grammar, vocabulary, text organization</p>
									</td>
									<td>
										<p>Language: 70&#x202F;%</p>
										<p><italic>Content:</italic> 30&#x202F;% (plot, character, idea/message)</p>
									</td>
									<td>
										<p>Feedback on both language and content</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A05; T02_TEB</p>
									</td>
									<td>
										<p>English; literature</p>
										<p></p>
										<p>Booklet with tasks concerning a novel</p>
									</td>
									<td>
										<p>Mostly understanding of novel; also writing, executive skills (organizing work, keeping up with homework, etc.)</p>
									</td>
									<td>
										<p>Language: 33&#x202F;%</p>
										<p><italic>Content:</italic> 27&#x202F;% (accurate completion of tasks)</p>
										<p><italic>Other:</italic> 40&#x202F;% (creativity; active participation)</p>
									</td>
									<td>
										<p>Feedback on both language and content</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A06; T03_ST</p>
									</td>
									<td>
										<p>Geography; geology</p>
										<p></p>
										<p>Research presentation</p>
									</td>
									<td>
										<p>General system of plate tectonics; specifics on one kind of natural disaster</p>
									</td>
									<td>
										<p>Content: 40&#x202F;%</p>
										<p><italic>Language:</italic> 7&#x202F;% (use of language)</p>
										<p><italic>Other:</italic> 53&#x202F;% (presentation skills)</p>
									</td>
									<td>
										<p>No feedback (other than completed rubric)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A07; T04_ST</p>
									</td>
									<td>
										<p>History; World War&#x00A0;I propaganda</p>
										<p></p>
										<p>Research report</p>
									</td>
									<td>
										<p>Knowledge of propaganda techniques, analyzing propaganda, creating propaganda</p>
									</td>
									<td>
										<p>Content: 100&#x202F;%</p>
										<p>Language: 0&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>Feedback on content</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A08; T05_ST</p>
									</td>
									<td>
										<p>Geography; wealth differences</p>
										<p></p>
										<p>Research report</p>
									</td>
									<td>
										<p>Identifying indicators used to measure development; ranking countries based on data; analysing research data; explaining ranking of countries</p>
									</td>
									<td>
										<p>Content: 60&#x202F;%</p>
										<p><italic>Language:</italic> 20&#x202F;% (sentence fluency)</p>
										<p><italic>Other:</italic> 20&#x202F;% (design, lay-out, organization)</p>
										<p></p>
									</td>
									<td>
										<p>No feedback (other than completed rubric)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A09; T06_ST</p>
									</td>
									<td>
										<p>History; Napoleon</p>
										<p></p>
										<p>Research presentation or report</p>
									</td>
									<td>
										<p>Research, presentation</p>
									</td>
									<td>
										<p>Content: 57&#x202F;%</p>
										<p><italic>Language:</italic> 14&#x202F;% (use of own words)</p>
										<p><italic>Other:</italic> 20&#x202F;% (structure, lay-out, references)</p>
										<p></p>
									</td>
									<td>
										<p>No feedback (other than completed rubric)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A10; T06_ST</p>
									</td>
									<td>
										<p>History; Cold War</p>
										<p></p>
										<p>Written test with closed and open questions</p>
									</td>
									<td>
										<p>(Use of) knowledge</p>
									</td>
									<td>
										<p>Content: 100&#x202F;%</p>
										<p>Language: 0&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>No feedback</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A11; T07_ST</p>
									</td>
									<td>
										<p>Biology; reproduction</p>
										<p></p>
										<p>Written test with closed and open questions</p>
									</td>
									<td>
										<p>Learning and understanding skills</p>
									</td>
									<td>
										<p>Content: 100&#x202F;%</p>
										<p>Language: 0&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>Unknown (no student work available)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A12; T08_ST</p>
									</td>
									<td>
										<p>Biology; senses, heredity and evolution</p>
										<p></p>
										<p>Written test with closed and open questions</p>
									</td>
									<td>
										<p>Names of parts of the senses and their functions, how evolution works (describe in their own words), how Mendelian genetics works, making Punnet squares and calculations</p>
									</td>
									<td>
										<p>Content: 100&#x202F;%</p>
										<p>Language: 0&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>No feedback</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A13 (a &amp; b); T09_ST</p>
									</td>
									<td>
										<p>Science; light</p>
										<p></p>
										<p>Research report and product/design (spectroscope)</p>
									</td>
									<td>
										<p>Writing a scientific report; following instructions for experiment of their choice; explaining phenomena related to light that occurred with experiment</p>
									</td>
									<td>
										<p>Content: 86%</p>
										<p><italic>Language:</italic> 7&#x202F;% (accurate use of (scientific) language)</p>
										<p><italic>Other:</italic> 7&#x202F;% (lay-out)</p>
										<p></p>
									</td>
									<td>
										<p>No feedback (other than completed rubric)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A14; T10_ST</p>
									</td>
									<td>
										<p>Biology; food and digestion</p>
										<p></p>
										<p>Written test with closed and open questions</p>
									</td>
									<td>
										<p>Theory</p>
									</td>
									<td>
										<p>Content: 100&#x202F;%</p>
										<p>Language: 0&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>On content; one instance of feedback on language (&#x2018;strange sentence&#x2019;)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A15; T11_ST</p>
									</td>
									<td>
										<p>Maths; statistics</p>
										<p></p>
										<p>Research report</p>
									</td>
									<td>
										<p>Basic concepts of statistics: calculating mean, median, mode; being able to make several kinds of charts</p>
									</td>
									<td>
										<p>Content: 90&#x202F;%</p>
										<p><italic>Language:</italic> 10&#x202F;% (use of English)</p>
										<p></p>
									</td>
									<td>
										<p>Feedback on content and language (i.e., grammar, spelling, punctuation)</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A16; T12_ST</p>
									</td>
									<td>
										<p>Chemistry; classification of substances, reactions and energy</p>
										<p></p>
										<p>Written test with open questions</p>
									</td>
									<td>
										<p>Recognising an oxide from its chemical symbol, writing balanced chemical equations, understanding of the different types of chemical bonding using Bohr diagrams</p>
									</td>
									<td>
										<p>Content: 100&#x202F;%</p>
										<p>Language: 0&#x202F;%</p>
										<p></p>
									</td>
									<td>
										<p>No feedback</p>
									</td>
								</tr>
								<tr>
									<td>
										<p>A17; T13_ST</p>
									</td>
									<td>
										<p>Biology; plants</p>
										<p></p>
										<p>Research report</p>
									</td>
									<td>
										<p>Research skills: designing an experimental set-up, executing a research plan, collecting data, making tables/graphs, reflecting on process, writing a lab report</p>
									</td>
									<td>
										<p>Content: 93&#x202F;%</p>
										<p><italic>Language:</italic> 7&#x202F;% (grammar, spelling, punctuation)</p>
										<p></p>
									</td>
									<td>
										<p>No feedback (other than completed rubric)</p>
									</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
					<p>In most cases, what teachers describe to be the focus of their assessments, the assessment criteria, and the foci in their feedback, are aligned. In some bundles, however, this is not the case. Firstly, most ST&#x202F;s who take linguistic aspects into account when grading do not make explicit reference to those aspects in their description of assessment goals (Table&#x00A0;7, column&#x00A0;3). Secondly, in the two cases where there is more implicit mention of linguistic aspects such as subject-specific language functions or genres in ST&#x202F;s&#x2019; assessment descriptions (i.e., A13, A17), only in one case (A13) do the assessment criteria also foreground the use of subject-specific language (although formulated quite broadly: &#x2018;<italic>accurate use of (scientific) language</italic>&#x2019;). In A17 on the other hand, while the focus in the description is on a subject-specific genre (&#x2018;<italic>writing a lab-report</italic>&#x2019;), the assessment criteria concern accurate use of grammar, spelling, and punctuation. Finally, in one case (A14), the ST provided feedback on language (one instance only) whereas language was not a focus in grading or assessment criteria.</p>
				</sec>
				<sec sec-type="heading-3">
					<label>3.2.2</label>
					<title>Accommodations and translanguaging in assessment</title>
					<p>In the background questionnaire, we asked teachers which accommodations they allowed their students to use during assessment. We also analysed assessment instructions to determine how teachers used accommodations and translanguaging. Moreover, assessment tasks as well as student products were examined to establish which languages were used in assessments.</p>
					<p>With respect to accommodations, a clear difference emerged between tests and assignments. While no accommodations were apparently incorporated or allowed during tests, a variety of support materials were available during completion of assignments: namely, online sources/internet (5&#x00D7;); any type of resource (3&#x00D7;); books (3&#x00D7;); (online) dictionaries (2&#x00D7;); lesson materials, such as PowerPoints or instructional videos (2&#x00D7;); cue cards (1&#x00D7;); ask the teacher (1&#x00D7;). Conversely, no examples were found of accommodations incorporated in assignment format, such as supporting visuals, graphic organisers or writing frames.</p>
					<p>Regarding the use of translanguaging, a similarly monolingual image arises from the assessment bundles as from the questionnaire data. In fact, while a small number of questionnaire respondents indicated that they provided some support in Dutch, only English was used in the actual assessment instructions and formats submitted for analysis. Likewise, in the assessment bundles, students were only allowed to use English to complete the tasks.</p>
				</sec>
				<sec sec-type="heading-3">
					<label>3.2.3</label>
					<title>Cognitive and linguistic demands of assessments</title>
					<p>To map the cognitive and linguistic demands of the assignments, we used the framework proposed by Lo and Lin (2014), described in the Method section. Figure 1 illustrates where the assessments can be positioned in the matrix. Whereas Lo and Lin differentiate between assessments that require comprehension only (i.e. receptive level) and assessments in which students need to produce language (i.e. productive level), we chose not to make this distinction in Figure 1, as all assessments analysed necessitated language reception (e.g. reading assessment task, consulting sources) as well as production (e.g. answering open test questions, writing a research report). Moreover, assessments often pose demands on different levels (e.g. content analysis also implies content recall; production at text level inherently requires production at sentence level). We therefore chose to classify assessments based on the highest (cognitive and linguistic) level they require.</p>
					<p>Three of the five assessments submitted by TEB&#x202F;s (A01, A03, A04) could not be placed in the matrix. Assessment A03, a fully language-focused test, could only be scored on the linguistic dimension, and therefore does not appear in Figure&#x00A0;1. In A01 and A04, the aspects identified by teachers to represent content relate to creative writing (e.g. plot, character details). From the data provided in the assessment bundles, it was not clear to what extent these aspects followed explicit instruction on literary competence, or whether they were simply vehicles for learners to demonstrate their writing skills. This contrasts with the role of creative writing assignments in assessment A05, which build on exploration of the techniques used in a literary work. Furthermore, the lack of detail in the materials (which included learner work and completed rubrics, but no written instructions) prevented us from being able to gauge the cognitive demands of the tasks assigned. These two assessments therefore also do not appear in Figure 1.</p>
					<fig id="IMG000001" orientation="portrait" position="anchor">
						<caption>
							<title>Figure 1</title>
							<p>Cognitive and linguistic demands of assessments, based on Lo and Lin (2014)</p>
						</caption>
						<graphic xlink:href="DuJAL15756_i0001.png"/>
					</fig>
					<p>As Figure 1 demonstrates, both cognitive and linguistic demands of assessments are generally high. None of the assessments remain at the lower demand recall and/or vocabulary levels. Three assessments could be characterized to pose medium cognitive and linguistic demands (e.g. a written test with open questions in which students have to apply learned content, and formulate answers using full sentences). Two assessments likewise demand language production at the sentence level, but require cognitive processes at the analysis level (e.g. a written test with open questions which require students to perform an evaluation or comparison). The majority of assessments (9 out of the 14 assessments of which both the cognitive and linguistic demands could be analysed), however, can be positioned in the most demanding cell of the matrix. In those assessments, students are expected to perform complex cognitive processes and report on them in written or oral texts (e.g. conducting and reporting on an experiment).</p>
					<p>When we examine the years and educational tracks in which the different assessments were used, it does not seem to be the case that cognitive and/or linguistic demands increase with school year (1-3). In fact, the three year 1 assessments (A05, A08, A17) can all be characterized as maximally demanding, both cognitively and linguistically, whereas three out of the five year 3 assessments are less demanding (A10; A12, A16). Similarly, no clear pattern emerges with respect to the different educational tracks (pre-vocational education: <italic>vmbo</italic>, general higher education: <italic>havo</italic>; pre-university education: <italic>vwo</italic>). Both <italic>havo</italic> and <italic>vwo</italic> assessments can be found in the cell representing the lowest demands in our sample (application/sentence) as well as in the most demanding category (analysis/text). Since there is only one assessment which is (also) used in the <italic>vmbo</italic> track, no meaningful conclusion can be drawn about the demands of <italic>vmbo</italic> assessments as compared to assessments for <italic>havo</italic> and <italic>vwo</italic>.</p>
				</sec>
			</sec>
		</sec>
		<sec sec-type="heading-1">
			<label>4</label>
			<title>Discussion</title>
			<p>This small-scale study aimed to increase our understanding of ways in which teachers of English in bilingual education (TEB&#x202F;s) and CLIL subject teachers (ST&#x202F;s) in a Dutch CLIL context approach summative assessment. Two types of data collection were used: a self-report questionnaire to take inventory of perceived practices, and in-depth analysis of assessment materials to gain insight into actual (or operational) practices (van den Akker, 2003). The assessments submitted were largely written tests and assignments, in line with findings from previous CLIL research highlighting the predominance of written assessment (Otto, 2017; Reierstam &amp; Sylv&#x00E9;n, 2019). Together, the two data sources shed light on: (1) the interplay between content and language in these assessments; (2) the use of assessment accommodations and translanguaging as means to mitigate potential interference of lower L2 proficiency; and (3) the cognitive and linguistic demands assessments pose on students. Below, we discuss the significance of the findings in each of these areas, in relation to the limited literature on summative assessment practices in CLIL and other L2 content teaching settings.</p>
			<sec sec-type="heading-2">
				<label>4.1</label>
				<title>Relative roles of content and language</title>
				<p>It is perhaps not surprising that TEB&#x202F;s placed more emphasis on language in assessment, and ST&#x202F;s on content. This trend was apparent in both the self-report data and in the grading and feedback practices represented in the assessment bundles. This reflects earlier findings suggesting that teachers&#x2019; disciplinary identity influences the focus of their assessments (Otto &amp; Estrada, 2019).</p>
				<p>The content observed in TEB&#x202F;s&#x2019; assessment materials was not always linked to subject-relevant learning objectives (e.g. belonging to the fields of literature, cultural studies or linguistics), but in some cases was used as a vehicle to assess a linguistic goal (e.g. &#x2018;appearing knowledgeable&#x2019; when giving a speech). A complexity that we observed when analysing the materials provided by teachers was the position of creative writing. Teachers identified content criteria in creative writing exercises, although in two of the three examples of this, it was not clear whether and how those criteria aligned with prior learning related to literary competence, or if they were extensions of learners&#x2019; writing skills. Literature on CLIL rarely refers to creative writing, and when it does, it tends to be positioned as a means of processing input (e.g. exploring a scene from a play, as in Dale (2020)) or of practising writing skills (e.g. Weiss et al., 2023), rather than of demonstrating literary competence as a form of content. That teachers in this study appeared to consider creative writing to have a content goal, although that goal was not always aligned with our understanding of disciplinary content, highlights the lack of clarity regarding what constitutes disciplinary content in the language classroom.</p>
				<p>When ST&#x202F;s assessed language, they appeared to pay most attention to lower-order aspects of language (e.g. spelling, grammar, punctuation, pronunciation) or to unspecified language &#x201C;use&#x201D;, in spite of assessments being linguistically and cognitively demanding. While self-report data suggested that subject-specific language use was sometimes a focus of grading and feedback, this was rarely reflected in the assessment bundles. This echoes earlier findings from the Dutch CLIL context (Busz et al., 2014), where teachers provided micro-level feedback similar to that found in the current study, as opposed to feedback on subject-specific communication. Our findings might suggest that the increasing emphasis on disciplinary literacies in CLIL (Coyle &amp; Meyer, 2021) as key to subject learning is not yet reflected in teaching practice, at least in terms of assessment. As Lo et al. (2019) emphasise, this could limit CLIL&#x2019;s effectiveness in terms of the development of subject-specific language, as washback from assessment could lead ST&#x202F;s to neglect the linguistic aspects of their subject in their teaching. Furthermore, in line with earlier findings by H&#x00F6;nig (2010), analysis of the assessment materials and self-report data suggest that language does play a role in ST&#x202F;s&#x2019; assessment of content, even when it is not identified as a goal of assessment or mentioned explicitly in instructions. As identified by Lo and Fung (2020), this could lead to underestimation of learners&#x2019; content knowledge or to learners being penalized on the basis of &#x201C;invisible&#x201D; language criteria (Llinares et al., 2012, p.&#x00A0;274).</p>
			</sec>
			<sec sec-type="heading-2">
				<label>4.2</label>
				<title>Accommodations and translanguaging</title>
				<p>CLIL teachers in this study reported that they allow room for a range of accommodations to support completion of summative assignments (e.g. projects, presentations), and to a lesser extent for timed tests taken in classroom settings. In the assessment bundles, we saw that learners were allowed to draw on diverse support materials when working on summative assignments, although we found few instances of embedded accommodations such as visual support or text modification, to help learners understand input or instructions, or graphic organisers to support output. This contrasts to findings from a similar exploration by Otto (2017), who found visual support and guided questioning to be the most common forms of accommodation in the CLIL assessments she analysed, although she too observed an absence of support for language production. As earlier studies have suggested that a variety of accommodation strategies is needed to respond to different learner needs (De Backer et al., 2019a), this appears to be an area in which CLIL teachers may be able to support their learners better. As the multiple-choice options in the questionnaire were limited, however, we do not know whether this trend extends beyond the seventeen assessments analysed. Further research could explore this aspect of CLIL assessment in more detail, in order to make more concrete recommendations.</p>
				<p>Translanguaging in assessment appeared not to be common practice. This confirms our expectation based on the known tendency for languages other than English to be excluded from CLIL classrooms in the Netherlands (van Kampen et al., 2018). On the few occasions where translanguaging was used to support learners or permitted in learners&#x2019; responses, teachers qualified their responses by stating that this was permitted only in the first year. This may reflect Oattes et al.&#x2019;s (2018) observation that ST&#x202F;s are implicitly aware that translanguaging can be a useful pedagogical tool, and tend to use it mostly with younger learners. In light of Oattes et al.&#x2019;s (2020) finding that assessing first year CLIL learners&#x2019; content knowledge in English can be detrimental to their performance, this might be considered a well-reasoned approach.</p>
			</sec>
			<sec sec-type="heading-2">
				<label>4.3</label>
				<title>Cognitive and linguistic demands</title>
				<p>As exemplified in the analysed assessment bundles, both cognitive and linguistic demands of content-focused assessment were generally high. This was the case irrespective of year (1, 2 or 3) and educational track (pre-vocational/<italic>vmbo</italic>, general/<italic>havo</italic> or pre-university/<italic>vwo</italic>). Lo and Fung (2020) warn about the risks this combination of high demands can pose to assessment validity due to learners&#x2019; developing L2 proficiency, as they found that increased linguistic demands adversely affected learners&#x2019; performance in terms of content. In the context of our study where, in contrast to Lo and Fung&#x2019;s findings, high demands on both axes were found in the earliest years of CLIL, and in light of Oattes et al.&#x2019;s (2020) conclusion that language can be a barrier to younger learners in CLIL content assessments, the question can be posed as to whether it is fair and valid to assess them in this way. That said, emphasis on disciplinary literacies as integral to content learning emphasizes the inextricable connection between language and cognition in any learning situation (Dalton-Puffer, 2013). From this perspective, higher cognitive demands and higher linguistic demands will likely go hand in hand, but need to be built-up in structured and scaffolded ways (Coyle &amp; Meyer, 2021). Rather than suggesting that teachers lower demands in assessments, it might be better to address the use of scaffolded approaches to teaching and assessment (e.g. using accommodations and translanguaging), in CLIL teacher education and professional development.</p>
			</sec>
		</sec>
		<sec sec-type="heading-1">
			<label>5</label>
			<title>Conclusions and implications</title>
			<p>This study was limited by its relatively small number of respondents. Nonetheless, the combination of exploring practice on perceived and operational levels allowed for in-depth analysis and triangulation, in order to paint a preliminary picture of current CLIL assessment practices in bilingual lower-secondary education in the Netherlands. While its findings cannot be generalised to the whole population of CLIL teachers in the Netherlands, we hope this study will be a springboard for further research, for example examining the alignment between classroom teaching and assessment, or exploring assessment programmes as a whole, to identify whether the distribution of attention for language and content balances out over the course of an academic year or more. In-depth interviews with teachers could be a valuable supplement to these approaches, in order to gain insight into the thinking behind assessment choices and in what teachers need in order to develop their practice further in this regard. In so doing, more emphasis could also be placed on exploring the relative roles and perspectives of language teachers and teachers of other subjects. As previous research has suggested that experience with CLIL may influence teachers&#x2019; practice in mainstream classes (Oattes et al., 2018), and considering the similarities between CLIL and teaching practices in other multilingual classrooms (Hajer, 2018), all of the areas mentioned above could be explored in relation to CLIL specifically, or in other multilingual or monolingual settings.</p>
			<p>Returning to Sato&#x2019;s (2023) model of &#x2018;separate&#x2019;, &#x2018;weakly integrated&#x2019; and &#x2018;strongly integrated&#x2019; approaches to CLIL assessment, the findings of this study suggest that, in practice, the lines between these approaches are blurred. As has also been found elsewhere, while ST&#x202F;s appeared to believe that they handled language and content separately, the intrinsic relationship between linguistic and content elements nonetheless appeared to play a role in assessment. Learners were often allowed access to support materials, which might contribute to mitigating interference of language proficiency in content assessment. On the whole, however, little evidence was found of adaptations in the format or design of assessments in order to directly support understanding or production of language, nor of opportunities to use translanguaging for this purpose, in spite of both cognitive and linguistic demands of assessments being high. English teachers, conversely, did pay explicit attention to language, but not in relation to meaningful content. This could be a missed opportunity to support learners&#x2019; development in the literacies of English as a discipline in its own right, in particular with an eye to the requirements of the content-heavy IB programme in the senior years.</p>
			<p>As the goal of CLIL is to support learning of both content and language in an integrated way, we would argue for also integrating and meaningfully addressing these elements in assessment, in ways that go beyond evaluating general &#x201C;language use&#x201D; in content subjects and &#x201C;appearing knowledgeable&#x201D; about content in language subjects. This can be achieved through high-demand, high-support approaches that acknowledge both content and language as integral to subject learning, while also acknowledging that language proficiency can affect communication of content knowledge, and therefore providing appropriate linguistic support. As emphasised in work on disciplinary literacies (e.g. Coyle &amp; Meyer, 2020), similar issues are at stake even when learning takes place in the learners&#x2019; L1, as subject learning inevitably involves developing proficiency in disciplinary discourse. Furthermore, addressing disciplinary content in L1 or any foreign language classrooms could help expose more learners to the benefits of content and language integration (Mearns &amp; Platteel, 2020; Michel et al., 2021). To this end, teachers in any setting could benefit from support in developing integrated assessment practices, with appropriate levels of support, through focused attention as part of teacher education and professional development, from a multilingual disciplinary literacies perspective. Coupling professional development activities such as these to teacher action research or professional learning communities could serve the dual purpose of supporting teachers&#x2019; development and providing valuable new insights for the broader CLIL and disciplinary literacies communities.</p>
		</sec>
	</body>
	<back>
		<ref-list>
			<title>References</title>
			<ref id="R000001">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Bentley</surname>, <given-names>K.</given-names></string-name></person-group> (<date><year>2010</year></date>). <source><italic>The TKT Course CLIL Module</italic></source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000002">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Biggs</surname>, <given-names>J.</given-names></string-name></person-group> (<date><year>1996</year></date>). <article-title>Enhancing teaching through constructive alignment</article-title>. <source><italic>Higher Education</italic></source>, <italic>32</italic>, <page-range>347</page-range>-<page-range>364</page-range>. <ext-link xlink:href="https://doi.org/10.1007/BF00138871">https://doi.org/10.1007/BF00138871</ext-link></mixed-citation>
			</ref>
			<ref id="R000003">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Busz</surname>, <given-names>M.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Helleman</surname>, <given-names>J.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>DeVincent</surname>, <given-names>D.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Verwoerd-Sowariraj</surname>, <given-names>S.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Tonsberg Schlie</surname>, <given-names>K.</given-names></string-name></person-group> (<date><year>2014</year></date>). <article-title>De praktijk van feedback op Engelstalige profielwerkstukken [The practice of giving feedback on extended essays written in English</article-title>]. <source><italic>Levende Talen Tijdschrift</italic></source>, <italic>15</italic>(<issue>4</issue>), <page-range>26</page-range>-<page-range>37</page-range>. <ext-link xlink:href="https://lt-tijdschriften.nl/ojs/index.php/ltt/article/view/903">https://lt-tijdschriften.nl/ojs/index.php/ltt/article/view/903</ext-link></mixed-citation>
			</ref>
			<ref id="R000004">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Butler</surname>, <given-names>F.A.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Stevens</surname>, <given-names>R.</given-names></string-name></person-group> (<date><year>1997</year></date>). <source><italic>Accommodation strategies for english language learners on large-scale assessments: Student characteristics and other considerations</italic></source>.</mixed-citation>
			</ref>
			<ref id="R000005">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Cenoz</surname>, <given-names>J.</given-names></string-name></person-group> (<date><year>2017</year></date>). <article-title>Translanguaging in school context</article-title>. <source>International perspectives. <italic>Journal of Language, Identity and Education</italic></source>, <italic>16</italic>(<issue>4</issue>), <page-range>193</page-range>-<page-range>198</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1080/15348458.2017.1327816">https://doi.org/https://doi.org/10.1080/15348458.2017.1327816</ext-link></mixed-citation>
			</ref>
			<ref id="R000006">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Coyle</surname>, <given-names>D.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Hood</surname>, <given-names>P.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Marsh</surname>, <given-names>D.</given-names></string-name></person-group> (<date><year>2010</year></date>). <source><italic>CLIL: Content and Language Integrated Learning</italic></source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000007">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Coyle</surname>, <given-names>D.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Meyer</surname>, <given-names>O.</given-names></string-name></person-group> (<date><year>2021</year></date>). <italic>Beyond CLIL: Pluriliteracies Teaching for Deeper Learning</italic>. Cambridge University Press.</mixed-citation>
			</ref>
			<ref id="R000008">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Dale</surname>, <given-names>L.</given-names></string-name></person-group> (<date><year>2020</year></date>). <italic>On Language Teachers and CLIL. Shifting the Perspectives</italic> University of Amsterdam]. Amsterdam. <ext-link xlink:href="https://dare.uva.nl/search?identifier=98c15569-9e2f-4542-8472-9515aa7bac7d">https://dare.uva.nl/search?identifier=98c15569-9e2f-4542-8472-9515aa7bac7d</ext-link></mixed-citation>
			</ref>
			<ref id="R000009">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Dale</surname>, <given-names>L.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Oostdam</surname>, <given-names>R.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Verspoor</surname>, <given-names>M.</given-names></string-name></person-group> (<date><year>2018</year></date>). <article-title>Juggling ideals and constraints. The position of English teachers in CLIL contexts</article-title>. <source><italic>Dutch Journal of Applied Linguistics</italic></source>, <italic>7</italic>(<issue>2</issue>), <page-range>177</page-range>-<page-range>202</page-range>. <ext-link xlink:href="https://doi.org/doi.org/10.1075/dujal.18002.dal">https://doi.org/doi.org/10.1075/dujal.18002.dal</ext-link></mixed-citation>
			</ref>
			<ref id="R000010">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Dale</surname>, <given-names>L.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Tanner</surname>, <given-names>R.</given-names></string-name></person-group> (<date><year>2012</year></date>). <source><italic>CLIL Activities: A Resource for Subject and Language Teachers</italic></source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000011">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Dalton-Puffer</surname>, <given-names>C.</given-names></string-name></person-group> (<date><year>2013</year></date>). <article-title>A construct of cognitive discourse functions for conceptualizing contentlanguage integration in CLIL and multilingual education</article-title>. <source><italic>European Journal of Applied Linguistics</italic></source>, <italic>1</italic>(<issue>2</issue>), <page-range>216</page-range>-<page-range>253</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1515/eujal-2013-0011">https://doi.org/https://doi.org/10.1515/eujal-2013-0011</ext-link></mixed-citation>
			</ref>
			<ref id="R000012">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Dalton-Puffer</surname>, <given-names>C.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Llinares</surname>, <given-names>A.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Lorenzo</surname>, <given-names>F.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Nikula</surname>, <given-names>T.</given-names></string-name></person-group> (<date><year>2014</year></date>). &#x201C;<article-title>You Can Stand Under My Umbrella&#x201D;: Immersion, CLIL and Bilingual Education. A Response to Cenoz</article-title>, <person-group person-group-type="editor"><string-name><given-names>Genesee</given-names></string-name></person-group> &amp; <person-group person-group-type="editor"><string-name><given-names>Gorter</given-names></string-name></person-group> (<date><year>2013</year></date>). <source><italic>Applied Linguistics</italic></source>, <italic>35</italic>(<issue>2</issue>), <page-range>213</page-range>-<page-range>218</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1093/applin/amu010">https://doi.org/https://doi.org/10.1093/applin/amu010</ext-link></mixed-citation>
			</ref>
			<ref id="R000013">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Dalton-Puffer</surname>, <given-names>C.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Bauer-Marschallinger</surname>, <given-names>S.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Br&#x00FC;ckl-Mackey</surname>, <given-names>K.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Hofmann</surname>, <given-names>V.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Hopf</surname>, <given-names>J.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Kr&#x00F6;ss</surname>, <given-names>L.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Lechner</surname>, <given-names>L.</given-names></string-name></person-group> (<date><year>2018</year></date>). <article-title>Cognitive discourse functions in Austrian CLIL lessons: towards an empirical validation of the CDF Construct</article-title>. <source><italic>European Journal of Applied Linguistics</italic></source>, <italic>6</italic>(<issue>1</issue>), <page-range>5</page-range>-<page-range>29</page-range>. <ext-link xlink:href="https://doi.org/doi:10.1515/eujal-2017-0028">https://doi.org/doi:10.1515/eujal-2017-0028</ext-link></mixed-citation>
			</ref>
			<ref id="R000014">
				<mixed-citation><person-group person-group-type="author"><string-name><suffix>De</suffix> <surname>Backer</surname>, <given-names>F.</given-names></string-name></person-group> (<date><year>2020</year></date>). <article-title>Bridging the gap between learning and evaluation. Lessons learnt from multilingual pupils</article-title>. <source><italic>Journal of Applied Linguistics and Professional Practice</italic></source>, <italic>14</italic>(<issue>1</issue>), <page-range>96</page-range>-<page-range>118</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1558/jalpp.39770">https://doi.org/https://doi.org/10.1558/jalpp.39770</ext-link></mixed-citation>
			</ref>
			<ref id="R000015">
				<mixed-citation><person-group person-group-type="author"><string-name><suffix>De</suffix> <surname>Backer</surname>, <given-names>F.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Baele</surname>, <given-names>J.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><suffix>van</suffix> <surname>Avermaet</surname>, <given-names>P.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Slembrouck</surname>, <given-names>S.</given-names></string-name></person-group> (<date><year>2019</year></date>a). <article-title>Pupils&#x2019; Perceptions on Accommodations in Multilingual Assessment of Science</article-title>. <source><italic>Language Assessment Quarterly</italic></source>, <italic>16</italic>(<issue>4-5</issue>), <page-range>426</page-range>-<page-range>446</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1080/15434303.2019.1666847">https://doi.org/https://doi.org/10.1080/15434303.2019.1666847</ext-link></mixed-citation>
			</ref>
			<ref id="R000016">
				<mixed-citation><person-group person-group-type="author"><string-name><suffix>De</suffix> <surname>Backer</surname>, <given-names>F.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Slembrouck</surname>, <given-names>S.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><suffix>van</suffix> <surname>Avermaet</surname>, <given-names>P.</given-names></string-name></person-group> (<date><year>2019</year></date>b). <article-title>Assessment accommodations for multilingual learners: pupils&#x2019; perceptions of fairness</article-title>. <source><italic>Journal of Multilingual and Multicultural Development</italic></source>, <italic>40</italic>(<issue>9</issue>). <ext-link xlink:href="https://doi.org/https://doi.org/10.1080/01434632.2019.1571596">https://doi.org/https://doi.org/10.1080/01434632.2019.1571596</ext-link></mixed-citation>
			</ref>
			<ref id="R000017">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Hajer</surname>, <given-names>M.</given-names></string-name></person-group> (<date><year>2018</year></date>). <article-title>Teaching content through Dutch as a second language. How &#x2018;Language Oriented Content Teaching&#x2019; unfolded in mainstream secondary education</article-title>. <source><italic>Dutch Journal of Applied Linguistics</italic></source>, <italic>7</italic>(<issue>2</issue>), <page-range>246</page-range>-<page-range>263</page-range>. <ext-link xlink:href="https://doi.org/10.1075/dujal.18001.haj">https://doi.org/10.1075/dujal.18001.haj</ext-link></mixed-citation>
			</ref>
			<ref id="R000018">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>H&#x00F6;nig</surname>, <given-names>I.</given-names></string-name></person-group> (<date><year>2010</year></date>). <article-title>Assessment in CLIL &#x2013; A case study</article-title>. <source><italic>Vienna English Working Papers: Current Research on CLIL</italic></source>, <italic>19</italic>(<issue>3</issue>).</mixed-citation>
			</ref>
			<ref id="R000019">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Llinares</surname>, <given-names>A.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Morton</surname>, <given-names>T.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Whittaker</surname>, <given-names>R.</given-names></string-name></person-group> (<date><year>2012</year></date>). <source><italic>The Roles of Language in CLIL</italic></source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000020">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Lo</surname>, <given-names>Y.Y.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Fung</surname>, <given-names>D.</given-names></string-name></person-group> (<date><year>2020</year></date>). <article-title>Assessments in CLIL: the interplay between cognitive and linguistic demands and their progression in secondary education</article-title>. <source><italic>International Journal of Bilingual Education and Bilingualism</italic></source>, <italic>23</italic>(<issue>10</issue>), <page-range>1192</page-range>-<page-range>1210</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1080/13670050.2018.1436519">https://doi.org/https://doi.org/10.1080/13670050.2018.1436519</ext-link></mixed-citation>
			</ref>
			<ref id="R000021">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Lo</surname>, <given-names>Y.Y.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Lin</surname>, <given-names>A.M.Y.</given-names></string-name></person-group> (<date><year>2014</year></date>). <article-title>Designing assessment tasks with language awareness: Balancing cognitive and linguistic demands</article-title>. <source><italic>Assessment and Learning</italic></source>, <italic>3</italic>, <page-range>97</page-range>-<page-range>119</page-range>.</mixed-citation>
			</ref>
			<ref id="R000022">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Lo</surname>, <given-names>Y.Y.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Lui</surname>, <given-names>W.-M.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Wong</surname>, <given-names>M.</given-names></string-name></person-group> (<date><year>2019</year></date>). <article-title>Scaffolding for cognitive and linguistic challenges in CLIL science assessments</article-title>. <source><italic>Journal of Immersion and Content-Based Language Education</italic></source>, <italic>7</italic>(<issue>2</issue>), <page-range>289</page-range>-<page-range>314</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1075/jicb.18028.lo">https://doi.org/https://doi.org/10.1075/jicb.18028.lo</ext-link></mixed-citation>
			</ref>
			<ref id="R000023">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Mearns</surname>, <given-names>T.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><suffix>de</suffix> <surname>Graaff</surname>, <given-names>R.</given-names></string-name></person-group> (<date><year>2018</year></date>). <article-title>Bilingual education and CLIL in the Netherlands. The paradigm and the pedagogy</article-title>. <source><italic>Dutch Journal of Applied Linguistics</italic></source>, <italic>7</italic>(<issue>2</issue>), <page-range>122</page-range>-<page-range>128</page-range>. <ext-link xlink:href="https://doi.org/10.1075/dujal.00002.int">https://doi.org/10.1075/dujal.00002.int</ext-link></mixed-citation>
			</ref>
			<ref id="R000024">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Mearns</surname>, <given-names>T.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Platteel</surname>, <given-names>T.</given-names></string-name></person-group> (<date><year>2020</year></date>). <article-title>Exploring teacher support for a content and language integrated modern languages curriculum</article-title>. <source><italic>Language, Culture and Curriculum</italic></source>, <page-range>1</page-range>-<page-range>17</page-range>. <ext-link xlink:href="https://doi.org/10.1080/07908318.2020.1809665">https://doi.org/10.1080/07908318.2020.1809665</ext-link></mixed-citation>
			</ref>
			<ref id="R000025">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Mearns</surname>, <given-names>T.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><suffix>van</suffix> <surname>Kampen</surname>, <given-names>E.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Admiraal</surname>, <given-names>W.</given-names></string-name></person-group> (<date><year>2023</year></date>). <chapter-title>CLIL in The Netherlands: Three decades and innovation and development</chapter-title>. In <person-group person-group-type="editor"><string-name><given-names>D.L.</given-names> <surname>Banegas</surname></string-name></person-group> &amp; <person-group person-group-type="editor"><string-name><given-names>S.</given-names> <surname>Zappa-Hollman</surname></string-name></person-group> (Eds.), <source><italic>The Routledge Handbook of Content and Language Integrated Learning</italic></source> (pp. <page-range>403</page-range>-<page-range>418</page-range>). <publisher-name>Routledge</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000026">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Mehisto</surname>, <given-names>P.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Ting</surname>, <given-names>T.</given-names></string-name></person-group> (<date><year>2017</year></date>). <source><italic>CLIL essentials for secondary school teachers</italic></source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000027">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Michel</surname>, <given-names>M.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Vidon</surname>, <given-names>C.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><suffix>De</suffix> <surname>Graaff</surname>, <given-names>R.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Lowie</surname>, <given-names>W.</given-names></string-name></person-group> (<date><year>2021</year></date>). <article-title>Language Learning beyond English in the Netherlands: A fragile future</article-title>. <source><italic>European Journal of Applied Liguistics</italic></source>, <italic>9</italic>(<issue>1</issue>), <page-range>159</page-range>-<page-range>182</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1515/eujal-2020-0020">https://doi.org/https://doi.org/10.1515/eujal-2020-0020</ext-link></mixed-citation>
			</ref>
			<ref id="R000028">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Morton</surname>, <given-names>T.</given-names></string-name></person-group> (<date><year>2020</year></date>). <article-title>Cognitive Discourse Functions: A Bridge between Content, Literacy and Language for Teaching and Assessment in CLIL</article-title>. <source><italic>CLIL Journal of Innovation and Research in Plurilingual and Pluricultural Education</italic></source>, <italic>3</italic>(<issue>1</issue>), <page-range>7</page-range>-<page-range>17</page-range>.<ext-link xlink:href="https://doi.org/https://doi.org/10.5565/rev/clil.33">https://doi.org/https://doi.org/10.5565/rev/clil.33</ext-link></mixed-citation>
			</ref>
			<ref id="R000029">
				<mixed-citation><person-group person-group-type="author"><string-name><given-names>Nuffic</given-names></string-name></person-group>. (<date><year>2019</year></date>). <source><italic>Kwaliteitsstandaard tweetalig onderwijs 2.0</italic> [Quality Standard for Bilingual Education 2.0</source>]. The Hague: Nuffic Retrieved from <ext-link xlink:href="https://www.nuffic.nl/publicaties/kwaliteitsstandaard-tweetalig-onderwijs-20/">https://www.nuffic.nl/publicaties/kwaliteitsstandaard-tweetalig-onderwijs-20/</ext-link></mixed-citation>
			</ref>
			<ref id="R000030">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Oattes</surname>, <given-names>H.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Fukkink</surname>, <given-names>R.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Oostdam</surname>, <given-names>R.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><suffix>de</suffix> <surname>Graaff</surname>, <given-names>R.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Wilschut</surname>, <given-names>A.</given-names></string-name></person-group> (<date><year>2020</year></date>). <article-title>A showdown between bilingual and mainstream education: the impact of language of instruction on learning subject content knowledge</article-title>. <source><italic>International Journal of Bilingual Education and Bilingualism</italic></source>, <page-range>1</page-range>-<page-range>14</page-range>. <ext-link xlink:href="https://doi.org/10.1080/13670050.2020.1718592">https://doi.org/10.1080/13670050.2020.1718592</ext-link></mixed-citation>
			</ref>
			<ref id="R000031">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Oattes</surname>, <given-names>H.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Oostdam</surname>, <given-names>R.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><suffix>de</suffix> <surname>Graaff</surname>, <given-names>R.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Wilschut</surname>, <given-names>A.</given-names></string-name></person-group> (<date><year>2018</year></date>). <article-title>The challenge of balancing content and language: Perceptions of Dutch bilingual education history teachers</article-title>. <source><italic>Teaching and Teacher Education</italic></source>, <italic>70</italic>, <page-range>165</page-range>-<page-range>174</page-range>. <ext-link xlink:href="https://doi.org/10.1016/j.tate.2017.11.022">https://doi.org/10.1016/j.tate.2017.11.022</ext-link></mixed-citation>
			</ref>
			<ref id="R000032">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Otto</surname>, <given-names>A.</given-names></string-name></person-group> (<date><year>2017</year></date>). <source><italic>Assessment in CLIL: The Balance Between the Content and the Language. Madrid Bilingual Secondary Schools as a Case Study</italic></source> [Unpublished, <publisher-name>University of Alcal&#x00E1;</publisher-name>]. <publisher-loc>Madrid</publisher-loc>.</mixed-citation>
			</ref>
			<ref id="R000033">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Otto</surname>, <given-names>A.</given-names></string-name></person-group> (<date><year>2019</year></date>). <article-title>Assessing Language in Content and Language Integrated Learning: A Review of the Literature towards a Functional Model</article-title>. <source><italic>Latin American Journal of Content &amp; Language Integrated Learning</italic></source>, <italic>11</italic>(<issue>2</issue>). <ext-link xlink:href="https://doi.org/10.5294/laclil.2018.11.2.6">https://doi.org/10.5294/laclil.2018.11.2.6</ext-link></mixed-citation>
			</ref>
			<ref id="R000034">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Otto</surname>, <given-names>A.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Estrada</surname>, <given-names>J.L.</given-names></string-name></person-group> (<date><year>2019</year></date>). <article-title>Towards an Understanding of CLIL in a European Context: Main Assessment Tools and the Role of Language in Content Subjects</article-title>. <source><italic>CLIL Journal of Innovation and Research in Plurilingual and Pluricultural Education</italic></source>, <italic>2</italic>(<issue>1</issue>), <page-range>31</page-range>-<page-range>42</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.5565/rev/clil.11">https://doi.org/https://doi.org/10.5565/rev/clil.11</ext-link></mixed-citation>
			</ref>
			<ref id="R000035">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Reierstam</surname>, <given-names>H.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Sylv&#x00E9;n</surname>, <given-names>L.K.</given-names></string-name></person-group> (<date><year>2019</year></date>). <chapter-title>Assessment in CLIL</chapter-title>. In <person-group person-group-type="editor"><string-name><given-names>L.K.</given-names> <surname>Sylv&#x00E9;n</surname></string-name></person-group> (Ed.), <source><italic>Investigating Content and Language Integrated Learning: Insights from Swedish High Schools</italic>. Multilingual Matters</source>.</mixed-citation>
			</ref>
			<ref id="R000036">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Sato</surname>, <given-names>T.</given-names></string-name></person-group> (<date><year>2023</year></date>). <chapter-title>Assessment in CLIL</chapter-title>. In <person-group person-group-type="editor"><string-name><given-names>D.L.</given-names> <surname>Banegas</surname></string-name></person-group> &amp; <person-group person-group-type="editor"><string-name><given-names>S.</given-names> <surname>Zappa-Hollman</surname></string-name></person-group> (Eds.), <source><italic>The Routledge Handbook of Content and Language Integrated Learning</italic></source>. <publisher-name>Routledge</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000037">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Serra</surname>, <given-names>C.</given-names></string-name></person-group> (<date><year>2007</year></date>). <article-title>Assessing CLIL at Primary School: A Longitudinal Study</article-title>. <source><italic>International Journal of Bilingual Education and Bilingualism</italic></source>, <italic>10</italic>(<issue>5</issue>), <page-range>582</page-range>-<page-range>602</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.2167/beb461.0">https://doi.org/https://doi.org/10.2167/beb461.0</ext-link></mixed-citation>
			</ref>
			<ref id="R000038">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Sluijsmans</surname>, <given-names>D.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Joosten-ten Brinke</surname>, <given-names>D.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><suffix>van der</suffix> <surname>Vleuten</surname>, <given-names>C.</given-names></string-name></person-group> (<date><year>2013</year></date>). <source><italic>Toetsen met leerwaarde; Een reviewstudie naar de effectieve kenmerken van formatief toetsen</italic> [Testing with value for learning; A review study on characteristics of effective formative assessment</source>].</mixed-citation>
			</ref>
			<ref id="R000039">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Trenkic</surname>, <given-names>D.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Warmington</surname>, <given-names>M.</given-names></string-name></person-group> (<date><year>2019</year></date>). <article-title>Language and literacy skills of home and international university students: How different are they, and does it matter?</article-title> <source><italic>Bilingualism: Language and Cognition</italic></source>, <italic>22</italic>(<issue>2</issue>), <page-range>349</page-range>-<page-range>365</page-range>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1017/S136672891700075X">https://doi.org/https://doi.org/10.1017/S136672891700075X</ext-link></mixed-citation>
			</ref>
			<ref id="R000040">
				<mixed-citation><person-group person-group-type="author"><string-name><suffix>van</suffix> <surname>Beuningen</surname>, <given-names>C.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Poli&#x0161;ensk&#x00E1;</surname>, <given-names>D.</given-names></string-name></person-group> (<date><year>2019</year></date>). <article-title>Meertaligheid in het voortgezet onderwijs: Een inventarisatiestudie naar opvattingen en praktijken van talendocenten [Multilingualism in secondary education: An inventory study into the views and practices of language teachers</article-title>]. <source><italic>Levende Talen Tijdschrift</italic></source>, <italic>20</italic>(<issue>4</issue>), <page-range>25</page-range>-<page-range>36</page-range>.</mixed-citation>
			</ref>
			<ref id="R000041">
				<mixed-citation><person-group person-group-type="author"><string-name><suffix>van den</suffix> <surname>Akker</surname>, <given-names>J.</given-names></string-name></person-group> (<date><year>2003</year></date>). <chapter-title>Curriculum perspectives: An introduction</chapter-title>. In <person-group person-group-type="editor"><string-name><given-names>J.</given-names> <suffix>van den</suffix> <surname>Akker</surname></string-name></person-group>, <person-group person-group-type="editor"><string-name><given-names>W.</given-names> <surname>Kuiper</surname></string-name></person-group>, &amp; <person-group person-group-type="editor"><string-name><given-names>U.</given-names> <surname>Hameyer</surname></string-name></person-group> (Eds.), <source><italic>Curriculum landscapes and trends</italic></source> (pp. <page-range>1</page-range>-<page-range>10</page-range>). <publisher-name>Kluwer</publisher-name>. <ext-link xlink:href="https://doi.org/https://doi.org/10.1007/978-94-017-1205-7">https://doi.org/https://doi.org/10.1007/978-94-017-1205-7</ext-link></mixed-citation>
			</ref>
			<ref id="R000042">
				<mixed-citation><person-group person-group-type="author"><string-name><suffix>van</suffix> <surname>Kampen</surname>, <given-names>E.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Mearns</surname>, <given-names>T.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Meirink</surname>, <given-names>J.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Admiraal</surname>, <given-names>W.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>Berry</surname>, <given-names>A.</given-names></string-name></person-group> (<date><year>2018</year></date>). <article-title>How do we measure up? A review of Dutch CLIL subject pedagogies against an international backdrop</article-title>. <source><italic>Dutch Journal of Applied Linguistics</italic></source>, <italic>7</italic>(<issue>2</issue>), <page-range>129</page-range>-<page-range>155</page-range>. <ext-link xlink:href="https://doi.org/10.1075/dujal.18004.kam">https://doi.org/10.1075/dujal.18004.kam</ext-link></mixed-citation>
			</ref>
			<ref id="R000043">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Weiss</surname>, <given-names>D.</given-names></string-name></person-group>, <person-group person-group-type="author"><string-name><surname>Gierlinger</surname>, <given-names>E.M.</given-names></string-name></person-group>, &amp; <person-group person-group-type="author"><string-name><surname>H&#x00FC;tter</surname>, <given-names>J.</given-names></string-name></person-group> (<date><year>2023</year></date>). <chapter-title>Teaching genre-based writing in CLIL secondary education. The implementation of a language learning model in the subject Pedagogy</chapter-title>. In <person-group person-group-type="editor"><string-name><given-names>E.M.</given-names> <surname>Gierlinger</surname></string-name></person-group>, <person-group person-group-type="editor"><string-name><given-names>M.</given-names> <surname>D&#x00F6;ll</surname></string-name></person-group>, &amp; <person-group person-group-type="editor"><string-name><given-names>G.</given-names> <surname>Keplinger</surname></string-name></person-group> (Eds.), <source><italic>TALK in Multilingual Classrooms</italic></source> (pp. <page-range>261</page-range>-<page-range>296</page-range>). <publisher-name>Waxmann</publisher-name>.</mixed-citation>
			</ref>
			<ref id="R000044">
				<mixed-citation><person-group person-group-type="author"><string-name><surname>Yang</surname>, <given-names>X.</given-names></string-name></person-group> (<date><year>2020</year></date>). <article-title>Assessment accommodations for emergent bilinguals in mainstream classroom assessments: a targeted literature review</article-title>. <source><italic>International Multilingual Research Journal</italic></source>, <italic>14</italic>(<issue>3</issue>), <page-range>217</page-range>-<page-range>232</page-range>. <ext-link xlink:href="https://doi.org/10.1080/19313152.2019.1681615">https://doi.org/10.1080/19313152.2019.1681615</ext-link></mixed-citation>
			</ref>
		</ref-list>
	</back>
</article>