Audiovisual text (hereinafter AVT) is defined by Chaume (2004, 16) as âa semiotic construct comprising several signifying codes that operate simultaneously in the production of meaningâ and is regarded as one of the most prominent polysemiotic text types. Subtitling, an indispensable part of AVT study, is defined by DĂaz Cintas and Remael (2007, 8) as consisting of the presentation of
a written text, generally on the lower part of the screen, that endeavors to recount the original dialogue of the speakers, as well as the discursive elements that appear in the image (letters, inserts, graffiti, inscriptions, placards and the like), and the information that is contained on the soundtrack.
Over the past two decades, scholars (Zabalbeascoa 1996; Chuang 2006) have studied film translation from a polysemiotic perspective and claimed that âno text can be made entirely of verbal signs because such signs always need some sort of physical supportâ (Zabalbeascoa 1996, 338). Other scholars have started to explore the interaction between verbal and non-verbal elements in subtitle translation (Baldry and Thibault 2006; Gottlieb 2008; Kourdis 2015). However, up to now, scholarly attention in translation studies has mainly concentrated on verbal texts while the function of non-verbal elements such as intonation, layout, body language, and facial expression has been largely unnoticed (Zabalbeascoa 2008, 24). As Perez-Gonzalez (2014, 185) observes, âa large proportion of research efforts in audiovisual translation still revolve around elaborating taxonomies of different types of equivalence between short, decontextualized stretches of dialogue in the source and target language.â
Although research from the polysemiotic approach started decades ago, it is noticed that the aims of previous studies were rather general and the approaches were lack of theoretical support (Yu and Song 2017, 604; Lee 2018, 58; EI-Farahaty 2018, 37). When it comes to the study of AVT translation between Chinese and English, very few scholarly works could be found. A notable study is Chuang, who noted that âdifferent semiotic modes contribute different kinds of meanings to the film textâ (2006, 381). But the author did not give a thorough theoretical account of how those semiotic modes affect the Chinese-English translation. In order to provide further understanding about how the visual track impacts translation choices, this study proposes a theoretical framework to investigate Chinese-English AVT translation and applies it to the analysis of a Chinese costume drama.
Costume dramas, normally setting against historical background and unique conventions in ancient China, have always played an important role in the Chinese TV industry and social media. Recent years have witnessed the export of several popular costume dramas abroad such as Zhenhuan Zhuan (Empresses in the Palace), Yanxi Gong Lue (Story of Yanxi Palace), and Lang Ya Bang (Nirvana in Fire). They have attracted the attention of overseas audiences.
This chapter investigates the English translation of the Chinese costume drama Zhenhuan Zhuan by adopting a multimodal discourse analysis approach. Being the first costume drama of its kind officially exported overseas, it has aroused great interest from an international audience (Liu and Zhang 2017, 64). This drama is about the growth of the girl Zhenhuan in the harem of the Qing Dynasty, transforming from an innocent girl to a sophisticated empress dowager. Netflix, the largest media service provider in the United States, released its English-language version in 2014. Due to its high recognition, and the intersemiotic nature and translation quality assured by Netflix, this costume drama serves as good material for translation studies. In recent years, scholars have carried out research on the English translation of this drama. Shi and Fu (2016) took a practical approach and focused on the losses and gains in translating addressing terms and suggested several translation techniques like direct translation and addition to produce more effective translation products. Taking a sociological approach, Xiong (2016) discussed possible reasons for shortening the originally rather long drama into six episodes and commented positively on the translation provided by Netflix. However, very little research has been done on the abundant non-verbal modes of this drama, and even less in relation to translation. It is hoped that by investigating the interaction between verbal and non-verbal modes, this study will be able to systematically map out the intersemiotic shifts in this TV drama and examine the role of non-verbal modes in the translation process. Specific research questions of this study are as follows: (1) What intersemiotic shifts could be observed in the subtitle translation of Zhenhuan Zhuan? (2) Why do the shifts happen, and what are the roles of the non-verbal modes?