Mostrar el registro sencillo del ítem
Characterization of Institutional Texts for an Automated Golden Standard: Enhancing Machine Translation Quality Assessment between English and Spanish
dc.contributor.author | Romana García, María Luisa | es-ES |
dc.contributor.author | Hernández Pardo, Blanca | es-ES |
dc.date.accessioned | 2025-06-26T12:19:07Z | |
dc.date.available | 2025-06-26T12:19:07Z | |
dc.date.issued | 2024-07-06 | es_ES |
dc.identifier.uri | http://hdl.handle.net/11531/99668 | |
dc.description | Presentación en congreso | es_ES |
dc.description.abstract | . | es-ES |
dc.description.abstract | The purpose of this paper is to collect a set of features that can contribute to the linguistic characterization of the institutional textual genre. The aim is to describe as exhaustively as possible the archetypal text to be obtained as a target text in this type of specialized translation. The tools used were Orange Data Mining© and Google Colab (Python code), and the data was obtained using the following processing mechanisms: word cloud, text preprocessing (cleaning, tokenization, normalization, lemmatization and PoS annotation). With these tools, lexical and grammatical frequencies, lexical and documentary embeddings, cosine distances, hierarchical clustering, and 20-component dimensionality reduction (t-SNE) were extracted. As a result, a series of useful descriptive parameters have been obtained for the characterization of model texts for economic translation of institutional domains into Spain Spanish: lexical and terminological density, phraseological and terminological lexicalizations, grammatical frequencies, and semantic maps. In conclusion, the study provides several quantifiable features that characterize the analyzed register and opens the way for further research to deepen these parameters and develop the research by searching for complementary parameters until a complete and exhaustive picture of the reference model in this genre is obtained. | en-GB |
dc.format.mimetype | application/pdf | es_ES |
dc.language.iso | en-GB | es_ES |
dc.rights | es_ES | |
dc.rights.uri | es_ES | |
dc.source | Descripción: International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024) Página Inicio: 138 Página Fin: 155 | es_ES |
dc.title | Characterization of Institutional Texts for an Automated Golden Standard: Enhancing Machine Translation Quality Assessment between English and Spanish | es_ES |
dc.type | info:eu-repo/semantics/other | es_ES |
dc.description.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.rights.holder | politica editorial | es_ES |
dc.rights.accessRights | info:eu-repo/semantics/restrictedAccess | es_ES |
dc.keywords | . | es-ES |
dc.keywords | Machine Translation, Golden Standard, Translation Quality Assessment, Specialized Translation, AI Processing. | en-GB |
Ficheros en el ítem
Este ítem aparece en la(s) siguiente(s) colección(ones)
-
Artículos
Artículos de revista, capítulos de libro y contribuciones en congresos publicadas.