Resources

Corpus and computational linguistics tools for translation research

Corpus Resources

Comprehensive corpora for translation and linguistic research

COCA

Corpus of Contemporary American English

A vast resource for linguistic research, offering a wide range of texts for analysis. One of the largest corpora of American English.

American English Large Corpus
Access Corpus

Sketch Engine

Corpus Analysis Platform

A powerful tool for corpus analysis, with access to over 500 corpora in more than 90 languages. Advanced word sketch and collocation features.

Multi-language Analysis Tools
Visit Website

British National Corpus

BNC

A large text corpus of written and spoken English from diverse sources. Essential resource for British English studies.

British English Spoken & Written
Access BNC

TED Talks Corpus

Spoken Language Resource

A resource for studying spoken English and its translation, containing transcripts of TED Talks and their translations.

Spoken Language Subtitles
Learn More

OpenSubtitles

Movie & TV Subtitles Corpus

A collection of subtitles for movies and TV shows, useful for linguistic analysis and translation studies.

Subtitles Multilingual
Visit Website

Project Gutenberg

Free eBooks Library

Over 60,000 free eBooks for linguistic and translation studies. Excellent source for literary texts.

Literature Free Access
Browse Library

Computational Linguistics Tools

Software and platforms for NLP and text analysis

NLTK

Natural Language Toolkit

A leading platform for building Python programs to work with human language data. Essential for NLP research.

Python NLP Open Source
Documentation

spaCy

Industrial-Strength NLP

An advanced NLP library in Python for processing and understanding large volumes of text. Fast and production-ready.

Python Fast Production
Get Started

Stanford CoreNLP

NLP Toolkit

A suite of NLP tools for performing a wide range of linguistic analysis tasks. Java-based with multiple language support.

Java Comprehensive
Documentation

Transformers

Hugging Face Library

A library that provides general-purpose architectures for natural language understanding and generation. State-of-the-art models.

Deep Learning Transformers SOTA
Explore Models

TensorFlow

Machine Learning Platform

An open-source platform for machine learning, widely used in computational linguistics research.

Machine Learning Neural Networks
Get Started

ACL Anthology

Research Paper Archive

A digital archive of research papers in computational linguistics, providing access to thousands of publications.

Papers Research Archive
Browse Papers

Need Help Finding Resources?

Can't find what you're looking for? We're happy to help you identify the right resources for your research needs.

Contact Us