Welcome to CQPweb at Beijing Foreign Studies University
Please contact Dr. Jiajin XU for access

Please select a corpus below to enter.
(Both user ID and password are "test" for guest users.)
DEAP Family (Database of English for Academic Purposes)
 
AgriDEAP (5M words of agriculture English research articles, created by Jing Lǚ, SCAU)
 
 
BioDEAP (5M words of life science English research articles, created by Gong Peng, UCAS)
 
 
ChemDEAP (5M words of chemistry English research articles, created by Lanfeng Zhong , UJS)
 
 
CivDEAP (5M words of civil engineering English research articles, created by Baicheng Zhang, CQJTU)
 
 
EconDEAP (6M words of economics English research articles, created by Xia Liu, SWUFE)
 
 
EduDEAP (5M words of education English research articles, created by Li Wang, SHNU)
 
 
GeoDEAP (6M words of geography English research articles, created by Lei Liu, YSU)
 
 
InfoDEAP (5M words of information science English research articles, created by Yaochen Deng, DLUFL)
 
 
LinDEAP (5M words of linguistics English research articles, created by Zhantin Bu, QDU)
 
 
LitDEAP (5M words of literary studies English research articles, created by Tao Yu, JSNU)
 
 
MatDEAP (5M words of materials science English research articles, created by Pengfei Yan, BIT)
 
 
MedDEAP (5M words of medicine English research articles, created by Xin Feng, FJMU)
 
 
MilDEAP (5M words of junshi kexue English research articles, created by Xiaolei Ma)
 
 
PhilDEAP (5M words of philosophy English research articles, created by Zhantin Bu, QDU)
 
 
PolDEAP (5M words of political science English research articles, created by Guobing Liu, HNU)
 
 
PsyDEAP (6M words of psychology English research articles, created by Jiehui Hu, UESTC)
 
  
English corpora
 
Brown corpus (AmE 1961)
 
 
Business English Corpus (2M words, created by Lifei Wang, UIBE)
 
 
Contemporary College English (textbook, for in-house use only)
 
 
China Daily Political News 2011
 
 
CLOB corpus (Brown family, BrE 2009, , created by Jiajin Xu et al, BFSU)
 
 
COLEN (textbook corpus)
 
 
Crown corpus (Brown family, AmE 2009, created by Jiajin Xu et al, BFSU)
 
 
Novels by Charles Dickens
 
 
Durban Climate Talks Corpus (China Daily & New York Times)
 
 
Friends (Sitcom transcripts)
 
 
The Independent Corpus (2009-2015, ca. 231 million words)
 
 
MedAca (Medical English discourse of Academia) Corpus, 1M words, created by Xin Feng et al, FJMU
 
 
NESSIE Corpus 1st release (NESSIEv1, Native English Speakers Similarly or Identically-prompted Essays, , created by Jiajin Xu, BFSU)
 
 
NESSIE Corpus 2nd release (NESSIEv2, Native English Speakers Similarly or Identically-prompted Essays, , created by Jiajin Xu, BFSU)
 
 
PATTIE corpus (Preschoolers- and Teenagers-oriented Texts in English, created by Jie Ji, CFAU)
 
 
TED Speeches (En)
 
 
TIME Magazine Corpus (1923-2008,ca. 196 million words)
 
 
Learner English corpora
 
Chinese Learners English Corpus (CLEC, copyright-protected, in-house use only)
 
 
The TECCL corpus V1.1 (Ten-thousand English Compositions of Chinese Learners, created by Jiajin Xu, BFSU)
 
 
WECCL 2 (Written part of SWECCL 2, copyright-protected, in-house use only)
 
Parallel corpora
 
Babel Parallel Corpus (en->cn)
 
 
Babel Parallel Corpus (cn->en)
 
 
TED Speeches (cn->en)
 
 
TED Speeches (en->cn)
 
  
Translated English corpora
 
Hong Lou Meng (Trans by Xianyi Yang and Gladys Yang)
 
  
Corpora of European Languages
 
Chinese Learners Icelandic Corpus (CLIC2012, created by Shuhui Wang, BFSU)
 
 
Griechische Nachrichten Korpus
 
 
Grimm Maerchen (Grimms Fairy Tales)
 
 
Hong Lou Meng (Russian Translation)
 
 
Icelandic Parsed Historical Corpus (IcePaHC, PoS tagged version)
 
 
Icelandic Theses by Native Icelandic Speakers (NativeICE)
 
 
Spanish News Corpus, created by Yuanqi Liu, BFSU
 
 
Spanish Novel Corpus, created by Yuanqi Liu, BFSU
 
 
Spoken Spanish Corpus
 
 
Strafgesetzbuch (The German Penal Code)
 
 
Sunzi Kunst des Krieges (German translation of Sunzi Bingfa)
 
 
German version of Twilight by Stephenie Meyer (for in-house use only)
 
 
Spanish Novels by Award-winning Writers (CNEPH v1.1, created by Yuanqi Liu, BFSU)
 
  
Corpora of Other Asian Languages
 
United Nations Corpus (Arabic)
 
 
Welcome Speech by President of Tokyo University (test data)
 
 
Translated Chinese corpora
 
The Contemporary Chinese Translated Fiction Corpus (CCTFC), created by Xianyao Hu, SWU
 
 
ZCTC corpus (ZJU Corpus of Translational Chinese, created by Richard Xiao)
 
 
Original Chinese corpora
 
Lancaster Corpus of Mandarin Chinese version 1 (LCMCv1, Brown family, 1991, created by Richard Xiao)
 
 
Lancaster Corpus of Mandarin Chinese version 2 (LCMCv2) , created by Richard Xiao
 
 
Works of Mo Yan (Chinese Nobel Laureate for Literature) (for in-house use only)
 
 
TORCH2009 (Texts of Recent Chinese, Brown family, 2009, 2013 summer edition, created by Jiajin Xu, BFSU)
 
 
The UCLA Corpus of Written Chinese (2nd edition), created by Hongyin Tao, UCLA
 
 
System messages
2020-10-05 BFSU CQPweb
The BFSU CQPweb was maintained by Prof. Jiajin Xu and Dr.
Liangping Wu of the National Research Centre for Foreign Language
Education and the National Research Centre for State Language
Capacity, Beijing Foreign Studies University, China.
2019-07-06 How to cite
Please refer to 'Andrew Hardie. 2012. CQPweb - combining power,
flexibility and usability in a corpus analysis tool. IJCL 17(3):
380-409' and 'Jiajin Xu & Liangping Wu. 2014. Web-based fourth
generation corpus analysis tools and the BFSU CQPweb case, Waiyu
Dianhua Jiaoxue [Computer-assisted Foreign Language Education]
(5)' for English and Chinese introductions to CQPweb.
2014-01-26 A new metric (Effect Size or %DIFF) of keyword analysis
Recently we have implemented a new complementary metric (Effect Size or %DIFF) to log likelihood ratio (LL) of keyword computation proposed by Dr. Costas Gabrielatos and Anna Marchi. Please refer to http://repository.edgehill.ac.uk/4100/ and http://repository.edgehill.ac.uk/4196/ for explanations about Effect Size of keyword analysis.
2012-10-06 Disclaimer
The corpora mounted at our site are for academic purposes only. Please let us know, if any of the texts contained in our corpora might cause any potential infringement of your copyright. We will remove the portion of text(s) asap.

CQPweb v3.0.7 © 2008-2012 [Admin logon] You are not logged in