The Corpora of Vietnamese Texts was completed by Giang Pham (formerly Giang Tang) under the supervision of Kathryn Kohnert, Ph.D. CCC-SLP. Funding was provided by the Graduate Research Partnership Program in the Department of Speech-Language-Hearing Sciences at the University of Minnestoa. ** When printing, please note that this document is over 270 pages. **

Please cite this work using the following reference:

Pham, G., Kohnert, K., & Carney, E. (2008). Corpora of Vietnamese Texts: Lexical Effects of Intended Audience and Publication Place. Behavior Research Methods, 40, 154-163. Vietnamese Newspaper Corpus Word Frequency List of 851,174 total words

# Occurrence Percent Word 11479 1.34% cu3a 11221 1.31% và 9225 1.08% là 8977 1.05% có 8116 0.95% mô5t 7893 0.92% các 7553 0.88% trong 7196 0.84% cho 6880 0.80% ddã 6836 0.80% không 5700 0.67% nhu'4ng 5659 0.66% ngu'o'2i 5625 0.66% ddu'o'5c 4908 0.57% vo'1i 4462 0.52% ddê3 4077 0.48% công 3967 0.46% này 3945 0.46% dân 3667 0.43% ra 3643 0.43% nhu' 3638 0.42% ddê1n 3616 0.42% vào 3605 0.42% cu4ng 3534 0.41% chính 3404 0.40% vê2 3399 0.40% o'3 3326 0.39% na(m 3276 0.38% ông 3274 0.38% khi 3234 0.38% tôi 3218 0.38% quô1c 3118 0.36% pha3i 3118 0.36% ddó 3108 0.36% ta5i 3046 0.36% làm 2950 0.34% nam 2938 0.34% bi5 2916 0.34% nhà

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 1 2891 0.34% nu'o'1c 2854 0.33% tu'2 2843 0.33% la5i 2771 0.32% thê3 2743 0.32% su'5 2721 0.32% nhiê2u 2691 0.31% thành 2657 0.31% se4 2656 0.31% thê1 2641 0.31% viê5t 2575 0.30% nhân 2575 0.30% chi3 2530 0.30% sô1 2487 0.29% trên 2463 0.29% gia 2448 0.29% còn 2441 0.29% ngày 2364 0.28% mà 2345 0.27% viê5c 2330 0.27% thì 2257 0.26% quyê2n 2244 0.26% hô5i 2240 0.26% ddâ2u 2239 0.26% nhu'ng 2224 0.26% do 2182 0.25% sau 2172 0.25% chu3 2159 0.25% vì 2155 0.25% theo 2087 0.24% ddô2ng 2075 0.24% ho5c 2042 0.24% quan 2021 0.24% trung 2017 0.24% ddô5ng 1969 0.23% ca3 1967 0.23% ho'n 1879 0.22% nói 1868 0.22% ky2 1844 0.22% ho5 1819 0.21% viên 1808 0.21% tru'o'2ng 1807 0.21% hiê5n 1804 0.21% anh 1795 0.21% bô5 1777 0.21% qua 1772 0.21% con 1768 0.21% cuô5c 1765 0.21% hai 1732 0.20% biê1t 1730 0.20% ddi 1726 0.20% lên 1699 0.20% chúng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 2 1679 0.20% hoa 1663 0.19% ddang 1659 0.19% ddiê2u 1650 0.19% ra(2ng 1648 0.19% sinh 1636 0.19% tê1 1622 0.19% tru'o'1c 1605 0.19% ddây 1602 0.19% cách 1581 0.18% nay 1577 0.18% nhâ1t 1567 0.18% vu5 1550 0.18% mo'1i 1549 0.18% hay 1538 0.18% ddi5nh 1533 0.18% my4 1532 0.18% khác 1515 0.18% hàng 1481 0.17% gio'1i 1480 0.17% mình 1447 0.17% tu'5 1445 0.17% hành 1443 0.17% nào 1436 0.17% ddô1i 1423 0.17% thu3 1418 0.17% tháng 1395 0.16% chu'1c 1389 0.16% râ1t 1386 0.16% nhâ5n 1373 0.16% tho'2i 1371 0.16% tình 1363 0.16% dda5i 1344 0.16% va(n 1329 0.16% phát 1328 0.16% 2006 1321 0.15% pháp 1316 0.15% ta 1315 0.15% tu' 1314 0.15% cô5ng 1309 0.15% báo 1299 0.15% nên 1295 0.15% lý 1268 0.15% kinh 1259 0.15% nô5i 1253 0.15% ddê2 1241 0.14% cùng 1237 0.14% vâ4n 1236 0.14% tiê1p 1225 0.14% tin 1219 0.14% co' 1217 0.14% kê1t 1205 0.14% sa3n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 3 1187 0.14% chiê1n 1177 0.14% thu'1 1163 0.14% 1 1161 0.14% ddô5 1160 0.14% thông 1150 0.13% an 1150 0.13% liên 1150 0.13% cao 1145 0.13% trình 1140 0.13% 2 1131 0.13% ba3n 1127 0.13% gia3i 1120 0.13% ba(2ng 1114 0.13% tiê2n 1106 0.13% ý 1103 0.13% quân 1102 0.13% hình 1092 0.13% ho'5p 1087 0.13% dda3ng 1086 0.13% giá 1079 0.13% câ2u 1076 0.13% thu'5c 1075 0.13% ddô5i 1075 0.13% tri5 1073 0.13% giáo 1062 0.12% bình 1058 0.12% tâm 1054 0.12% quyê1t 1050 0.12% toàn 1044 0.12% ddâ1t 1034 0.12% nê1u 1033 0.12% phâ2n 1018 0.12% ba 1001 0.12% thi 999 0.12% lu'5c 998 0.12% tô3 993 0.12% thâ1y 991 0.12% lâ2n 985 0.12% dda5o 979 0.11% tranh 975 0.11% tô3ng 971 0.11% du'5 968 0.11% xuâ1t 966 0.11% lo'1n 952 0.11% tài 938 0.11% vi5 933 0.11% chê1 932 0.11% bà 928 0.11% hà 924 0.11% câ2n 920 0.11% thô1ng 917 0.11% ba(1t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 4 915 0.11% ngoài 915 0.11% câ1p 911 0.11% ddu'o'2ng 899 0.11% nguyê4n 893 0.10% 3 891 0.10% si4 886 0.10% luâ5t 884 0.10% chu'a 883 0.10% gio'2 882 0.10% thu'o'2ng 881 0.10% ddu'1c 880 0.10% tro5ng 874 0.10% nhiên 873 0.10% tham 868 0.10% tro'3 864 0.10% ddiê3m 862 0.10% em 861 0.10% sách 855 0.10% thi5 847 0.10% cái 846 0.10% sô1ng 846 0.10% lãnh 846 0.10% ddình 845 0.10% gian 844 0.10% lo'2i 842 0.10% ba3o 842 0.10% tiê1ng 839 0.10% to'1i 837 0.10% muô1n 836 0.10% gì 836 0.10% ta(ng 833 0.10% lúc 829 0.10% lâ5p 828 0.10% minh 807 0.09% qua3 806 0.09% ddâ1u 806 0.09% phu'o'ng 802 0.09% bâ1t 802 0.09% 7 797 0.09% ma5nh 793 0.09% thu'o'ng 789 0.09% ban 788 0.09% ngoa5i 786 0.09% tác 776 0.09% nghi5 775 0.09% tay 771 0.09% vâ1n 769 0.09% ddê2u 769 0.09% ai 768 0.09% ma(5t 766 0.09% xã 759 0.09% thanh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 5 756 0.09% hê5 749 0.09% quá 746 0.09% rô2i 744 0.09% vâ5y 741 0.09% ty 735 0.09% ma5ng 730 0.09% lu'o'5ng 729 0.09% trâ5n 725 0.08% na(ng 718 0.08% tuy 717 0.08% nguyên 717 0.08% triê5u 716 0.08% 5 716 0.08% tru'o'3ng 715 0.08% du5ng 713 0.08% nghiê5p 710 0.08% phòng 707 0.08% tên 707 0.08% tiêu 705 0.08% vn 704 0.08% ba5n 704 0.08% bô1 704 0.08% tiê1n 701 0.08% xe 701 0.08% ddu'a 698 0.08% thu'1c 696 0.08% tính 692 0.08% biê3u 691 0.08% chí 689 0.08% tuô3i 687 0.08% hóa 686 0.08% nhau 686 0.08% giu'4a 685 0.08% bo3 683 0.08% 6 679 0.08% án 676 0.08% tre3 670 0.08% thay 669 0.08% thâ5t 668 0.08% tu5c 668 0.08% ddiê5n 666 0.08% cu'3 664 0.08% bê5nh 660 0.08% gâ2n 660 0.08% diê4n 656 0.08% cô 656 0.08% mo5i 655 0.08% ngay 655 0.08% triê3n 654 0.08% chuyê5n 654 0.08% thái 652 0.08% 4

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 6 651 0.08% a3nh 649 0.08% nu'4a 648 0.08% ca3nh 644 0.08% a(n 640 0.07% cáo 639 0.07% giao 638 0.07% bao 638 0.07% thêm 636 0.07% tâ1t 636 0.07% lo'5i 636 0.07% khoa3ng 635 0.07% bài 635 0.07% dù 634 0.07% tìm 632 0.07% phô1 629 0.07% bán 629 0.07% vu'2a 623 0.07% ddi5a 622 0.07% hôm 618 0.07% ddô5c 615 0.07% ddông 613 0.07% loa5i 612 0.07% viê5n 610 0.07% no'i 608 0.07% ta5o 608 0.07% máy 606 0.07% khó 605 0.07% chô1ng 604 0.07% hê1t 600 0.07% tra3 600 0.07% pha5m 597 0.07% ba(1c 595 0.07% go5i 595 0.07% phu3 593 0.07% sao 592 0.07% ddô3i 591 0.07% ddoàn 585 0.07% tu'2ng 584 0.07% châu 583 0.07% tâ1n 582 0.07% áp 581 0.07% mô4i 571 0.07% tiên 571 0.07% khu 570 0.07% gây 567 0.07% thu 560 0.07% luâ5n 560 0.07% cu'1 556 0.06% ddá 555 0.06% hoàn 555 0.06% ti3nh 555 0.06% 10

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 7 553 0.06% ký 553 0.06% chi 552 0.06% bên 551 0.06% khai 549 0.06% hô2 548 0.06% tra 548 0.06% kê3 547 0.06% ddánh 547 0.06% thân 545 0.06% vô 542 0.06% so'3 539 0.06% yêu 539 0.06% hô5 539 0.06% chung 537 0.06% cuô1i 533 0.06% nhâ5p 532 0.06% tô5i 532 0.06% di5ch 530 0.06% ho3i 529 0.06% 000 529 0.06% hoa5t 527 0.06% su'1c 526 0.06% mu'1c 526 0.06% chuyên 523 0.06% vùng 523 0.06% tâ5p 522 0.06% nghi4a 521 0.06% phi 517 0.06% nhiê5m 517 0.06% sát 515 0.06% hiê5u 511 0.06% su'3 509 0.06% doanh 506 0.06% ca3m 505 0.06% khách 505 0.06% nhâ5t 503 0.06% vu'5c 499 0.06% càng 499 0.06% giúp 499 0.06% diê5n 497 0.06% xét 493 0.06% truyê2n 491 0.06% dda(5c 489 0.06% lòng 488 0.06% xem 487 0.06% vâ5n 487 0.06% tuyê3n 486 0.06% gia3 486 0.06% mu5c 484 0.06% vu4 483 0.06% kê1 483 0.06% biê5t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 8 482 0.06% dda 481 0.06% nó 481 0.06% sang 479 0.06% xin 479 0.06% mua 475 0.06% hoa(5c 474 0.06% phu5 470 0.05% chu'1ng 468 0.05% sáng 467 0.05% thiê1t 466 0.05% dâ4n 464 0.05% châ1t 462 0.05% bóng 461 0.05% mâ1t 461 0.05% csvn 460 0.05% thuô5c 460 0.05% kiê1n 458 0.05% lâ1y 456 0.05% bush 456 0.05% ddáng 455 0.05% rõ 455 0.05% tích 452 0.05% câu 451 0.05% cu'1u 450 0.05% ddu3 449 0.05% niên 449 0.05% vâ5t 449 0.05% thích 447 0.05% tra5ng 446 0.05% ít 444 0.05% dài 444 0.05% xây 442 0.05% bàn 442 0.05% danh 441 0.05% cô1 438 0.05% bo'3i 437 0.05% du'5ng 437 0.05% viê1t 436 0.05% dâ2u 435 0.05% hô2i 433 0.05% tro'5 432 0.05% trang 431 0.05% tô1t 429 0.05% tu'o'ng 429 0.05% hòa 429 0.05% ha3i 427 0.05% thu' 426 0.05% lan 425 0.05% du'o'1i 425 0.05% trách 425 0.05% vi 424 0.05% tây

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 9 422 0.05% hâ5u 422 0.05% vê5 422 0.05% khí 421 0.05% nhóm 420 0.05% ddo'2i 420 0.05% li5ch 419 0.05% chu'o'ng 418 0.05% nga 417 0.05% giu'4 416 0.05% na5n 416 0.05% ddu'1ng 415 0.05% vo5ng 413 0.05% câ2m 412 0.05% di 410 0.05% tu'o'3ng 410 0.05% ddo'n 410 0.05% hiê5p 410 0.05% tha(1ng 408 0.05% ti5ch 406 0.05% chi5 406 0.05% pha3n 405 0.05% su' 405 0.05% ga(5p 405 0.05% mo'3 404 0.05% 8 403 0.05% thiê1u 402 0.05% tu'o'1ng 402 0.05% chê1t 402 0.05% tuâ2n 401 0.05% vòng 401 0.05% kiê5n 400 0.05% dùng 399 0.05% y 398 0.05% ddúng 398 0.05% khoa 398 0.05% phút 398 0.05% hoa5ch 397 0.05% gia3m 396 0.05% u'1ng 395 0.05% mang 395 0.05% tu'1c 395 0.05% dda(5t 393 0.05% phâ3m 393 0.05% trâ2n 392 0.05% biê1n 391 0.05% 20 391 0.05% tinh 391 0.05% tuyên 390 0.05% phía 390 0.05% chi5u 387 0.05% â1n 386 0.05% hoàng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 10 385 0.04% thâ2n 385 0.04% nhìn 384 0.04% ha5 384 0.04% trí 383 0.04% xu'3 383 0.04% tu'3 380 0.04% hu'o'3ng 378 0.04% â1y 378 0.04% ddô 378 0.04% phu5c 377 0.04% khá 376 0.04% xa3y 376 0.04% tôn 374 0.04% ngành 374 0.04% riêng 374 0.04% ddòi 374 0.04% phong 374 0.04% kha3 372 0.04% nô3i 372 0.04% thuâ5t 372 0.04% giám 371 0.04% khiê1n 370 0.04% nho3 369 0.04% vài 366 0.04% xuô1ng 365 0.04% cung 365 0.04% sân 363 0.04% quy 363 0.04% lê5 362 0.04% luôn 362 0.04% kho3i 361 0.04% cô3 360 0.04% âu 359 0.04% liê5u 358 0.04% châ1p 358 0.04% phí 357 0.04% ddâ2y 357 0.04% lo 357 0.04% chiê1c 357 0.04% kiê3m 355 0.04% chuyê3n 353 0.04% tu'o'5ng 353 0.04% lê 351 0.04% ddâu 351 0.04% kim 350 0.04% hiê3u 349 0.04% nu'4 349 0.04% ddài 347 0.04% le4 347 0.04% chân 347 0.04% phân 346 0.04% du5c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 11 345 0.04% miê2n 342 0.04% phá 342 0.04% dde5p 341 0.04% ky4 340 0.04% me5 339 0.04% hu'4u 338 0.04% hu'o'1ng 337 0.04% xa 337 0.04% tô1 337 0.04% ddô1c 336 0.04% tp 335 0.04% ca 333 0.04% v 332 0.04% ca(n 331 0.04% yê1u 331 0.04% tiê3u 330 0.04% khô1i 330 0.04% nghi4 329 0.04% 07 329 0.04% phóng 328 0.04% nê2n 327 0.04% tô1i 327 0.04% môn 326 0.04% xác 325 0.04% ha5i 324 0.04% cu5 324 0.04% tô5c 323 0.04% 30 323 0.04% phó 323 0.04% ddào 321 0.04% bác 321 0.04% ddàn 320 0.04% khu3ng 316 0.04% nghe 315 0.04% u3ng 315 0.04% cha5y 315 0.04% tt 315 0.04% iraq 315 0.04% ha5n 315 0.04% qua3n 314 0.04% lao 314 0.04% cho5n 313 0.04% ke3 312 0.04% ngu’o’2i 312 0.04% chuâ3n 310 0.04% cá 310 0.04% 11 308 0.04% tiê1t 308 0.04% lâu 308 0.04% ty3 307 0.04% ddóng 307 0.04% 9

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 12 306 0.04% ninh 305 0.04% du 305 0.04% na(2m 304 0.04% ma(1t 302 0.04% nhu’4ng 302 0.04% binh 302 0.04% hàn 301 0.04% to3 300 0.04% nghê5 300 0.04% á 299 0.03% la5c 299 0.03% thuâ5n 299 0.03% ti3 297 0.03% cha(3ng 296 0.03% trái 296 0.03% ngàn 295 0.03% thuô1c 295 0.03% dda5t 295 0.03% qua3ng 295 0.03% giang 294 0.03% vô1n 293 0.03% phép 292 0.03% kéo 292 0.03% tân 291 0.03% gòn 291 0.03% cho'i 291 0.03% trò 290 0.03% (lên 290 0.03% ma(5c 290 0.03% nông 289 0.03% ho5p 289 0.03% biên 289 0.03% pha5t 289 0.03% cha(1c 288 0.03% so 288 0.03% nguy 288 0.03% cup 287 0.03% biê3n 287 0.03% nghiên 287 0.03% world 287 0.03% ky3 286 0.03% sài 284 0.03% bay 283 0.03% gái 281 0.03% cu'5c 281 0.03% vàng 281 0.03% 12 280 0.03% cu'3a 280 0.03% du'o'ng 279 0.03% so' 279 0.03% thí 278 0.03% bâ2u

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 13 277 0.03% o’3 277 0.03% ca5nh 276 0.03% thu'o'5ng 276 0.03% hoà 275 0.03% làng 274 0.03% mong 273 0.03% dê4 272 0.03% israel 272 0.03% sáu 272 0.03% ma5i 270 0.03% nghiê5m 270 0.03% nguô2n 270 0.03% mai 269 0.03% hcm 269 0.03% hâ2u 268 0.03% coi 268 0.03% góp 268 0.03% 2005 267 0.03% ngân 265 0.03% tru'5c 264 0.03% giâ1y 264 0.03% na(5ng 264 0.03% hy 263 0.03% tín 263 0.03% tù 262 0.03% gu'3i 260 0.03% buô5c 259 0.03% bang 259 0.03% chiê2u 258 0.03% mâ1y 258 0.03% rô5ng 258 0.03% bào 258 0.03% nghèo 257 0.03% quê 257 0.03% duy 256 0.03% kha(n 256 0.03% lo'1p 256 0.03% thoa5i 256 0.03% lai 255 0.03% cán 255 0.03% lu'u 254 0.03% chiê1m 254 0.03% ddu’o’5c 252 0.03% dda3o 251 0.03% 15 250 0.03% thâ1t 249 0.03% nhanh 249 0.03% lô5 248 0.03% ghi 248 0.03% chu'1 247 0.03% chia 246 0.03% tro'2i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 14 246 0.03% dung 245 0.03% ca3i 244 0.03% ddô2 244 0.03% phe 243 0.03% thuyê2n 242 0.03% kêu 241 0.03% toán 241 0.03% gô2m 240 0.03% thiê5t 238 0.03% u'o'1c 237 0.03% sai 237 0.03% nho'2 237 0.03% huyê5n 236 0.03% hãy 236 0.03% dda(ng 235 0.03% khâ3u 235 0.03% thiên 235 0.03% mô1i 234 0.03% 22 233 0.03% ngôn 233 0.03% nha(2m 232 0.03% iran 232 0.03% phim 230 0.03% lê4 230 0.03% to'2 230 0.03% thiê5n 229 0.03% hùng 229 0.03% cu'o'2ng 228 0.03% so'5 228 0.03% tha(m 227 0.03% ddâ3y 227 0.03% ddem 227 0.03% ddoa5n 227 0.03% huy 226 0.03% lu'o'ng 226 0.03% bu'1c 226 0.03% ba3y 225 0.03% cha 225 0.03% hiê3m 224 0.03% nghê2 222 0.03% biê5n 222 0.03% ddô3 222 0.03% linh 222 0.03% 23 222 0.03% chô2ng 222 0.03% xuân 222 0.03% ddà 220 0.03% tha(3ng 219 0.03% ddáp 219 0.03% so'n 219 0.03% ba(ng 219 0.03% thôi

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 15 219 0.03% sa(1c 218 0.03% ddêm 217 0.03% ba5o 217 0.03% 100 216 0.03% buô3i 216 0.03% nha5c 215 0.03% hãng 215 0.03% cu' 215 0.03% tu5 214 0.03% máu 214 0.03% ngu'o'5c 214 0.03% hu'o'ng 213 0.02% cho'2 213 0.02% xuyên 213 0.02% vo’1i 212 0.02% chô4 211 0.02% câ5p 211 0.02% ddo5c 211 0.02% tha3o 211 0.02% bí 210 0.02% cu'5u 210 0.02% ngo'2 210 0.02% chú 209 0.02% du4ng 209 0.02% trào 208 0.02% 0 208 0.02% ddích 207 0.02% tru'o'ng 206 0.02% long 205 0.02% hô2ng 205 0.02% ddau 205 0.02% suô1t 204 0.02% nhu4ng 204 0.02% ngo5c 203 0.02% thúc 203 0.02% nga5i 203 0.02% lê5nh 202 0.02% xúc 201 0.02% tra(m 201 0.02% t 201 0.02% bu'o'1c 201 0.02% in 200 0.02% tâ5n 199 0.02% dành 199 0.02% rút 199 0.02% phúc 199 0.02% quang 199 0.02% cây 198 0.02% khô3 198 0.02% sa(1p 196 0.02% gio’1i 196 0.02% nô3

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 16 195 0.02% thuyê1t 195 0.02% 2004 194 0.02% câ1m 193 0.02% suy 193 0.02% 21 192 0.02% usd 192 0.02% âm 192 0.02% bé 192 0.02% quý 192 0.02% vo'5 191 0.02% phú 191 0.02% u3y 191 0.02% hoá 191 0.02% thâ2y 190 0.02% sâu 190 0.02% tê5 190 0.02% bày 189 0.02% ca3ng 189 0.02% 06 189 0.02% ngôi 189 0.02% thu5 188 0.02% kiê1m 188 0.02% tránh 188 0.02% nga(n 188 0.02% phiê1u 187 0.02% chu5c 187 0.02% ta5m 186 0.02% kích 186 0.02% thâ1p 185 0.02% nha 185 0.02% 50 185 0.02% to 184 0.02% lâ4n 184 0.02% qui 184 0.02% tàn 184 0.02% chuyê1n 183 0.02% da5y 183 0.02% kha(1p 183 0.02% chu’1c 183 0.02% ngô2i 183 0.02% tiê5n 183 0.02% vui 182 0.02% thuê1 182 0.02% 17 182 0.02% vu'o'5t 181 0.02% loan 181 0.02% thâ5p 180 0.02% châ1m 180 0.02% so'1m 180 0.02% can 179 0.02% dâ1u 179 0.02% kém

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 17 179 0.02% vai 179 0.02% quay 179 0.02% quâ5n 178 0.02% ddh 177 0.02% ma(1c 177 0.02% cha(1n 176 0.02% quán 176 0.02% thâ5m 176 0.02% kiê3u 176 0.02% du' 176 0.02% giành 175 0.02% wto 175 0.02% nhiê4m 175 0.02% tòa 174 0.02% song 174 0.02% tu’2 174 0.02% môi 173 0.02% tra5i 173 0.02% kính 173 0.02% mo'2i 173 0.02% bom 173 0.02% 18 172 0.02% giam 172 0.02% mô 172 0.02% ddi5ch 172 0.02% ba5c 172 0.02% liê5t 172 0.02% hô4 171 0.02% sa(4n 171 0.02% toà 171 0.02% trì 170 0.02% tàu 170 0.02% lu'o'1i 170 0.02% ác 170 0.02% giô1ng 169 0.02% tai 169 0.02% phô1i 168 0.02% ddôi 168 0.02% nho'1 167 0.02% nghi 167 0.02% luyê5n 166 0.02% trao 166 0.02% gô1c 166 0.02% ba5i 166 0.02% trích 166 0.02% lu'o'5c 165 0.02% tá 165 0.02% dda3m 164 0.02% cánh 164 0.02% 25 164 0.02% may 164 0.02% kê

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 18 164 0.02% nhu 164 0.02% na(1m 164 0.02% lành 163 0.02% la 162 0.02% niê5m 162 0.02% chu'4 162 0.02% ddo3 161 0.02% c 161 0.02% sông 160 0.02% nguyê5n 160 0.02% mùa 160 0.02% 2003 160 0.02% du'1t 160 0.02% a 160 0.02% phiên 160 0.02% ràng 159 0.02% chùa 159 0.02% soát 159 0.02% thu'o'3ng 158 0.02% quen 157 0.02% ddô5t 157 0.02% abramoff 156 0.02% palestine 156 0.02% quên 156 0.02% 16 155 0.02% trai 155 0.02% 19 155 0.02% co'2 155 0.02% tho' 154 0.02% mâ5t 154 0.02% giác 154 0.02% nha(1c 153 0.02% giai 153 0.02% al 153 0.02% thu’1 153 0.02% phô3 153 0.02% nâ2y 152 0.02% viê4n 152 0.02% hezbollah 152 0.02% úc 152 0.02% nuôi 152 0.02% ddoán 151 0.02% lâm 151 0.02% la(1m 151 0.02% internet 151 0.02% bây 150 0.02% áo 150 0.02% nô4i 149 0.02% tâ1m 149 0.02% khán 149 0.02% quanh 148 0.02% lá

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 19 148 0.02% phái 147 0.02% nu’o’1c 147 0.02% cu4 147 0.02% phê 147 0.02% phán 146 0.02% tru'2 146 0.02% tu'o'2ng 145 0.02% diê5t 145 0.02% cam 145 0.02% tri 144 0.02% sgk 144 0.02% ngu'4 144 0.02% tuâ1n 143 0.02% u'o'ng 143 0.02% nghiêm 143 0.02% thu'3 143 0.02% 24 143 0.02% thua 142 0.02% (vnn 142 0.02% tuyê1n 142 0.02% nhiêu 142 0.02% thiê5u 142 0.02% sóng 142 0.02% thú 142 0.02% tô2n 142 0.02% kha(1c 142 0.02% ba(1n 141 0.02% phâ5n 141 0.02% xong 141 0.02% xa(ng 141 0.02% dâ2n 141 0.02% xung 141 0.02% hút 141 0.02% ba3ng 141 0.02% xu'1 141 0.02% 13 140 0.02% lô1i 140 0.02% siêu 140 0.02% ba5ch 139 0.02% chiê1u 139 0.02% ddàm 139 0.02% lu5c 138 0.02% yên 138 0.02% ha5t 138 0.02% dám 137 0.02% tái 137 0.02% kiê2u 137 0.02% ddo'4 137 0.02% bè 137 0.02% hê2 137 0.02% loa5t 137 0.02% thôn

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 20 137 0.02% thù 136 0.02% di5p 136 0.02% cu5c 136 0.02% món 135 0.02% lô4i 135 0.02% giê1t 135 0.02% cu'o'1p 135 0.02% kiên 134 0.02% khám 134 0.02% cuô1n 134 0.02% già 134 0.02% 14 133 0.02% lu'3a 133 0.02% thu3y 133 0.02% uy 131 0.02% tung 131 0.02% ddu'2ng 131 0.02% ngu4 131 0.02% ddu'1a 131 0.02% cu'o'2i 129 0.02% cháu 129 0.02% u'u 129 0.02% ddu'o'ng 129 0.02% ve3 129 0.02% liê2n 128 0.01% chánh 128 0.01% chóng 128 0.01% xê1p 128 0.01% su'1 128 0.01% quy4 127 0.01% ích 127 0.01% bô2 127 0.01% dõi 126 0.01% kia 126 0.01% mo' 126 0.01% ddoa5t 126 0.01% li4nh 126 0.01% thác 126 0.01% trùn 125 0.01% kho'3i 125 0.01% ma5c 125 0.01% cho'5 125 0.01% du'5a 125 0.01% tru5 125 0.01% lang 125 0.01% dda5n 124 0.01% thâ3m 124 0.01% ô3n 124 0.01% hát 124 0.01% thi5nh 123 0.01% chu'4a 123 0.01% ca(ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 21 123 0.01% cha(5t 123 0.01% xâ1u 122 0.01% khoa3n 122 0.01% tô1n 122 0.01% huâ1n 122 0.01% vinh 122 0.01% me4 122 0.01% quâ2n 122 0.01% sàng 122 0.01% ô 121 0.01% nhiê5t 121 0.01% lính 121 0.01% nô1i 121 0.01% thoát 120 0.01% niê2m 120 0.01% ca(1t 120 0.01% ddo'5i 120 0.01% 40 120 0.01% túc 120 0.01% kha(3ng 119 0.01% ha5ng 119 0.01% 200 119 0.01% nu'3a 118 0.01% lâ2m 118 0.01% vu'4ng 118 0.01% ddt 118 0.01% bo'1t 118 0.01% co3 118 0.01% nga(1n 118 0.01% oan 118 0.01% tha3m 117 0.01% chúa 117 0.01% cu'o'ng 117 0.01% uô1ng 117 0.01% tuyê5t 117 0.01% di4 116 0.01% nâng 116 0.01% gd 116 0.01% dây 116 0.01% bo5n 115 0.01% khích 115 0.01% co'n 114 0.01% buôn 113 0.01% ddo'5t 113 0.01% ha5nh 113 0.01% bo'2 113 0.01% hào 113 0.01% nhe5 112 0.01% nghi3 112 0.01% liban 112 0.01% phâ5t 112 0.01% hiê1n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 22 112 0.01% gan 112 0.01% la5 112 0.01% ddoan 111 0.01% vnn 111 0.01% ho'i 111 0.01% ta3i 111 0.01% cân 111 0.01% hu'1a 111 0.01% da 110 0.01% su’5 110 0.01% mu'u 110 0.01% ddê2n 109 0.01% mâ4u 109 0.01% thuê 109 0.01% xu'o'ng 109 0.01% phu'o'2ng 109 0.01% khiê1u 109 0.01% mãi 108 0.01% gio3i 108 0.01% phù 108 0.01% ha(3n 108 0.01% thu'2a 108 0.01% ngang 108 0.01% ta(1c 107 0.01% tru’o’1c 107 0.01% chút 107 0.01% phan 107 0.01% bô1n 107 0.01% tha 107 0.01% chi3nh 106 0.01% lu'5a 106 0.01% lu4 106 0.01% dòng 106 0.01% nô5p 106 0.01% b 106 0.01% mu'2ng 106 0.01% chu'2ng 105 0.01% câ5u 105 0.01% ddiê3n 105 0.01% hoa3 105 0.01% ddô4 105 0.01% thu’5c 105 0.01% dden 104 0.01% cà 104 0.01% vu'o'ng 104 0.01% tâ2m 104 0.01% bão 104 0.01% hè 104 0.01% ho3a 104 0.01% 2002 104 0.01% tim 103 0.01% lâ5u

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 23 103 0.01% nêu 103 0.01% 2001 103 0.01% lo5t 103 0.01% m 103 0.01% ro'2i 102 0.01% gâ1p 102 0.01% nô4 101 0.01% dduô3i 101 0.01% ta3 101 0.01% ddón 101 0.01% bê1n 100 0.01% bãi 100 0.01% thô 100 0.01% miê1n 100 0.01% vi4nh 100 0.01% kha3o 99 0.01% ddói 99 0.01% ga(1ng 99 0.01% câ5n 99 0.01% tho3a 99 0.01% su'3a 99 0.01% n 99 0.01% uy3 99 0.01% cha(5n 99 0.01% dda(1c 99 0.01% h 99 0.01% du5 98 0.01% co’ 98 0.01% ha5ch 98 0.01% triê2u 97 0.01% putin 97 0.01% hamas 97 0.01% giàu 97 0.01% ép 97 0.01% tâ2ng 97 0.01% chô1i 97 0.01% hô1i 97 0.01% ta5p 97 0.01% thi5t 96 0.01% trúng 96 0.01% la5t 96 0.01% nóng 96 0.01% ghê1 96 0.01% ddê5 96 0.01% loa5n 95 0.01% ho’n 95 0.01% xanh 95 0.01% ga(1n 95 0.01% ru'2ng 95 0.01% chào 95 0.01% võ 95 0.01% tho'2

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 24 95 0.01% màu 95 0.01% cs 95 0.01% ddâ1y 94 0.01% lái 94 0.01% tru’o’3ng 94 0.01% kg 94 0.01% kháng 94 0.01% ha(2ng 94 0.01% no'5 94 0.01% dde 94 0.01% li 93 0.01% dàng 93 0.01% na(4ng 93 0.01% nha(1m 93 0.01% 60 93 0.01% nhu’ 92 0.01% tru'2ng 92 0.01% tra3i 92 0.01% 80 92 0.01% ho5a 92 0.01% re3 92 0.01% ca3n 92 0.01% lúa 92 0.01% thánh 91 0.01% ddu5ng 91 0.01% bù 91 0.01% tru'ng 91 0.01% ngu3 91 0.01% miê4n 91 0.01% ô1ng 91 0.01% ddám 91 0.01% ddâ5p 90 0.01% ro'i 90 0.01% banh 90 0.01% hoa3ng 90 0.01% râ5p 90 0.01% la(1ng 90 0.01% ddi3nh 90 0.01% ôm 90 0.01% bâ5c 89 0.01% go'4 89 0.01% thu'a 89 0.01% mu'a 89 0.01% che 89 0.01% vu'o'2n 89 0.01% ly 88 0.01% mâ5u 88 0.01% to’1i 88 0.01% xô 88 0.01% gió 88 0.01% la5nh 88 0.01% bô3

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 25 88 0.01% 28 88 0.01% dda(3ng 88 0.01% súng 88 0.01% huê1 88 0.01% rice 88 0.01% sa 88 0.01% ngoái 87 0.01% hu3y 87 0.01% he5n 87 0.01% truy 87 0.01% da5 86 0.01% nxbgd 86 0.01% (theo 86 0.01% bi 86 0.01% châ1n 86 0.01% mô5 86 0.01% tra5m 86 0.01% thô3 85 0.01% hài 85 0.01% mê 85 0.01% du'4 85 0.01% xu'a 85 0.01% hâ1p 85 0.01% la(5ng 84 0.01% ánh 84 0.01% do5a 84 0.01% ma 84 0.01% thuâ4n 84 0.01% rô1i 84 0.01% dda(5ng 84 0.01% 300 83 0.01% ki5p 83 0.01% tu 83 0.01% san 83 0.01% châ5m 83 0.01% ddô1n 83 0.01% tán 83 0.01% sô3 83 0.01% ki5ch 82 0.01% cu'1ng 82 0.01% kho3e 82 0.01% chu5p 82 0.01% cho'3 82 0.01% trô5m 82 0.01% mãn 82 0.01% trâ1n 82 0.01% washington 82 0.01% trú 82 0.01% tra(1ng 81 0.01% gia3n 81 0.01% the 81 0.01% nxb

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 26 81 0.01% thùng 81 0.01% màn 81 0.01% cha(ng 81 0.01% gio5ng 81 0.01% bò 80 0.01% canada 80 0.01% chu 80 0.01% ddai 80 0.01% bút 80 0.01% buô2n 80 0.01% thiê3u 79 0.01% soa5n 79 0.01% ngài 79 0.01% miê5ng 79 0.01% the3 79 0.01% cháy 79 0.01% tiê1c 79 0.01% ruô5ng 78 0.01% q 78 0.01% bá 78 0.01% liba(ng 78 0.01% mác 78 0.01% khuyê1n 78 0.01% treo 77 0.01% hô 77 0.01% 27 77 0.01% cha(m 77 0.01% ra(1n 77 0.01% 500 77 0.01% tô1c 77 0.01% du’5 77 0.01% bô1i 77 0.01% bâ1y 77 0.01% gia3ng 77 0.01% bia 77 0.01% nhu5c 77 0.01% câ1t 77 0.01% 00 76 0.01% cú 76 0.01% ddua 76 0.01% tan 76 0.01% nhuâ5n 76 0.01% o'n 76 0.01% ubnd 76 0.01% 2000 76 0.01% ( 76 0.01% discovery 76 0.01% viêm 76 0.01% bâ5t 76 0.01% mê5nh 76 0.01% lu’5c 75 0.01% sàn

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 27 75 0.01% ddô1t 75 0.01% khóa 75 0.01% kho 75 0.01% ddành 75 0.01% d 75 0.01% tùy 75 0.01% dâ5y 75 0.01% mau 75 0.01% hô2n 75 0.01% ngu'2ng 74 0.01% klinsmann 74 0.01% ta(5ng 74 0.01% of 74 0.01% sóc 74 0.01% tràn 74 0.01% triê1t 74 0.01% california 74 0.01% quí 74 0.01% pakistan 74 0.01% lô2 73 0.01% lu'o'5t 73 0.01% thách 73 0.01% nê2 73 0.01% ddiê5u 73 0.01% tha3 73 0.01% co'm 73 0.01% tiê4n 73 0.01% khô3ng 73 0.01% suâ1t 73 0.01% mái 72 0.01% lãng 72 0.01% khu'1 72 0.01% lu'2a 72 0.01% tha(1c 72 0.01% hôn 72 0.01% gô4 72 0.01% nhu’ng 71 0.01% 26 71 0.01% than 71 0.01% 90 71 0.01% thâ5n 71 0.01% qaeda 71 0.01% nhâ1n 71 0.01% a5 71 0.01% ha(1n 70 0.01% bô4ng 70 0.01% cdd 70 0.01% ái 70 0.01% hoang 70 0.01% se3 70 0.01% vân 70 0.01% lhq

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 28 70 0.01% thu’o’ng 70 0.01% thang 69 0.01% nha3y 69 0.01% tho5 69 0.01% bs 69 0.01% g8 69 0.01% thao 69 0.01% cãi 69 0.01% vua 69 0.01% góc 69 0.01% thoa3 69 0.01% dô5i 69 0.01% tha(ng 69 0.01% 70 69 0.01% tô 69 0.01% vong 69 0.01% ngã 69 0.01% nasa 69 0.01% núi 68 0.01% mét 68 0.01% kèm 68 0.01% singapore 68 0.01% sy4 68 0.01% xóm 68 0.01% sa5ch 68 0.01% pho3ng 68 0.01% oanh 67 0.01% indonesia 67 0.01% mu'o'2i 67 0.01% canh 67 0.01% da5ng 67 0.01% thu5y 67 0.01% thiê5p 67 0.01% a3 67 0.01% ân 67 0.01% ts 67 0.01% x 67 0.01% khuôn 67 0.01% xâm 67 0.01% ballack 66 0.01% lu’o’5ng 66 0.01% sa5n 66 0.01% trùm 66 0.01% ddôla 66 0.01% mo’1i 66 0.01% tu5t 66 0.01% (hà 66 0.01% giây 66 0.01% 1975 66 0.01% nhì 66 0.01% e 66 0.01% trùng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 29 65 0.01% gián 65 0.01% vé 65 0.01% khuynh 65 0.01% huyê1t 65 0.01% mã 65 0.01% syria 65 0.01% cúm 65 0.01% gio’2 65 0.01% giâ1u 65 0.01% new 64 0.01% triê5t 64 0.01% ddèn 64 0.01% hiê2n 64 0.01% du'o'2ng 64 0.01% tru’o’2ng 64 0.01% sô1t 64 0.01% sút 63 0.01% ôn 63 0.01% tròn 63 0.01% trông 63 0.01% ru'o'5u 63 0.01% lây 63 0.01% trô1n 63 0.01% gánh 63 0.01% trà 62 0.01% 31 62 0.01% khánh 62 0.01% dao 62 0.01% kha3i 62 0.01% pha 62 0.01% giày 61 0.01% ta3n 61 0.01% vê1t 61 0.01% chúc 61 0.01% sót 61 0.01% xá 61 0.01% du'o'4ng 61 0.01% ma(1n 61 0.01% cóc 61 0.01% thâ2u 61 0.01% xì 61 0.01% su5p 61 0.01% luân 60 0.01% web 60 0.01% ta(1t 60 0.01% tru'a 60 0.01% mày 60 0.01% 000dd 60 0.01% quà 60 0.01% 2007 60 0.01% do5c 60 0.01% hung

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 30 60 0.01% mô3 59 0.01% nhu'o'5ng 59 0.01% ngu'ng 59 0.01% ddê1 59 0.01% tê1t 59 0.01% cai 59 0.01% u' 59 0.01% gu'o'ng 59 0.01% câ3m 59 0.01% câ1u 59 0.01% mu4i 59 0.01% tam 59 0.01% bánh 58 0.01% zarqawi 58 0.01% tu'o'i 58 0.01% ddan 58 0.01% lôi 58 0.01% ruô5t 58 0.01% du'2ng 57 0.01% tâ5t 57 0.01% nga5c 57 0.01% khúc 57 0.01% dda5p 57 0.01% va5n 57 0.01% bê2n 57 0.01% non 57 0.01% ám 56 0.01% lu'ng 56 0.01% thai 56 0.01% tha(2ng 56 0.01% k 56 0.01% loài 56 0.01% trâ2m 56 0.01% mâu 56 0.01% hoan 56 0.01% tàng 55 0.01% ddiê2n 55 0.01% út 55 0.01% u 55 0.01% nhi 55 0.01% giâ5n 55 0.01% khóc 55 0.01% chàng 55 0.01% â1m 55 0.01% nát 55 0.01% tóm 55 0.01% tiê2m 55 0.01% van 55 0.01% xóa 55 0.01% lâ5t 55 0.01% quy2nh 54 0.01% gói

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 31 54 0.01% huynh 54 0.01% vô5i 54 0.01% bô3ng 54 0.01% ngâ2m 54 0.01% bách 54 0.01% lê5ch 54 0.01% ha3o 54 0.01% bu'4a 54 0.01% ca(1p 54 0.01% liê2u 53 0.01% 04 53 0.01% doa5 53 0.01% phiê1n 53 0.01% ddu’a 53 0.01% xu'1ng 53 0.01% giu’4a 53 0.01% túng 53 0.01% khung 53 0.01% vang 53 0.01% bu5ng 52 0.01% bê1 52 0.01% chu’o’ng 52 0.01% ô1c 52 0.01% kênh 52 0.01% tto 52 0.01% nhâ4n 52 0.01% ma5ch 52 0.01% nét 52 0.01% tru'4 52 0.01% chu'1a 52 0.01% cia 52 0.01% khiêm 51 0.01% tro’3 51 0.01% trúc 51 0.01% rào 51 0.01% 45 51 0.01% vo5t 51 0.01% ca(5p 51 0.01% ung 51 0.01% ngõ 51 0.01% re4 51 0.01% giáp 51 0.01% dã 51 0.01% gà 51 0.01% o'i 51 0.01% rác 51 0.01% thâm 51 0.01% nghìn 50 0.01% u'1c 50 0.01% manh 50 0.01% 29 50 0.01% cô1t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 32 50 0.01% ddo 50 0.01% la(1p 50 0.01% ve4 50 0.01% serbia 50 0.01% ddáo 50 0.01% dô1i 50 0.01% rumsfeld 50 0.01% thoi 50 0.01% tuê5 50 0.01% trô2ng 50 0.01% tóc 50 0.01% cô5t 50 0.01% ddu’1c 50 0.01% ddeo 50 0.01% thép 50 0.01% che4 50 0.01% â3n 50 0.01% sôi 49 0.01% vu’2a 49 0.01% kín 49 0.01% dde3 49 0.01% tu'1 49 0.01% cu'o'4ng 49 0.01% zidane 49 0.01% km 49 0.01% tám 49 0.01% bùi 49 0.01% sô 49 0.01% chu3ng 49 0.01% mô2 49 0.01% 150 49 0.01% pntr 49 0.01% huyê2n 49 0.01% túi 49 0.01% beirut 49 0.01% nghi5ch 49 0.01% ddiê5p 48 0.01% tho'5 48 0.01% dô2n 48 0.01% chót 48 0.01% ta5c 48 0.01% tu’5 48 0.01% dáng 48 0.01% khô1n 48 0.01% ddãi 48 0.01% du'o'5c 48 0.01% bênh 48 0.01% dda(1t 48 0.01% khiê3n 48 0.01% khôn 48 0.01% lâ5n 47 0.01% dinh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 33 47 0.01% james 47 0.01% lô5n 47 0.01% lui 47 0.01% hu'ng 47 0.01% móc 47 0.01% ty5 47 0.01% lo’2i 47 0.01% sa(n 47 0.01% 01 47 0.01% dd 47 0.01% dính 47 0.01% vu 47 0.01% ta5ng 47 0.01% ào 47 0.01% de5p 47 0.01% huy2nh 47 0.01% york 47 0.01% chuô5c 47 0.01% 8406 47 0.01% thoa3i 47 0.01% dò 47 0.01% 600 47 0.01% hoài 47 0.01% bu'u 47 0.01% khô1ng 47 0.01% la5m 47 0.01% tô3n 47 0.01% lò 46 0.01% tang 46 0.01% ha 46 0.01% dê1 46 0.01% heo 46 0.01% hu'1ng 46 0.01% ngãi 46 0.01% thpt 46 0.01% cu'o'1i 46 0.01% não 46 0.01% thuâ2n 46 0.01% p 46 0.01% tháp 46 0.01% 56 46 0.01% bi3 46 0.01% bó 46 0.01% vay 46 0.01% ma3nh 46 0.01% ví 45 0.01% phâ4u 45 0.01% nu'1o'c 45 0.01% phu'1c 45 0.01% thâ2m 45 0.01% g 45 0.01% john

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 34 45 0.01% 2010 45 0.01% lùng 45 0.01% costa 45 0.01% lão 45 0.01% chôn 45 0.01% khâ3n 45 0.01% chô1t 45 0.01% ney 45 0.01% bô2i 45 0.01% nhi5p 45 0.01% l 45 0.01% say 44 0.01% brazil 44 0.01% chìm 44 0.01% tho’2i 44 0.01% khuyên 44 0.01% lân 44 0.01% ngo5n 44 0.01% bùng 44 0.01% hiê1u 44 0.01% hãi 44 0.01% i 44 0.01% khâ1u 44 0.01% ddàng 44 0.01% chuyê2n 44 0.01% ho’5p 44 0.01% cát 44 0.01% dê5t 44 0.01% thô3i 44 0.01% kiê5t 44 0.01% bám 44 0.01% chênh 44 0.01% mù 43 0.01% ngô 43 0.01% go'3i 43 0.01% chu’1ng 43 0.01% giâ1c 43 0.01% thi3nh 43 0.01% giêng 43 0.01% ngoan 43 0.01% hiv 43 0.01% duyê5t 43 0.01% 75 43 0.01% mô5ng 43 0.01% sa(1t 43 0.01% mu’1c 43 0.01% mo'2 43 0.01% nv2 43 0.01% basayev 43 0.01% lô 43 0.01% ngâ5p 43 0.01% mông

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 35 43 0.01% vi5nh 43 0.01% leo 43 0.01% mo’3 43 0.01% (1 43 0.01% gay 43 0.01% 1998 43 0.01% sung 43 0.01% 05 43 0.01% (tu'1c 43 0.01% 1980 43 0.01% ii 43 0.01% im 43 0.01% pho 42 0.00% quê1 42 0.00% hòn 42 0.00% argentina 42 0.00% ta3ng 42 0.00% man 42 0.00% ra(ng 42 0.00% khoán 42 0.00% huô1ng 42 0.00% óc 42 0.00% xu 42 0.00% tu’3 42 0.00% phiê2n 42 0.00% hiê1m 42 0.00% lu'o'4ng 41 0.00% khói 41 0.00% ddiêu 41 0.00% dâu 41 0.00% ddâ5u 41 0.00% to'1 41 0.00% ddu4a 41 0.00% viê1ng 41 0.00% khen 41 0.00% xhcn 41 0.00% phâ4n 41 0.00% rica 41 0.00% co'1 41 0.00% tho'3 41 0.00% uranium 41 0.00% ti4nh 40 0.00% bô5i 40 0.00% co'3i 40 0.00% khuyê1t 40 0.00% ngu'5c 40 0.00% cu3ng 40 0.00% 1999 40 0.00% ukraine 40 0.00% rãi 40 0.00% vo'4 40 0.00% trô1ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 36 40 0.00% ddô2i 40 0.00% hâ2m 40 0.00% saddam 40 0.00% na(1ng 40 0.00% tháo 40 0.00% to3a 40 0.00% vietnam 40 0.00% hèn 40 0.00% làn 40 0.00% cu’1u 40 0.00% tuân 39 0.00% (tu'2 39 0.00% blair 39 0.00% khâu 39 0.00% mê5t 39 0.00% cha3y 39 0.00% va(1ng 39 0.00% 39 0.00% nô 39 0.00% co 39 0.00% tâ3y 39 0.00% ta5 39 0.00% saudi 39 0.00% ddu’o’2ng 39 0.00% tro’5 39 0.00% tu’ 39 0.00% ga(1t 39 0.00% hô3 39 0.00% lãm 39 0.00% tôm 39 0.00% lô5t 38 0.00% mu'o'i 38 0.00% ngu 38 0.00% vây 38 0.00% ddô2n 38 0.00% mo3i 38 0.00% ha(ng 38 0.00% thu’o’2ng 38 0.00% mê1n 38 0.00% ap 38 0.00% quát 38 0.00% giu’4 38 0.00% sharon 38 0.00% láng 38 0.00% khát 38 0.00% lo5c 38 0.00% truyê5n 38 0.00% vo'2i 38 0.00% mê2m 38 0.00% bv 38 0.00% ddòn 38 0.00% ddinh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 37 38 0.00% khiêu 37 0.00% rao 37 0.00% sweden 37 0.00% lo’1n 37 0.00% 400 37 0.00% texas 37 0.00% ra(1c 37 0.00% ngô5 37 0.00% cua3 37 0.00% dàn 37 0.00% tu5ng 37 0.00% 55 37 0.00% lô5c 37 0.00% mùi 37 0.00% khiê1p 37 0.00% bóp 37 0.00% nghiê5n 37 0.00% 38 37 0.00% su'o'ng 37 0.00% la(ng 37 0.00% sánh 37 0.00% tách 37 0.00% tùng 37 0.00% 48 37 0.00% w 37 0.00% liêm 37 0.00% mu'5c 37 0.00% trân 37 0.00% tu’o’1ng 37 0.00% dda(2ng 37 0.00% de 37 0.00% vi4 37 0.00% di5 37 0.00% borowski 36 0.00% chán 36 0.00% sa(1m 36 0.00% ma5n 36 0.00% lewis 36 0.00% ddu'5ng 36 0.00% hi 36 0.00% nhi5 36 0.00% 34 36 0.00% nhánh 36 0.00% vô4 36 0.00% phu'o'1c 36 0.00% no'3 36 0.00% giâ5t 36 0.00% lu5t 36 0.00% khoe 36 0.00% thu'o'1c 36 0.00% xu'o'3ng 36 0.00% trù

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 38 36 0.00% bbc 36 0.00% 32 36 0.00% kiê5m 36 0.00% cuô2ng 36 0.00% hu’4u 36 0.00% da(1t 36 0.00% bâ5n 36 0.00% ddè 36 0.00% giâ2u 36 0.00% chuô5t 36 0.00% trá 36 0.00% saigon 36 0.00% cúp 36 0.00% dâng 35 0.00% o 35 0.00% trinh 35 0.00% 35 35 0.00% ngu'o'4ng 35 0.00% ta(1m 35 0.00% lãi 35 0.00% dày 35 0.00% bô1c 35 0.00% 42 35 0.00% hoãn 35 0.00% húc 35 0.00% no’i 35 0.00% hàm 35 0.00% asean 35 0.00% mát 35 0.00% khô 35 0.00% móng 35 0.00% trô5n 35 0.00% cô3ng 34 0.00% 33 34 0.00% thoái 34 0.00% dô2i 34 0.00% 1995 34 0.00% bâ2y 34 0.00% com 34 0.00% montenegro 34 0.00% khoá 34 0.00% vo3 34 0.00% ddâ5m 34 0.00% cha5m 34 0.00% george 34 0.00% reuters 34 0.00% trâ5t 34 0.00% lô4 34 0.00% ddâm 34 0.00% michael 34 0.00% khoan 34 0.00% bê

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 39 34 0.00% vâ1t 34 0.00% hiê1p 34 0.00% ecuador 34 0.00% mexico 34 0.00% apec 34 0.00% eu 34 0.00% hiê3n 34 0.00% khoe3 34 0.00% sâ5p 33 0.00% ngo'5i 33 0.00% tq 33 0.00% muôn 33 0.00% 02 33 0.00% mâ2m 33 0.00% mo3 33 0.00% miss 33 0.00% muô4i 33 0.00% 09 33 0.00% nhàng 33 0.00% mu'o'1n 33 0.00% lênin 33 0.00% ma5o 33 0.00% ngu5 33 0.00% co'4 33 0.00% nhâ2m 33 0.00% va3 33 0.00% tiêm 33 0.00% bê2 33 0.00% hâ5n 33 0.00% nghiê5t 33 0.00% yê3m 33 0.00% nhu'o'2ng 33 0.00% 1994 32 0.00% huê2 32 0.00% chín 32 0.00% kiêm 32 0.00% chu’a 32 0.00% (có 32 0.00% cp 32 0.00% phâ1n 32 0.00% rô5 32 0.00% huy3 32 0.00% quyên 32 0.00% xu'ng 32 0.00% tha5c 32 0.00% clinton 32 0.00% chim 32 0.00% boeing 32 0.00% hu'u 32 0.00% giê2ng 32 0.00% bu'5c 32 0.00% lu'o'2ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 40 32 0.00% duyên 32 0.00% milosevic 32 0.00% câ3n 32 0.00% cám 32 0.00% quy3 32 0.00% hán 32 0.00% lâ1n 32 0.00% khô1c 32 0.00% buýt 31 0.00% afp 31 0.00% 36 31 0.00% êm 31 0.00% tre 31 0.00% chechnya 31 0.00% hái 31 0.00% bô5c 31 0.00% tuy2 31 0.00% tha5ch 31 0.00% tao 31 0.00% 1990 31 0.00% táng 31 0.00% 2008 31 0.00% gdp 31 0.00% tô2i 31 0.00% hh 31 0.00% 800 31 0.00% pháo 31 0.00% tlt 31 0.00% muô5n 31 0.00% tiê5p 31 0.00% khôi 31 0.00% jesus 31 0.00% times 31 0.00% hãn 31 0.00% bành 31 0.00% cerberus 30 0.00% aids 30 0.00% fallon 30 0.00% thu3ng 30 0.00% hôi 30 0.00% tu'o'1c 30 0.00% 1996 30 0.00% gìn 30 0.00% châ5n 30 0.00% la5ng 30 0.00% tu'5a 30 0.00% xoay 30 0.00% dùi 30 0.00% phu’o’ng 30 0.00% tiê5c 30 0.00% 41 30 0.00% 1945

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 41 30 0.00% na5i 30 0.00% yê1n 30 0.00% lu'o'5m 30 0.00% 46 30 0.00% diê5u 30 0.00% nan 30 0.00% ao 30 0.00% go5n 30 0.00% do'3 30 0.00% mu'o'5n 29 0.00% khéo 29 0.00% va3i 29 0.00% uâ1t 29 0.00% 08 29 0.00% mu4 29 0.00% giu'o'2ng 29 0.00% ha(m 29 0.00% arabia 29 0.00% 37 29 0.00% lo'4 29 0.00% thoa3ng 29 0.00% ném 29 0.00% quái 29 0.00% (my4 29 0.00% hâm 29 0.00% nhu'5a 29 0.00% 39 29 0.00% lúng 29 0.00% florida 29 0.00% kho'i 29 0.00% david 29 0.00% julie 29 0.00% va5ch 29 0.00% quyê3n 29 0.00% dn 29 0.00% hô4n 29 0.00% nhu'o'5c 29 0.00% chinh 29 0.00% chiêu 29 0.00% phu'o'5ng 29 0.00% chai 29 0.00% 57 28 0.00% câ5y 28 0.00% tro5n 28 0.00% nhiê4u 28 0.00% vo'3 28 0.00% nâ1y 28 0.00% toa 28 0.00% do5n 28 0.00% houston 28 0.00% gs 28 0.00% xài

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 42 28 0.00% ho3ng 28 0.00% miê1ng 28 0.00% lùi 28 0.00% nga5ch 28 0.00% bo5c 28 0.00% châ5t 28 0.00% ghana 28 0.00% dè 28 0.00% a5t 28 0.00% hô5p 28 0.00% cô1ng 28 0.00% tra(ng 28 0.00% nê3 28 0.00% chó 28 0.00% (trong 28 0.00% malaysia 28 0.00% ngon 28 0.00% abu 28 0.00% thuy3 28 0.00% venezuela 28 0.00% ôi 27 0.00% xâ3y 27 0.00% and 27 0.00% lu4ng 27 0.00% tú 27 0.00% le3 27 0.00% nàng 27 0.00% trôi 27 0.00% su’3 27 0.00% 250 27 0.00% xo' 27 0.00% cu'3u 27 0.00% rung 27 0.00% ti5 27 0.00% ddâ4m 27 0.00% gác 27 0.00% thói 27 0.00% afghanistan 27 0.00% dâ4u 27 0.00% tráng 27 0.00% rau 27 0.00% nhãn 27 0.00% 27 0.00% fred 27 0.00% nhã 27 0.00% trê4 27 0.00% du’o’4ng 27 0.00% ca(m 27 0.00% tha3i 27 0.00% hussein 27 0.00% kennedy 27 0.00% materazzi

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 43 27 0.00% gán 27 0.00% hãnh 27 0.00% xé 27 0.00% columbia 27 0.00% s 27 0.00% né 26 0.00% nho5c 26 0.00% ngo' 26 0.00% thu’1c 26 0.00% dào 26 0.00% su 26 0.00% vnpt 26 0.00% vu’5c 26 0.00% chép 26 0.00% lam 26 0.00% tu'5u 26 0.00% nu5 26 0.00% hãm 26 0.00% lo3ng 26 0.00% he5p 26 0.00% côte 26 0.00% chà 26 0.00% â1p 26 0.00% da5i 26 0.00% ddi4a 26 0.00% phu 26 0.00% nô2i 26 0.00% ve5n 26 0.00% phiá 26 0.00% 1997 26 0.00% chê 26 0.00% lebanon 26 0.00% delay 26 0.00% baghdad 26 0.00% 44 26 0.00% evn 26 0.00% hô1 26 0.00% su'4a 26 0.00% hlv 26 0.00% 47 26 0.00% nhát 26 0.00% gieo 26 0.00% hu' 26 0.00% râ2y 26 0.00% tha(1t 26 0.00% nha(1n 26 0.00% no5 26 0.00% dô1c 26 0.00% finale 26 0.00% hu5t 26 0.00% roh 25 0.00% xót

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 44 25 0.00% gaza 25 0.00% nga3 25 0.00% ngu’ng 25 0.00% toa3 25 0.00% dâm 25 0.00% xe3 25 0.00% nê1p 25 0.00% mài 25 0.00% ngu'2a 25 0.00% giàn 25 0.00% nhâ5u 25 0.00% gio5t 25 0.00% táo 25 0.00% go'5i 25 0.00% xoá 25 0.00% tru'o'1ng 25 0.00% hoa5 25 0.00% sòng 25 0.00% tru5c 25 0.00% tu’1c 25 0.00% bát 25 0.00% ngâ5m 25 0.00% cu3 25 0.00% tâ2u 25 0.00% ngu'5 25 0.00% ddùa 25 0.00% tu3 25 0.00% tehran 25 0.00% nâ1u 25 0.00% tho'm 25 0.00% mo5c 25 0.00% lông 25 0.00% airlines 25 0.00% ho'3 25 0.00% los 25 0.00% angeles 25 0.00% tràng 25 0.00% túy 25 0.00% nu'o'ng 25 0.00% mãnh 25 0.00% lu'1a 25 0.00% ga5o 24 0.00% dai 24 0.00% mercosur 24 0.00% tuý 24 0.00% clb 24 0.00% thiêng 24 0.00% phiê5t 24 0.00% toa5 24 0.00% châm 24 0.00% nô2ng 24 0.00% hoa5i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 45 24 0.00% chui 24 0.00% rock 24 0.00% ru'3a 24 0.00% dán 24 0.00% 49 24 0.00% lô5i 24 0.00% mô2i 24 0.00% la(n 24 0.00% rooney 24 0.00% cho’i 24 0.00% 1973 24 0.00% rách 24 0.00% ô3 24 0.00% ttct 24 0.00% usa 24 0.00% vo' 24 0.00% bì 24 0.00% ronaldo 24 0.00% chay 24 0.00% cha(n 24 0.00% (2 24 0.00% cuô5n 24 0.00% 1991 24 0.00% petersburg 24 0.00% xí 24 0.00% lon 24 0.00% chu'3i 24 0.00% khuâ1t 24 0.00% ddê1m 24 0.00% xáo 24 0.00% bóc 24 0.00% so3i 24 0.00% buông 24 0.00% bin 24 0.00% (khoa3ng 24 0.00% ddiên 24 0.00% côn 24 0.00% no 24 0.00% philippines 24 0.00% ha3 24 0.00% thu’o’5ng 24 0.00% lo’5i 24 0.00% bem 24 0.00% tri5nh 23 0.00% thòi 23 0.00% má 23 0.00% hs 23 0.00% na 23 0.00% phôi 23 0.00% dduôi 23 0.00% thám 23 0.00% 5dd

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 46 23 0.00% ro'1t 23 0.00% lào 23 0.00% jordan 23 0.00% mò 23 0.00% ghé 23 0.00% elottery 23 0.00% ddô1ng 23 0.00% (trung 23 0.00% condoleezza 23 0.00% sex 23 0.00% sàigòn 23 0.00% khái 23 0.00% tà 23 0.00% 1974 23 0.00% gãy 23 0.00% thu’ 23 0.00% moscow 23 0.00% phanh 23 0.00% vm 23 0.00% ra3i 23 0.00% ke5t 23 0.00% thô1i 23 0.00% liêu 23 0.00% còi 23 0.00% nigeria 23 0.00% 52 23 0.00% 1979 23 0.00% cha(5ng 23 0.00% ddu5c 23 0.00% cay 23 0.00% cui 23 0.00% smddh 23 0.00% thùy 23 0.00% st 23 0.00% trinidad 23 0.00% tobago 23 0.00% xê1 23 0.00% khu'o'ng 23 0.00% shin 23 0.00% xinh 23 0.00% lâ1p 23 0.00% bính 23 0.00% nhu'1t 22 0.00% dda(1n 22 0.00% nàn 22 0.00% jose 22 0.00% ách 22 0.00% vu4ng 22 0.00% lu’o’ng 22 0.00% klose 22 0.00% xao 22 0.00% huê5

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 47 22 0.00% 700 22 0.00% gào 22 0.00% da5o 22 0.00% ky5 22 0.00% kahn 22 0.00% ddam 22 0.00% úy 22 0.00% ga5t 22 0.00% khô1 22 0.00% bergkamp 22 0.00% vnch 22 0.00% cu’1 22 0.00% arsenal 22 0.00% ngo3 22 0.00% ricardo 22 0.00% su’1c 22 0.00% telecom 22 0.00% 1982 22 0.00% nho'3 22 0.00% h5n1 22 0.00% búa 22 0.00% mo3ng 22 0.00% tô1ng 22 0.00% bích 22 0.00% nha5y 22 0.00% khoát 22 0.00% ddúc 22 0.00% tui 22 0.00% bàu 22 0.00% rê3 22 0.00% ba(1p 22 0.00% ranh 22 0.00% ddo'1n 22 0.00% cho'1 22 0.00% nhâ5m 22 0.00% miên 22 0.00% ddu'1t 22 0.00% líu 22 0.00% vu'o'1ng 22 0.00% vaccine 22 0.00% xách 22 0.00% à 22 0.00% berlin 22 0.00% tru'1o'c 22 0.00% ráp 22 0.00% nhu'1c 22 0.00% mail 22 0.00% ngu'o'i 22 0.00% vc 22 0.00% yasushi 21 0.00% xa5 21 0.00% rình

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 48 21 0.00% tg 21 0.00% ngu'5a 21 0.00% ba5t 21 0.00% chô1n 21 0.00% 1976 21 0.00% donald 21 0.00% henry 21 0.00% ta(m 21 0.00% ham 21 0.00% khe 21 0.00% nghiêng 21 0.00% dô4 21 0.00% lót 21 0.00% ru3i 21 0.00% nô5 21 0.00% hu3 21 0.00% diê2u 21 0.00% tom 21 0.00% khía 21 0.00% league 21 0.00% kehl 21 0.00% neuville 21 0.00% ra5ng 21 0.00% lo' 21 0.00% râ4y 21 0.00% tuâ1t 21 0.00% vu'o'n 21 0.00% du’1t 21 0.00% tunisia 21 0.00% thính 21 0.00% (mô5t 21 0.00% chuô4i 21 0.00% di5u 21 0.00% hao 21 0.00% diego 21 0.00% 58 21 0.00% siê1t 21 0.00% annan 21 0.00% tra(1c 21 0.00% munich 21 0.00% tro5 21 0.00% lu'o'4i 21 0.00% rã 21 0.00% na3y 21 0.00% timor 21 0.00% gai 21 0.00% do'2i 21 0.00% luô2ng 21 0.00% cúi 21 0.00% dâ5p 21 0.00% vi3a 21 0.00% gu5c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 49 21 0.00% vã 21 0.00% giê1ng 20 0.00% tha(1n 20 0.00% (không 20 0.00% ddò 20 0.00% nu’4 20 0.00% kê5 20 0.00% men 20 0.00% châ3n 20 0.00% voi 20 0.00% 85 20 0.00% thung 20 0.00% universe 20 0.00% haditha 20 0.00% ddôn 20 0.00% ô1m 20 0.00% xít 20 0.00% do'4 20 0.00% miê5t 20 0.00% ahmadinejad 20 0.00% phê1 20 0.00% tha(5ng 20 0.00% râ2m 20 0.00% run 20 0.00% chelsea 20 0.00% reo 20 0.00% cúng 20 0.00% (nhu' 20 0.00% www 20 0.00% trade 20 0.00% khinh 20 0.00% hoành 20 0.00% ru'o'1c 20 0.00% bill 20 0.00% kofi 20 0.00% thoáng 20 0.00% nha(5t 20 0.00% ma(ng 20 0.00% kìm 20 0.00% hddnd 20 0.00% ô2 20 0.00% 51 20 0.00% lehmann 20 0.00% su3a 20 0.00% dâ1n 20 0.00% le5 20 0.00% nuô1t 20 0.00% to5a 20 0.00% laden 20 0.00% lén 20 0.00% 59 20 0.00% ddo5ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 50 20 0.00% d'ivoire 20 0.00% ddâ1m 20 0.00% kiêng 20 0.00% for 20 0.00% ngo'i 20 0.00% nén 20 0.00% gô1i 20 0.00% xôi 19 0.00% f 19 0.00% 62 19 0.00% lánh 19 0.00% bàng 19 0.00% dda3 19 0.00% ro 19 0.00% me3 19 0.00% ca5n 19 0.00% j 19 0.00% ngâ2n 19 0.00% nút 19 0.00% thê 19 0.00% lm 19 0.00% hê4 19 0.00% add 19 0.00% schweinsteiger 19 0.00% béo 19 0.00% iii 19 0.00% bo'm 19 0.00% angola 19 0.00% kissinger 19 0.00% vddv 19 0.00% hhhv 19 0.00% tiê5m 19 0.00% cù 19 0.00% phu5ng 19 0.00% gazprom 19 0.00% tran 19 0.00% nâu 19 0.00% uý 19 0.00% ddê5m 19 0.00% 53 19 0.00% phui 19 0.00% 120 19 0.00% ddcs 19 0.00% tu3i 19 0.00% pop 19 0.00% chôm 19 0.00% bu'2ng 19 0.00% tqlc 19 0.00% la(1c 19 0.00% quét 19 0.00% 1970 19 0.00% richter

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 51 19 0.00% karaoke 19 0.00% 110 19 0.00% ga 19 0.00% hoa5n 19 0.00% ngoa5n 19 0.00% pho'i 19 0.00% ddáy 19 0.00% rê4 19 0.00% lung 19 0.00% chuô5ng 19 0.00% he3m 19 0.00% cõi 19 0.00% xuôi 19 0.00% ven 19 0.00% nai 19 0.00% 83 19 0.00% sv 19 0.00% xi 19 0.00% hân 19 0.00% vladimir 19 0.00% câ1n 19 0.00% mao 19 0.00% kem 19 0.00% ra5p 18 0.00% hiên 18 0.00% mây 18 0.00% canxi 18 0.00% múa 18 0.00% hâu 18 0.00% â3m 18 0.00% chén 18 0.00% robert 18 0.00% liège 18 0.00% am 18 0.00% da5n 18 0.00% vãn 18 0.00% kê2 18 0.00% hu’o’3ng 18 0.00% lê2 18 0.00% tha5o 18 0.00% kiê1p 18 0.00% kpa(h 18 0.00% (nê1u 18 0.00% ngu5y 18 0.00% son 18 0.00% 03 18 0.00% (sinh 18 0.00% lo'2 18 0.00% xa3o 18 0.00% tí 18 0.00% pghh 18 0.00% cài

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 52 18 0.00% mê4 18 0.00% scanlon 18 0.00% album 18 0.00% chuông 18 0.00% voa 18 0.00% post 18 0.00% 65 18 0.00% tòan 18 0.00% (hay 18 0.00% (tp 18 0.00% ddâ5y 18 0.00% ô2n 18 0.00% xi3 18 0.00% merkel 18 0.00% bê1p 18 0.00% du'ng 18 0.00% hu’ 18 0.00% chích 18 0.00% mô2m 18 0.00% barcelona 18 0.00% 43 18 0.00% giu 18 0.00% khoác 18 0.00% cole 18 0.00% qui4 18 0.00% 1993 18 0.00% ga5ch 18 0.00% ghép 18 0.00% súc 18 0.00% paraguay 18 0.00% top 18 0.00% trô5i 18 0.00% nguyê2n 18 0.00% william 18 0.00% tha5nh 18 0.00% bùn 18 0.00% hét 18 0.00% nho'n 18 0.00% rudy 18 0.00% la(m 18 0.00% bo'i 18 0.00% ddính 18 0.00% a3o 18 0.00% oán 18 0.00% ngo'4 18 0.00% cho5c 18 0.00% scowcroft 17 0.00% java 17 0.00% méo 17 0.00% vo'5t 17 0.00% bu5i 17 0.00% 1986

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 53 17 0.00% dvd 17 0.00% ireland 17 0.00% reed 17 0.00% bén 17 0.00% u’1ng 17 0.00% gia(5c 17 0.00% khao 17 0.00% iaea 17 0.00% nhé 17 0.00% lâ2y 17 0.00% khuâ3n 17 0.00% go 17 0.00% peter 17 0.00% thúy 17 0.00% 170 17 0.00% soi 17 0.00% ngà 17 0.00% còng 17 0.00% ddày 17 0.00% cuba 17 0.00% ngô5t 17 0.00% ddo'1i 17 0.00% tu5i 17 0.00% fc 17 0.00% kho'1p 17 0.00% qh 17 0.00% mahmoud 17 0.00% 2009 17 0.00% màng 17 0.00% cu5m 17 0.00% chèn 17 0.00% vhtt 17 0.00% bn 17 0.00% tn 17 0.00% ghét 17 0.00% xu'o'1ng 17 0.00% dang 17 0.00% ni 17 0.00% ma5t 17 0.00% vác 17 0.00% (a3nh 17 0.00% chu'o'1c 17 0.00% jack 17 0.00% quâ1y 17 0.00% nu’4a 17 0.00% phùng 17 0.00% cu'o'1c 17 0.00% 61 17 0.00% vàn 17 0.00% bet 17 0.00% 1978 17 0.00% báu

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 54 17 0.00% u'a 17 0.00% mi5 17 0.00% thâ1u 17 0.00% liê4u 17 0.00% rô5n 17 0.00% lâ3n 17 0.00% bombay 17 0.00% tro' 17 0.00% hersh 17 0.00% 1968 17 0.00% stuttgart 17 0.00% bâ1m 16 0.00% tour 16 0.00% càn 16 0.00% thét 16 0.00% beckham 16 0.00% 67 16 0.00% o'1t 16 0.00% u'o'1t 16 0.00% ráo 16 0.00% xa3 16 0.00% na(n 16 0.00% ddô1 16 0.00% axít 16 0.00% cu’5c 16 0.00% ngây 16 0.00% hu'o'2ng 16 0.00% ti 16 0.00% va(5t 16 0.00% chu’4a 16 0.00% trói 16 0.00% rê5t 16 0.00% (4 16 0.00% trêu 16 0.00% da(5n 16 0.00% bolivia 16 0.00% bhatia 16 0.00% 160 16 0.00% london 16 0.00% ngu3i 16 0.00% 69 16 0.00% vùi 16 0.00% nghiã 16 0.00% 81 16 0.00% ngo5 16 0.00% thu'2ng 16 0.00% tông 16 0.00% international 16 0.00% pha(3ng 16 0.00% tu’2ng 16 0.00% 86 16 0.00% su'u

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 55 16 0.00% sxh 16 0.00% frings 16 0.00% tu’o’2ng 16 0.00% dép 16 0.00% 63 16 0.00% mít 16 0.00% giòng 16 0.00% tra5ch 16 0.00% shiite 16 0.00% del 16 0.00% beckenbauer 16 0.00% khoái 16 0.00% khép 16 0.00% xoa 16 0.00% nôn 16 0.00% cho'5t 16 0.00% website 16 0.00% btc 16 0.00% (chi3 16 0.00% mâ4n 16 0.00% tô1p 16 0.00% cu'5 16 0.00% xông 16 0.00% ho5ng 16 0.00% ngu' 16 0.00% hecta 16 0.00% nato 16 0.00% thu'5 16 0.00% dãy 16 0.00% mô1c 16 0.00% robben 16 0.00% campuchia 16 0.00% ddsq 16 0.00% australia 16 0.00% virginia 16 0.00% gôn 16 0.00% dortmund 16 0.00% thô1t 16 0.00% cho5i 16 0.00% giã 16 0.00% bo3ng 16 0.00% gas 16 0.00% hong 16 0.00% nhu3 16 0.00% tivi 16 0.00% le 16 0.00% ddê 16 0.00% gao 16 0.00% tony 16 0.00% si3 16 0.00% figo 16 0.00% nghênh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 56 15 0.00% 350 15 0.00% thèm 15 0.00% dili 15 0.00% 1949 15 0.00% côi 15 0.00% ngâ1t 15 0.00% ghê 15 0.00% mô1t 15 0.00% da(5m 15 0.00% vy 15 0.00% chdcnd 15 0.00% richard 15 0.00% email 15 0.00% mòn 15 0.00% cd 15 0.00% war 15 0.00% quãng 15 0.00% triê2n 15 0.00% 1954 15 0.00% cô5i 15 0.00% lùn 15 0.00% ru'o'3i 15 0.00% gu4i 15 0.00% junta 15 0.00% trán 15 0.00% nga(1m 15 0.00% dhs 15 0.00% dubai 15 0.00% mc 15 0.00% 54 15 0.00% carlos 15 0.00% dô1t 15 0.00% giãn 15 0.00% cáp 15 0.00% tro'5n 15 0.00% bi5p 15 0.00% bô3n 15 0.00% nepal 15 0.00% plastech 15 0.00% tê 15 0.00% chìa 15 0.00% lee 15 0.00% na3n 15 0.00% ê 15 0.00% (chu3 15 0.00% vcd 15 0.00% kong 15 0.00% lít 15 0.00% waterford 15 0.00% ngo5t 15 0.00% mô2ng 15 0.00% dc

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 57 15 0.00% nô1t 15 0.00% lút 15 0.00% ru3a 15 0.00% pap 15 0.00% vét 15 0.00% o' 15 0.00% cheney 15 0.00% cò 15 0.00% news 15 0.00% na(1p 15 0.00% na5p 15 0.00% 64 15 0.00% thà 15 0.00% video 15 0.00% 1989 15 0.00% gâ5y 15 0.00% ca5p 14 0.00% gom 14 0.00% hbsag 14 0.00% rocket 14 0.00% 130 14 0.00% kremlin 14 0.00% tdtt 14 0.00% chu' 14 0.00% giáng 14 0.00% dda(1ng 14 0.00% cairo 14 0.00% cha3 14 0.00% lí 14 0.00% tha(1m 14 0.00% r 14 0.00% (iran 14 0.00% ngùi 14 0.00% gio' 14 0.00% be5p 14 0.00% pho'3 14 0.00% guam 14 0.00% genève 14 0.00% chu’1a 14 0.00% toronto 14 0.00% sanh 14 0.00% hê5t 14 0.00% reagan 14 0.00% da3i 14 0.00% va(ng 14 0.00% virus 14 0.00% no'1i 14 0.00% kiê2m 14 0.00% chiên 14 0.00% (25 14 0.00% su’3a 14 0.00% u'ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 58 14 0.00% nhi4 14 0.00% tâ2n 14 0.00% vnexpress 14 0.00% nghe5n 14 0.00% nhu'o'4ng 14 0.00% thu5c 14 0.00% lu'2ng 14 0.00% lu4y 14 0.00% buô2ng 14 0.00% bo'1 14 0.00% sheldon 14 0.00% nhào 14 0.00% khâm 14 0.00% city 14 0.00% ngu'3a 14 0.00% hò 14 0.00% bâ4y 14 0.00% pauleta 14 0.00% bâ3n 14 0.00% tâ1p 14 0.00% ru3 14 0.00% loát 14 0.00% fan 14 0.00% thiêu 14 0.00% ri3 14 0.00% hang 14 0.00% tát 14 0.00% dda(1p 14 0.00% ddiê1m 14 0.00% m2 14 0.00% mìn 14 0.00% cho'1p 14 0.00% cm 14 0.00% ulianovxco 14 0.00% khiê1m 14 0.00% nga5o 14 0.00% paul 14 0.00% mi 14 0.00% nho 14 0.00% khoang 14 0.00% cambodia 13 0.00% euro 13 0.00% nv1 13 0.00% gâ5t 13 0.00% 72 13 0.00% basmati 13 0.00% su'o'1ng 13 0.00% nhô2i 13 0.00% oliver 13 0.00% nhan 13 0.00% 82 13 0.00% schneider

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 59 13 0.00% lahm 13 0.00% khuya 13 0.00% phiêu 13 0.00% khoét 13 0.00% rô1t 13 0.00% uma 13 0.00% 1967 13 0.00% peru 13 0.00% quyê1n 13 0.00% taliban 13 0.00% (11 13 0.00% ngâ1m 13 0.00% muô1i 13 0.00% khanh 13 0.00% zambrotta 13 0.00% bundesliga 13 0.00% champions 13 0.00% miê5n 13 0.00% podolski 13 0.00% vi5t 13 0.00% 1992 13 0.00% laser 13 0.00% da(5t 13 0.00% ddbscl 13 0.00% chua 13 0.00% su5t 13 0.00% (xã 13 0.00% nhiê1p 13 0.00% microsoft 13 0.00% nho' 13 0.00% hí 13 0.00% nã 13 0.00% (bô5 13 0.00% toan 13 0.00% nhét 13 0.00% thuy5 13 0.00% nga5n 13 0.00% bddhq 13 0.00% guô2ng 13 0.00% hòng 13 0.00% khuê 13 0.00% vo'1t 13 0.00% (công 13 0.00% thuý 13 0.00% thán 13 0.00% norquist 13 0.00% (3 13 0.00% ddát 13 0.00% pmu 13 0.00% tokyo 13 0.00% hindu 13 0.00% ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 60 13 0.00% time 13 0.00% bui 13 0.00% tro'1 13 0.00% university 13 0.00% u3i 13 0.00% american 13 0.00% 220 13 0.00% hu’o’1ng 13 0.00% taleban 13 0.00% (ngày 13 0.00% inc 13 0.00% ngào 13 0.00% ddu’1ng 13 0.00% rô2ng 13 0.00% hô1t 13 0.00% lê1t 13 0.00% rà 13 0.00% ” 13 0.00% tu’1 13 0.00% lu’u 13 0.00% 1988 13 0.00% so’3 13 0.00% then 13 0.00% bông 13 0.00% thô1n 13 0.00% liêng 13 0.00% ldd 13 0.00% mâ3u 13 0.00% tia 13 0.00% 77 13 0.00% ru'5c 13 0.00% cu’3 13 0.00% virút 13 0.00% ma3ng 13 0.00% (qua3ng 13 0.00% yahoo 13 0.00% tu'o'1i 12 0.00% lu'4 12 0.00% la(5p 12 0.00% vung 12 0.00% lay 12 0.00% ngón 12 0.00% câm 12 0.00% bo’3i 12 0.00% mãng 12 0.00% tâ1u 12 0.00% vâ2n 12 0.00% olympic 12 0.00% vóc 12 0.00% gài 12 0.00% nhê5n 12 0.00% ha(1c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 61 12 0.00% gerrard 12 0.00% o’n 12 0.00% thê2 12 0.00% ngán 12 0.00% la(5n 12 0.00% râu 12 0.00% kiêu 12 0.00% so'5i 12 0.00% ê1 12 0.00% zawahiri 12 0.00% nguyê5t 12 0.00% lê2u 12 0.00% dda5m 12 0.00% mâ2u 12 0.00% riê1t 12 0.00% go'1m 12 0.00% sa3nh 12 0.00% carter 12 0.00% nguy5 12 0.00% chxhcn 12 0.00% ve 12 0.00% liverpool 12 0.00% ván 12 0.00% hoi 12 0.00% niêm 12 0.00% (sài 12 0.00% biê1u 12 0.00% tò 12 0.00% ngó 12 0.00% (tu'o'2ng 12 0.00% visa 12 0.00% cu’5u 12 0.00% xôn 12 0.00% philips 12 0.00% tnhh 12 0.00% go5ng 12 0.00% nhô1t 12 0.00% kha 12 0.00% vâng 12 0.00% lát 12 0.00% lu'o'1t 12 0.00% loa 12 0.00% (gio'2 12 0.00% la(2n 12 0.00% 84 12 0.00% penalty 12 0.00% thuram 12 0.00% soon 12 0.00% ml 12 0.00% (tên 12 0.00% oánh 12 0.00% ddi4

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 62 12 0.00% ddh4 12 0.00% sâ2u 12 0.00% jerry 12 0.00% rico 12 0.00% kông 12 0.00% brown 12 0.00% tru'1ng 12 0.00% láo 12 0.00% phô2ng 12 0.00% phâ1t 12 0.00% giu5c 12 0.00% berne 12 0.00% gia(2ng 12 0.00% ohio 12 0.00% nha5t 12 0.00% 78 12 0.00% nu'1t 12 0.00% ru'o'4i 12 0.00% â5p 12 0.00% 900 12 0.00% 12 0.00% bu'ng 12 0.00% arafat 12 0.00% cali 12 0.00% bo'5 12 0.00% (hoa 12 0.00% thiê2n 12 0.00% thâ1m 12 0.00% my 12 0.00% nâ1m 12 0.00% i3 12 0.00% gyanendra 12 0.00% ddo5a 12 0.00% johnson 12 0.00% colombia 12 0.00% ginting 12 0.00% xen 12 0.00% 66 12 0.00% mâ5p 12 0.00% sydney 12 0.00% vuô1t 12 0.00% cúc 12 0.00% ru'o'2ng 11 0.00% la5p 11 0.00% metzelder 11 0.00% (do 11 0.00% puerto 11 0.00% bo'1i 11 0.00% quâ5t 11 0.00% ta(5c 11 0.00% shia 11 0.00% kiley

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 63 11 0.00% so'2 11 0.00% (wto 11 0.00% 1000 11 0.00% té 11 0.00% robot 11 0.00% saint 11 0.00% qua(ng 11 0.00% o3i 11 0.00% bu'o'1u 11 0.00% thòng 11 0.00% girl 11 0.00% qua5t 11 0.00% xà 11 0.00% you 11 0.00% 74 11 0.00% (sô1 11 0.00% takagi 11 0.00% prodi 11 0.00% cóp 11 0.00% trô4i 11 0.00% dnnn 11 0.00% vu5n 11 0.00% lèo 11 0.00% ndd 11 0.00% vê1 11 0.00% cô1c 11 0.00% chu'ng 11 0.00% jeremy 11 0.00% belarus 11 0.00% bo' 11 0.00% gò 11 0.00% mèo 11 0.00% (dda5i 11 0.00% fatah 11 0.00% be3 11 0.00% xát 11 0.00% dick 11 0.00% sar 11 0.00% vèo 11 0.00% phì 11 0.00% lu'o'5n 11 0.00% de3o 11 0.00% ro'4 11 0.00% kèo 11 0.00% suu 11 0.00% chen 11 0.00% rành 11 0.00% nhi5n 11 0.00% kyi 11 0.00% nghe5t 11 0.00% horno 11 0.00% (thu'o'2ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 64 11 0.00% shell 11 0.00% chóc 11 0.00% mark 11 0.00% mép 11 0.00% valencia 11 0.00% nòng 11 0.00% rét 11 0.00% gddtla 11 0.00% frank 11 0.00% bôi 11 0.00% nha(n 11 0.00% gu’3i 11 0.00% phun 11 0.00% ghraib 11 0.00% wendy 11 0.00% (6 11 0.00% bu'o'u 11 0.00% ttxvn 11 0.00% buô2m 11 0.00% tráo 11 0.00% dance 11 0.00% co5c 11 0.00% nhi3 11 0.00% musharraf 11 0.00% (21 11 0.00% (thuô5c 11 0.00% xu’3 11 0.00% okondor 11 0.00% ma3i 11 0.00% suông 11 0.00% ho 11 0.00% tphcm 11 0.00% 87 11 0.00% ca(1n 10 0.00% lxl 10 0.00% jacquelyn 10 0.00% hollywood 10 0.00% uruguay 10 0.00% juventus 10 0.00% tru’2 10 0.00% morales 10 0.00% bi5t 10 0.00% baucus 10 0.00% gâ1u 10 0.00% dennis 10 0.00% cuô1c 10 0.00% chavez 10 0.00% morgan 10 0.00% lô1 10 0.00% xích 10 0.00% ra3nh 10 0.00% khang

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 65 10 0.00% mo'4 10 0.00% haifa 10 0.00% tb 10 0.00% trút 10 0.00% chóp 10 0.00% ùn 10 0.00% ru4 10 0.00% tuyê1t 10 0.00% va5 10 0.00% ngâm 10 0.00% hagl 10 0.00% pmu18 10 0.00% (nguyên 10 0.00% friedrich 10 0.00% sô1c 10 0.00% bruce 10 0.00% bu’o’1c 10 0.00% xui 10 0.00% landis 10 0.00% duâ3n 10 0.00% gâ2m 10 0.00% action 10 0.00% bilic 10 0.00% vênh 10 0.00% chirac 10 0.00% sistani 10 0.00% cqn 10 0.00% allianz 10 0.00% arena 10 0.00% tv 10 0.00% kê2m 10 0.00% khoáng 10 0.00% sanchez 10 0.00% vo’4 10 0.00% netviet 10 0.00% group 10 0.00% chê5ch 10 0.00% (mà 10 0.00% rêu 10 0.00% cho’3 10 0.00% núp 10 0.00% manuel 10 0.00% giaó 10 0.00% 95 10 0.00% bayern 10 0.00% nu’3a 10 0.00% vu'o'5ng 10 0.00% 1946 10 0.00% rights 10 0.00% hù 10 0.00% lo’1p 10 0.00% bu'1o'c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 66 10 0.00% ma(5n 10 0.00% tns 10 0.00% la(1k 10 0.00% trâu 10 0.00% xô1p 10 0.00% 1987 10 0.00% bremen 10 0.00% dunga 10 0.00% likud 10 0.00% dda(1m 10 0.00% ak 10 0.00% ke5p 10 0.00% ddùi 10 0.00% dduô1i 10 0.00% olmert 10 0.00% sea 10 0.00% vâ1p 10 0.00% thcs 10 0.00% nhu’o’4ng 10 0.00% 1966 10 0.00% nv3 10 0.00% (tô3ng 10 0.00% camera 10 0.00% darfur 10 0.00% frankfurt 10 0.00% stanford 10 0.00% trê5 10 0.00% ma(1ng 10 0.00% nho5n 10 0.00% china 10 0.00% ct 10 0.00% tdv 10 0.00% nhàm 10 0.00% xô5n 10 0.00% vâ4y 10 0.00% cu’3a 10 0.00% lô5ng 10 0.00% – 10 0.00% owen 10 0.00% ddanh 10 0.00% kho’3i 10 0.00% ddô5n 10 0.00% 76 10 0.00% nhô1i 10 0.00% bê3 10 0.00% epson 10 0.00% 3000 10 0.00% khoa(n 10 0.00% (ngu'o'2i 10 0.00% 1947 10 0.00% háo 10 0.00% xiê1t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 67 10 0.00% ngàng 10 0.00% (29 10 0.00% show 10 0.00% tha3n 10 0.00% ghen 10 0.00% ddút 10 0.00% lách 10 0.00% du'o'5t 10 0.00% bond 10 0.00% (ddu'o'5c 10 0.00% to' 10 0.00% street 10 0.00% ngách 10 0.00% sudan 10 0.00% su’ 10 0.00% 1985 10 0.00% trót 10 0.00% lu'5u 10 0.00% 181 10 0.00% diana 10 0.00% lo'3 10 0.00% ma5 10 0.00% la5y 10 0.00% dda5c 10 0.00% 180 10 0.00% (còn 10 0.00% geneva 10 0.00% di4a 10 0.00% 89 10 0.00% hormon 10 0.00% bangkok 9 0.00% ém 9 0.00% (8 9 0.00% game 9 0.00% chi4a 9 0.00% quê5 9 0.00% croatia 9 0.00% ddâ1ng 9 0.00% beach 9 0.00% mi5t 9 0.00% koroman 9 0.00% 88 9 0.00% ga(5t 9 0.00% law 9 0.00% lenin 9 0.00% ddttsg 9 0.00% (sau 9 0.00% tro5t 9 0.00% life 9 0.00% pfister 9 0.00% coalition 9 0.00% yeltsin

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 68 9 0.00% kheng 9 0.00% lhasa 9 0.00% vl 9 0.00% ngai 9 0.00% pv 9 0.00% pekerman 9 0.00% ddu 9 0.00% virut 9 0.00% hill 9 0.00% marshall 9 0.00% (các 9 0.00% lì 9 0.00% (pntr 9 0.00% kíp 9 0.00% nannup 9 0.00% nón 9 0.00% taxi 9 0.00% (24 9 0.00% go’3i 9 0.00% thu5t 9 0.00% nhung 9 0.00% ngâ4u 9 0.00% jong 9 0.00% khoanh 9 0.00% cu’o’2ng 9 0.00% (30 9 0.00% ba(n 9 0.00% bu 9 0.00% (ddu'1c 9 0.00% martin 9 0.00% gâ2y 9 0.00% a(1p 9 0.00% ddu'o'2i 9 0.00% 73 9 0.00% u'o'i 9 0.00% marinko 9 0.00% gandhi 9 0.00% dãi 9 0.00% joe 9 0.00% 235 9 0.00% (the 9 0.00% 1950 9 0.00% 1948 9 0.00% dengue 9 0.00% sco 9 0.00% cstq 9 0.00% eric 9 0.00% (trên 9 0.00% xâ1p 9 0.00% saviola 9 0.00% ten 9 0.00% cháo

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 69 9 0.00% poseidon 9 0.00% lemerre 9 0.00% qúa 9 0.00% (10 9 0.00% tô5t 9 0.00% (reuters 9 0.00% ustr 9 0.00% ngao 9 0.00% gia(5t 9 0.00% net 9 0.00% 118 9 0.00% me 9 0.00% choáng 9 0.00% lâ2u 9 0.00% ti5nh 9 0.00% honda 9 0.00% michel 9 0.00% center 9 0.00% gã 9 0.00% (tiê2n 9 0.00% bu'3u 9 0.00% truô2ng 9 0.00% cô2n 9 0.00% lô2ng 9 0.00% tít 9 0.00% rajana 9 0.00% rò 9 0.00% túm 9 0.00% hkd 9 0.00% hoong 9 0.00% coóc 9 0.00% freedom 9 0.00% bâ2n 9 0.00% lè 9 0.00% ra5ch 9 0.00% tâ5u 9 0.00% latinh 9 0.00% kè 9 0.00% shamil 9 0.00% vu'1t 9 0.00% state 9 0.00% ha5m 9 0.00% ro’2i 9 0.00% matxco'va 9 0.00% sawaco 9 0.00% petit 9 0.00% cu'u 9 0.00% ngâ3n 9 0.00% beiruth 9 0.00% ni3 9 0.00% ca(3ng 9 0.00% cô1i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 70 9 0.00% dìm 9 0.00% da(2ng 9 0.00% ti3a 9 0.00% vuông 9 0.00% ha(2n 9 0.00% francisco 9 0.00% sunni 9 0.00% liê1ng 9 0.00% (17 9 0.00% 1960 9 0.00% diê4m 9 0.00% mi3m 9 0.00% teo 9 0.00% alabama 9 0.00% les 9 0.00% blatter 9 0.00% chiê3u 9 0.00% phách 9 0.00% xù 9 0.00% vna 9 0.00% 1962 9 0.00% irish 9 0.00% le3o 9 0.00% lo5ng 9 0.00% tu’o’ng 9 0.00% dô5t 9 0.00% uzbekistan 9 0.00% ho'2n 9 0.00% ehud 9 0.00% (tháng 9 0.00% pgs 9 0.00% eo 9 0.00% unesco 9 0.00% hu'o'1c 9 0.00% gia(ng 9 0.00% grove 9 0.00% chèo 9 0.00% wayne 9 0.00% tara 9 0.00% ôtô 9 0.00% â2m 9 0.00% tru’5c 9 0.00% (22 9 0.00% hòang 9 0.00% xua 9 0.00% ngu5c 9 0.00% bít 9 0.00% angela 9 0.00% lã 8 0.00% club 8 0.00% nháp 8 0.00% (bán

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 71 8 0.00% hall 8 0.00% west 8 0.00% alzheimer 8 0.00% suv 8 0.00% 79 8 0.00% stephen 8 0.00% ra(1m 8 0.00% nga(5t 8 0.00% giô4 8 0.00% teng 8 0.00% hâ1t 8 0.00% lau 8 0.00% phô3i 8 0.00% qdd 8 0.00% du’5a 8 0.00% bâ3y 8 0.00% ho’i 8 0.00% sa5p 8 0.00% 500dd 8 0.00% 000m3 8 0.00% thô1 8 0.00% du’o’ng 8 0.00% ròng 8 0.00% hé 8 0.00% du’o’1i 8 0.00% sê1p 8 0.00% georgia 8 0.00% sành 8 0.00% nuo'1c 8 0.00% (saprissa 8 0.00% ráng 8 0.00% nu'5c 8 0.00% loãng 8 0.00% gong 8 0.00% asia 8 0.00% togo 8 0.00% viettel 8 0.00% rang 8 0.00% nhai 8 0.00% hu'5c 8 0.00% (ba 8 0.00% nhàn 8 0.00% (ba(1c 8 0.00% cha3i 8 0.00% ngo5ai 8 0.00% vách 8 0.00% ca(1m 8 0.00% garden 8 0.00% tha(3m 8 0.00% xíu 8 0.00% thiê1p 8 0.00% uyên

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 72 8 0.00% cha(1p 8 0.00% koizumi 8 0.00% ra(n 8 0.00% ngu’o’5c 8 0.00% hori 8 0.00% ms 8 0.00% bob 8 0.00% ho'3i 8 0.00% (28 8 0.00% ngu'u 8 0.00% nixon 8 0.00% oai 8 0.00% bã 8 0.00% vo'1 8 0.00% 1984 8 0.00% smith 8 0.00% abbas 8 0.00% singh 8 0.00% dí 8 0.00% te3 8 0.00% bai 8 0.00% 007 8 0.00% nga5t 8 0.00% scott 8 0.00% ttg 8 0.00% tòi 8 0.00% manager 8 0.00% khan 8 0.00% kashmir 8 0.00% nu'1c 8 0.00% trau 8 0.00% nách 8 0.00% mennonite 8 0.00% lù 8 0.00% luyê1n 8 0.00% na5t 8 0.00% katrina 8 0.00% léo 8 0.00% ddâ2m 8 0.00% lô1c 8 0.00% (ddh 8 0.00% cu 8 0.00% photocopy 8 0.00% ngô1n 8 0.00% win 8 0.00% royce 8 0.00% mo'1m 8 0.00% (vo'1i 8 0.00% mcgirk 8 0.00% yasser 8 0.00% globe 8 0.00% ru5t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 73 8 0.00% khodorkovsky 8 0.00% rè 8 0.00% 560 8 0.00% koehler 8 0.00% ho'4i 8 0.00% chông 8 0.00% cong 8 0.00% ngâ4m 8 0.00% wolbachia 8 0.00% quì 8 0.00% 68 8 0.00% (26 8 0.00% co’n 8 0.00% vu5t 8 0.00% tu'ng 8 0.00% isser 8 0.00% (gia3m 8 0.00% siêng 8 0.00% italia 8 0.00% national 8 0.00% ddãng 8 0.00% heroin 8 0.00% ukraina 8 0.00% phai3 8 0.00% ddiê1c 8 0.00% relations 8 0.00% adidas 8 0.00% crespo 8 0.00% sacombank 8 0.00% (con 8 0.00% a(ng 8 0.00% giò 8 0.00% xoa(1n 8 0.00% morrison 8 0.00% (mez 8 0.00% nòi 8 0.00% bô5t 8 0.00% len 8 0.00% bo’1t 8 0.00% (gia 8 0.00% gõ 8 0.00% rô4i 8 0.00% luis 8 0.00% elizondo 8 0.00% today 8 0.00% ruô2i 8 0.00% nhõm 8 0.00% hispanic 8 0.00% kha(ng 8 0.00% mercedes 8 0.00% huyê5t 8 0.00% dda(5n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 74 8 0.00% santos 8 0.00% perfume 8 0.00% cg 8 0.00% cuô2n 8 0.00% 125 8 0.00% vành 8 0.00% su5c 8 0.00% wales 8 0.00% 2x 8 0.00% pw 8 0.00% náu 8 0.00% va 8 0.00% sbs 8 0.00% thâu 8 0.00% thomas 8 0.00% murray 8 0.00% (q 8 0.00% ray 8 0.00% ddo’5t 8 0.00% http 8 0.00% mike 8 0.00% christian 8 0.00% thò 8 0.00% osama 8 0.00% quy5 8 0.00% jazz 8 0.00% 140 8 0.00% ptqd 8 0.00% 1952 8 0.00% hugo 8 0.00% condolezza 8 0.00% domenech 8 0.00% co5 8 0.00% zuleyka 8 0.00% 113 8 0.00% pin 8 0.00% 787 8 0.00% nancy 8 0.00% nha(2ng 8 0.00% (o'3 8 0.00% khoai 8 0.00% ralph 8 0.00% cnn 8 0.00% burns 8 0.00% giê1m 8 0.00% … 8 0.00% (phút 8 0.00% phao 8 0.00% úp 8 0.00% xh 7 0.00% kcx 7 0.00% gio'2i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 75 7 0.00% nhâ3y 7 0.00% du'3ng 7 0.00% chì 7 0.00% cho’2 7 0.00% bilis 7 0.00% bi5a 7 0.00% chiê2n 7 0.00% che5t 7 0.00% miê1u 7 0.00% (9 7 0.00% tel 7 0.00% nâ1c 7 0.00% tze 7 0.00% reinado 7 0.00% (sv 7 0.00% bch 7 0.00% â3u 7 0.00% ri 7 0.00% nít 7 0.00% mba 7 0.00% (nhu'4ng 7 0.00% kuwait 7 0.00% chu'2a 7 0.00% (18 7 0.00% nhúng 7 0.00% ben 7 0.00% kilomet 7 0.00% tâ1c 7 0.00% 1981 7 0.00% râ5m 7 0.00% ch 7 0.00% so’ 7 0.00% boris 7 0.00% suýt 7 0.00% ra3 7 0.00% hít 7 0.00% ddoa3n 7 0.00% hoay 7 0.00% manmohan 7 0.00% matt 7 0.00% loay 7 0.00% islamabad 7 0.00% ls 7 0.00% grosso 7 0.00% vo'2 7 0.00% south 7 0.00% miguel 7 0.00% taekwondo 7 0.00% rivera 7 0.00% ddao 7 0.00% scolari 7 0.00% colin

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 76 7 0.00% vietnam4all 7 0.00% cpc 7 0.00% tu’o’3ng 7 0.00% (tây 7 0.00% loretta 7 0.00% 1m70 7 0.00% mo'1 7 0.00% nâ5p 7 0.00% bi5ch 7 0.00% le4o 7 0.00% foundation 7 0.00% so’1m 7 0.00% wilson 7 0.00% ro’i 7 0.00% mênh 7 0.00% ddo5 7 0.00% ddo’5i 7 0.00% nhoi 7 0.00% nhe5m 7 0.00% 777 7 0.00% vu'o'3ng 7 0.00% the5n 7 0.00% day 7 0.00% rodriguez 7 0.00% lâ4y 7 0.00% tiger 7 0.00% akram 7 0.00% 1959 7 0.00% seoul 7 0.00% phác 7 0.00% nham 7 0.00% chéo 7 0.00% mi5n 7 0.00% cày 7 0.00% lo'5 7 0.00% ddèo 7 0.00% ru5ng 7 0.00% lu' 7 0.00% chao 7 0.00% ngông 7 0.00% phung 7 0.00% nhoáng 7 0.00% bung 7 0.00% hòan 7 0.00% bussiness 7 0.00% fair 7 0.00% mô1ng 7 0.00% bo'3 7 0.00% tho'1i 7 0.00% xu’1 7 0.00% thiê3n 7 0.00% uô1n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 77 7 0.00% (qua 7 0.00% lép 7 0.00% walter 7 0.00% xa(1n 7 0.00% tuô5t 7 0.00% (chính 7 0.00% nha3n 7 0.00% united 7 0.00% du’5ng 7 0.00% werder 7 0.00% dê 7 0.00% cddnvtd 7 0.00% vu’o’5t 7 0.00% 1972 7 0.00% (tho'2i 7 0.00% gãi 7 0.00% 2020 7 0.00% su'ng 7 0.00% suôn 7 0.00% na(5c 7 0.00% gâ1m 7 0.00% thuo'3 7 0.00% ngoa(5c 7 0.00% chém 7 0.00% hu'1c 7 0.00% díu 7 0.00% râ2u 7 0.00% mì 7 0.00% da(m 7 0.00% náo 7 0.00% hu'2ng 7 0.00% fax 7 0.00% chát 7 0.00% italy 7 0.00% nsu't 7 0.00% qlvnch 7 0.00% (eu 7 0.00% agency 7 0.00% riquelme 7 0.00% keo 7 0.00% thu’3 7 0.00% d' 7 0.00% jenny 7 0.00% 1958 7 0.00% ù 7 0.00% simao 7 0.00% 7 0.00% (12 7 0.00% eriksson 7 0.00% 1983 7 0.00% mertesacker 7 0.00% 21h00

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 78 7 0.00% juan 7 0.00% cristiano 7 0.00% michigan 7 0.00% cô4 7 0.00% ulkraine 7 0.00% 420 7 0.00% lampard 7 0.00% ddo’2i 7 0.00% hk 7 0.00% quách 7 0.00% (chiê1m 7 0.00% (vì 7 0.00% 18h00 7 0.00% der 7 0.00% cào 7 0.00% phàng 7 0.00% (evn 7 0.00% gô5p 7 0.00% 99 7 0.00% chô1c 7 0.00% tru'o'5t 7 0.00% gô5c 7 0.00% mci 7 0.00% gates 7 0.00% harel 7 0.00% jerusalem 7 0.00% ks 7 0.00% fbi 7 0.00% (33 7 0.00% vu5ng 7 0.00% is 7 0.00% qúy 7 0.00% (19 7 0.00% cua 7 0.00% 8255 7 0.00% nu’1t 7 0.00% mcloughlin 7 0.00% faz 7 0.00% chuô1i 7 0.00% khmer 7 0.00% qùy 7 0.00% cô5 7 0.00% hông 7 0.00% shabak 7 0.00% chu'5c 7 0.00% mikel 7 0.00% toa3n 7 0.00% xám 7 0.00% pha3 7 0.00% xxi 7 0.00% thiêm 7 0.00% rùng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 79 7 0.00% (hoa(5c 7 0.00% (anh 7 0.00% beveren 7 0.00% sâ1y 7 0.00% thày 7 0.00% va(1t 7 0.00% thuyên 7 0.00% th 7 0.00% quâ3n 7 0.00% aung 7 0.00% lin 7 0.00% goodlathe 7 0.00% opec 7 0.00% lìa 7 0.00% (bbc 7 0.00% tr 7 0.00% ethanol 7 0.00% ro'5 7 0.00% cph 7 0.00% cu'o'4i 7 0.00% tu3y 7 0.00% phen 7 0.00% kwh 7 0.00% xe3ng 7 0.00% basa 7 0.00% bvd 7 0.00% gô2ng 7 0.00% su'1t 7 0.00% (23 7 0.00% christopher 7 0.00% jimmy 7 0.00% helms 7 0.00% chiêm 7 0.00% vina 7 0.00% greenberg 7 0.00% da5c 7 0.00% cholesterol 7 0.00% dduô1c 7 0.00% rabin 7 0.00% (ddê3 7 0.00% dda(1k 7 0.00% rô 7 0.00% daily 7 0.00% ruili 7 0.00% ngo'5p 7 0.00% y3 7 0.00% lhtn 7 0.00% cha5p 7 0.00% (ông 7 0.00% sá 7 0.00% lóc 7 0.00% drogba

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 80 7 0.00% chác 7 0.00% 230 7 0.00% ru 7 0.00% phêrô 7 0.00% ho'2 7 0.00% tho'1t 7 0.00% 105 7 0.00% kadima 7 0.00% il 6 0.00% ptt 6 0.00% (alajuela 6 0.00% cành 6 0.00% truâ1t 6 0.00% xô5c 6 0.00% iss 6 0.00% sáp 6 0.00% nâ5u 6 0.00% gang 6 0.00% us 6 0.00% (xxi 6 0.00% so5c 6 0.00% 102 6 0.00% larry 6 0.00% rèn 6 0.00% diê1p 6 0.00% da5t 6 0.00% oscar 6 0.00% contra 6 0.00% 5000 6 0.00% on 6 0.00% ê1ch 6 0.00% mô5c 6 0.00% rô3 6 0.00% ngáo 6 0.00% delhi 6 0.00% buffon 6 0.00% that 6 0.00% wanchope 6 0.00% lác 6 0.00% ostrava 6 0.00% ngô3 6 0.00% 71 6 0.00% (cô5ng 6 0.00% ahmed 6 0.00% vnah 6 0.00% 92 6 0.00% met 6 0.00% so5 6 0.00% ngát 6 0.00% livni 6 0.00% chói 6 0.00% yorker

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 81 6 0.00% què 6 0.00% kidan 6 0.00% g7 6 0.00% tem 6 0.00% ruô2ng 6 0.00% vitamin 6 0.00% dduo'5c 6 0.00% campell 6 0.00% pacific 6 0.00% tròng 6 0.00% rala(ng 6 0.00% pacông 6 0.00% (16 6 0.00% ngóc 6 0.00% paulson 6 0.00% hì 6 0.00% (time 6 0.00% abkhazia 6 0.00% tw 6 0.00% kê1ch 6 0.00% loà 6 0.00% louisiana 6 0.00% (13 6 0.00% da(3ng 6 0.00% gâ4m 6 0.00% phích 6 0.00% vú 6 0.00% cho'n 6 0.00% he 6 0.00% thu’o’3ng 6 0.00% bâ2m 6 0.00% ma3 6 0.00% beer 6 0.00% roberto 6 0.00% (nam 6 0.00% mùng 6 0.00% 550 6 0.00% safavian 6 0.00% kèn 6 0.00% vuong 6 0.00% dâ4m 6 0.00% nghi4nh 6 0.00% tomas 6 0.00% ddiê2m 6 0.00% (cu4ng 6 0.00% powell 6 0.00% câ5t 6 0.00% khiê1t 6 0.00% tru'o'5ng 6 0.00% (thái 6 0.00% mafia 6 0.00% basten

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 82 6 0.00% 3495 6 0.00% 91 6 0.00% normal 6 0.00% xi5n 6 0.00% gelsenkirchen 6 0.00% paulo 6 0.00% va(5n 6 0.00% phô2n 6 0.00% williams 6 0.00% da(2n 6 0.00% khuâ1y 6 0.00% thui 6 0.00% (04 6 0.00% quâ1t 6 0.00% tre4n 6 0.00% jevric 6 0.00% persie 6 0.00% radio 6 0.00% 11m 6 0.00% go’4 6 0.00% antonio 6 0.00% tày 6 0.00% qúôc 6 0.00% nâ2n 6 0.00% nha(2n 6 0.00% diê5p 6 0.00% max 6 0.00% kezman 6 0.00% kone 6 0.00% phu4 6 0.00% db 6 0.00% rossi 6 0.00% a3m 6 0.00% (nhà 6 0.00% mirzapour 6 0.00% tizie 6 0.00% kuala 6 0.00% csg 6 0.00% hollings 6 0.00% ivoire 6 0.00% dô1p 6 0.00% ke4 6 0.00% thoa5t 6 0.00% 138 6 0.00% lim 6 0.00% slna 6 0.00% ru'o'5t 6 0.00% ford 6 0.00% ingushetia 6 0.00% bu5c 6 0.00% ngòi 6 0.00% out

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 83 6 0.00% jens 6 0.00% (dda3ng 6 0.00% fsb 6 0.00% pring 6 0.00% totti 6 0.00% toni 6 0.00% nghiê2n 6 0.00% lisa 6 0.00% nao 6 0.00% ffa 6 0.00% bgk 6 0.00% trent 6 0.00% nãy 6 0.00% kamaz 6 0.00% xuê3 6 0.00% su’1 6 0.00% nv 6 0.00% sam 6 0.00% tra(n 6 0.00% (cao 6 0.00% xi5t 6 0.00% intranet 6 0.00% (chu'a 6 0.00% tiê4u 6 0.00% 98 6 0.00% davis 6 0.00% syrie 6 0.00% phè 6 0.00% ca(2n 6 0.00% chan 6 0.00% hòm 6 0.00% 1481 6 0.00% abc 6 0.00% tu’o’5ng 6 0.00% natalie 6 0.00% 737 6 0.00% ngo 6 0.00% cau 6 0.00% (nhâ5t 6 0.00% harper 6 0.00% (ap 6 0.00% energy 6 0.00% lu'3ng 6 0.00% 6 0.00% mulally 6 0.00% qua5 6 0.00% câ1y 6 0.00% ðo3 6 0.00% ye 6 0.00% quality 6 0.00% du'a 6 0.00% boston

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 84 6 0.00% heng 6 0.00% suô1i 6 0.00% oslo 6 0.00% nín 6 0.00% tuô2ng 6 0.00% to'i 6 0.00% na5ng 6 0.00% kosovo 6 0.00% ba3nh 6 0.00% vòi 6 0.00% bu'2a 6 0.00% uô3ng 6 0.00% dde4o 6 0.00% vôi 6 0.00% sri 6 0.00% lanka 6 0.00% thoa 6 0.00% lún 6 0.00% (thay 6 0.00% psa 6 0.00% tu'o'5c 6 0.00% quito 6 0.00% mnc 6 0.00% adoption 6 0.00% phô 6 0.00% church 6 0.00% ubtvqh 6 0.00% pasteur 6 0.00% vgsv 6 0.00% ghpgvntn 6 0.00% két 6 0.00% ky 6 0.00% cuô1ng 6 0.00% nôm 6 0.00% lichtenstein 6 0.00% u3 6 0.00% 131 6 0.00% kình 6 0.00% suez 6 0.00% roi 6 0.00% chypre 6 0.00% (tru'o'2ng 6 0.00% madrid 6 0.00% dùm 6 0.00% tê2 6 0.00% 360 6 0.00% thaksin 6 0.00% aviv 6 0.00% ni5nh 6 0.00% (nay 6 0.00% audi 6 0.00% mo

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 85 6 0.00% bu'o'3i 6 0.00% z 6 0.00% dõng 6 0.00% ddòan 6 0.00% huê 6 0.00% ngòai 6 0.00% â1u 6 0.00% ok 6 0.00% hun 6 0.00% wong 6 0.00% crouch 6 0.00% ronald 6 0.00% bayer 6 0.00% kaesong 6 0.00% chiê1t 6 0.00% li5nh 6 0.00% giùm 6 0.00% whitney 6 0.00% pratt 6 0.00% (20 6 0.00% ngu'1a 6 0.00% xén 6 0.00% jakarta 6 0.00% daniel 6 0.00% nhang 6 0.00% vietnamnet 6 0.00% chài 6 0.00% ismail 6 0.00% ddai5 6 0.00% dâ1y 6 0.00% 96 6 0.00% châ5u 6 0.00% leslie 6 0.00% khét 6 0.00% entertainment 6 0.00% carmona 6 0.00% bmw 6 0.00% se5o 6 0.00% bar 6 0.00% sê 6 0.00% quáng 6 0.00% prambanan 6 0.00% 260 6 0.00% thabet 6 0.00% (tru'o'1c 6 0.00% river 6 0.00% sinai 6 0.00% nhút 6 0.00% quota 6 0.00% nhô3 6 0.00% brussels 6 0.00% (khu

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 86 6 0.00% cu’ 6 0.00% chè 6 0.00% nghe4n 6 0.00% ngu'2o'i 6 0.00% ktm 6 0.00% (hai 6 0.00% su'2ng 6 0.00% hastert 6 0.00% hú 6 0.00% nhô 6 0.00% gân 6 0.00% online 6 0.00% hô3i 6 0.00% ngoa(5t 6 0.00% nhâ1p 6 0.00% (5 6 0.00% dìu 6 0.00% u’o’1c 6 0.00% cha5nh 6 0.00% ddia5 6 0.00% jacques 6 0.00% khê 6 0.00% miêu 6 0.00% nhô5n 6 0.00% kali 6 0.00% (tu'o'ng 6 0.00% (ý 6 0.00% cnxh 6 0.00% quâ2y 6 0.00% gio'3 6 0.00% 25dd 6 0.00% giu'5t 6 0.00% mac 6 0.00% (hiê5n 6 0.00% khít 6 0.00% simi 6 0.00% kgb 6 0.00% (câ1p 6 0.00% nobel 6 0.00% jr 6 0.00% petro 6 0.00% ne3o 6 0.00% bìu 6 0.00% vá 6 0.00% xâ2m 5 0.00% mu’2ng 5 0.00% ntu 5 0.00% festival 5 0.00% chùn 5 0.00% kumar 5 0.00% (vê2 5 0.00% ngóng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 87 5 0.00% gilberto 5 0.00% calorie 5 0.00% oái 5 0.00% vu’4ng 5 0.00% ba(m 5 0.00% khi3 5 0.00% a(m 5 0.00% larson 5 0.00% du5m 5 0.00% phâ5p 5 0.00% (liên 5 0.00% tâ3u 5 0.00% ……………… 5 0.00% hip 5 0.00% kilo 5 0.00% anthony 5 0.00% carvalho 5 0.00% hargreaves 5 0.00% dô2ng 5 0.00% 000m 5 0.00% bê5 5 0.00% pho'1t 5 0.00% cu'ng 5 0.00% natanz 5 0.00% org 5 0.00% na(1n 5 0.00% free 5 0.00% chang 5 0.00% emirates 5 0.00% qui3 5 0.00% mào 5 0.00% suncruz 5 0.00% va(1c 5 0.00% wolfowitz 5 0.00% 104 5 0.00% nóc 5 0.00% ngâ3ng 5 0.00% collect 5 0.00% nguô5i 5 0.00% pac 5 0.00% cô4i 5 0.00% 121 5 0.00% kiatisak 5 0.00% (se4 5 0.00% rome 5 0.00% so3 5 0.00% dylan 5 0.00% (new 5 0.00% bô2n 5 0.00% golf 5 0.00% mó 5 0.00% jean

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 88 5 0.00% lo5ai 5 0.00% lynn 5 0.00% charles 5 0.00% bo'2i 5 0.00% myanmar 5 0.00% edge 5 0.00% lumpur 5 0.00% cain 5 0.00% (thanh 5 0.00% (hô5i 5 0.00% phâ3n 5 0.00% sisulu 5 0.00% xó 5 0.00% gu'o'm 5 0.00% heitinga 5 0.00% dnt 5 0.00% karan 5 0.00% harry 5 0.00% christ 5 0.00% somalia 5 0.00% er 5 0.00% zinedine 5 0.00% stankovic 5 0.00% (và 5 0.00% nilon 5 0.00% bâ3m 5 0.00% pleiku 5 0.00% albright 5 0.00% damadola 5 0.00% xiao 5 0.00% cambridge 5 0.00% tho’5 5 0.00% idf 5 0.00% bajur 5 0.00% marathon 5 0.00% dhl 5 0.00% hop 5 0.00% lu’o’5c 5 0.00% châ2u 5 0.00% sonia 5 0.00% baldemo 5 0.00% marx 5 0.00% (na(m 5 0.00% delta 5 0.00% tru5i 5 0.00% na5o 5 0.00% oi 5 0.00% daucher 5 0.00% nung 5 0.00% a(1t 5 0.00% acb 5 0.00% oxfam

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 89 5 0.00% corporation 5 0.00% yale 5 0.00% 97 5 0.00% co3i 5 0.00% râ2n 5 0.00% katmandu 5 0.00% ed 5 0.00% trèo 5 0.00% baldemor 5 0.00% ri5a 5 0.00% nhe5n 5 0.00% xúm 5 0.00% xi3u 5 0.00% sâ2m 5 0.00% rìa 5 0.00% (vào 5 0.00% asem 5 0.00% vãi 5 0.00% it 5 0.00% slogan 5 0.00% ghe 5 0.00% czech 5 0.00% (national 5 0.00% mikhail 5 0.00% bê1t 5 0.00% melbourne 5 0.00% lu’5a 5 0.00% trâm 5 0.00% steven 5 0.00% di4nh 5 0.00% 75dd 5 0.00% fleming 5 0.00% 1953 5 0.00% pirlo 5 0.00% cddsp 5 0.00% malouda 5 0.00% slovenia 5 0.00% franz 5 0.00% lu’3a 5 0.00% gala 5 0.00% sagnol 5 0.00% ních 5 0.00% diên 5 0.00% 7000 5 0.00% ô3i 5 0.00% policy 5 0.00% 1965 5 0.00% hànô5i 5 0.00% nhí 5 0.00% miami 5 0.00% nu'4u 5 0.00% câ3u

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 90 5 0.00% damascus 5 0.00% clawson 5 0.00% sùng 5 0.00% terry 5 0.00% mo5t 5 0.00% sào 5 0.00% thê5 5 0.00% cannavaro 5 0.00% games 5 0.00% (khi 5 0.00% eboue 5 0.00% rule 5 0.00% mão 5 0.00% phi3nh 5 0.00% váng 5 0.00% kate 5 0.00% 238 5 0.00% gallas 5 0.00% galang 5 0.00% universal 5 0.00% lu'o'2i 5 0.00% trezeguet 5 0.00% (dd 5 0.00% america 5 0.00% (ch 5 0.00% loã 5 0.00% tong 5 0.00% chíp 5 0.00% ddít 5 0.00% dê3 5 0.00% howard 5 0.00% thurman 5 0.00% va5t 5 0.00% susan 5 0.00% gomez 5 0.00% sô5 5 0.00% hòe 5 0.00% túp 5 0.00% (ghi 5 0.00% maoiste 5 0.00% makelele 5 0.00% hitzlsperger 5 0.00% hitler 5 0.00% nho’2 5 0.00% o'dell 5 0.00% rôn 5 0.00% phu’o’5ng 5 0.00% 199 5 0.00% (st 5 0.00% (tru'2 5 0.00% miê2u 5 0.00% nsw

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 91 5 0.00% 202 5 0.00% thu’2a 5 0.00% vo'i 5 0.00% 280 5 0.00% organization 5 0.00% space 5 0.00% 619 5 0.00% mè 5 0.00% shrine 5 0.00% do3m 5 0.00% pelz 5 0.00% kazakhstan 5 0.00% mía 5 0.00% loáng 5 0.00% bú 5 0.00% card 5 0.00% khui 5 0.00% hq 5 0.00% (ddang 5 0.00% su'3ng 5 0.00% nhen 5 0.00% tro'n 5 0.00% rúng 5 0.00% jayapura 5 0.00% ddóan 5 0.00% unocal 5 0.00% (canada 5 0.00% nd 5 0.00% thót 5 0.00% sebastian 5 0.00% yê1t 5 0.00% han 5 0.00% ramos 5 0.00% 114 5 0.00% 1969 5 0.00% chicago 5 0.00% chuô1c 5 0.00% lu 5 0.00% raymond 5 0.00% 1934 5 0.00% khoa3nh 5 0.00% anna 5 0.00% nhâm 5 0.00% dream 5 0.00% nu'o'2m 5 0.00% ddán 5 0.00% hamid 5 0.00% sì 5 0.00% mohamed 5 0.00% parreira 5 0.00% nâ3y 5 0.00% (permanent

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 92 5 0.00% múi 5 0.00% nl 5 0.00% gióng 5 0.00% (giá 5 0.00% hâ1n 5 0.00% ngu'o'5ng 5 0.00% amnesty 5 0.00% mâ1u 5 0.00% google 5 0.00% qu3a 5 0.00% nho5 5 0.00% (ba3n 5 0.00% trô3 5 0.00% (05 5 0.00% 108 5 0.00% craig 5 0.00% ve3n 5 0.00% lanh 5 0.00% du'2a 5 0.00% tháu 5 0.00% estrogen 5 0.00% gdgt 5 0.00% bún 5 0.00% cáy 5 0.00% karl 5 0.00% vu'à 5 0.00% kevin 5 0.00% cho5t 5 0.00% ethiopia 5 0.00% giòn 5 0.00% 320 5 0.00% field 5 0.00% kê2nh 5 0.00% xuô2ng 5 0.00% cbcc 5 0.00% cô2ng 5 0.00% nu'o'5p 5 0.00% exxonmobil 5 0.00% (csvn 5 0.00% doãn 5 0.00% cty 5 0.00% teheran 5 0.00% malaysiakini 5 0.00% lõng 5 0.00% mi3a 5 0.00% porras 5 0.00% bùa 5 0.00% airasia 5 0.00% hóc 5 0.00% jones 5 0.00% giô1c 5 0.00% ngo'2i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 93 5 0.00% origami 5 0.00% phèn 5 0.00% uri 5 0.00% moo 5 0.00% lddbdd 5 0.00% rangoon 5 0.00% ddoa5 5 0.00% nn 5 0.00% frist 5 0.00% nôi 5 0.00% (ca3 5 0.00% randy 5 0.00% ruê5 5 0.00% everett 5 0.00% víu 5 0.00% vu'5a 5 0.00% rove 5 0.00% 1930 5 0.00% khiêng 5 0.00% (brazil 5 0.00% (ddã 5 0.00% tho3 5 0.00% lõi 5 0.00% râm 5 0.00% cô5m 5 0.00% mccain 5 0.00% (ddô2ng 5 0.00% oa(2n 5 0.00% 750 5 0.00% tu'o'3i 5 0.00% human 5 0.00% leverkusen 5 0.00% quàng 5 0.00% borussia 5 0.00% ro' 5 0.00% tê3 5 0.00% cùm 5 0.00% vtv 5 0.00% ria 5 0.00% ló 5 0.00% kho'2 5 0.00% douglas 5 0.00% (tôi 5 0.00% trô2i 5 0.00% thaung 5 0.00% kfc 5 0.00% silva 5 0.00% nation 5 0.00% jensen 5 0.00% reform 5 0.00% thi3u 5 0.00% (quâ5n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 94 5 0.00% a3i 5 0.00% ddùng 5 0.00% wright 5 0.00% yukos 5 0.00% tru5y 5 0.00% (nga 5 0.00% dokdo 5 0.00% khay 5 0.00% media 5 0.00% xô1c 5 0.00% grover 5 0.00% 145 5 0.00% thây 5 0.00% gu'o'5ng 5 0.00% ngu'o'1c 5 0.00% un 5 0.00% (go5i 5 0.00% (ddu'o'2ng 5 0.00% mâm 5 0.00% trãi 5 0.00% nin 5 0.00% nmnth 5 0.00% evo 5 0.00% la(5t 5 0.00% tuyê2n 5 0.00% have 5 0.00% dâ5u 5 0.00% negroponte 5 0.00% slobodan 5 0.00% ali 5 0.00% business 5 0.00% tenet 5 0.00% chantha 5 0.00% (cho 5 0.00% economist 5 0.00% 119 5 0.00% vv 5 0.00% maradona 5 0.00% kv 5 0.00% fullerton 5 0.00% he3o 5 0.00% sa(5c 5 0.00% ke3o 5 0.00% 290 5 0.00% htun 5 0.00% (thu'5c 4 0.00% ex 4 0.00% super 4 0.00% máng 4 0.00% bêtông 4 0.00% tánh 4 0.00% •

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 95 4 0.00% (viê5t 4 0.00% star 4 0.00% ddun 4 0.00% o5p 4 0.00% hô1c 4 0.00% 525 4 0.00% carbon 4 0.00% piano 4 0.00% pangandaran 4 0.00% gu'2ng 4 0.00% qua(5c 4 0.00% quây 4 0.00% no'4 4 0.00% phômai 4 0.00% girlfriend 4 0.00% tro 4 0.00% niê5u 4 0.00% xuâ3n 4 0.00% press 4 0.00% association 4 0.00% séc 4 0.00% tzipi 4 0.00% neo 4 0.00% (15 4 0.00% javier 4 0.00% ddiê1u 4 0.00% du’ 4 0.00% kilogram 4 0.00% (cu4 4 0.00% andrew 4 0.00% ddo'2 4 0.00% phiê5n 4 0.00% griles 4 0.00% logo 4 0.00% missing 4 0.00% 1951 4 0.00% rót 4 0.00% níu 4 0.00% ddê3u 4 0.00% seymour 4 0.00% so’5 4 0.00% petrolimex 4 0.00% thoa(1t 4 0.00% hô3ng 4 0.00% goss 4 0.00% hyun 4 0.00% rafael 4 0.00% rên 4 0.00% (7 4 0.00% cho'2n 4 0.00% huênh 4 0.00% om

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 96 4 0.00% quo'3 4 0.00% giâý 4 0.00% pétrus 4 0.00% (chuyên 4 0.00% sa3 4 0.00% xoành 4 0.00% 106 4 0.00% qua(5ng 4 0.00% zinha 4 0.00% patrick 4 0.00% bernard 4 0.00% chô5t 4 0.00% (u'o'1c 4 0.00% hi3 4 0.00% la3ng 4 0.00% somkid 4 0.00% philippin 4 0.00% griffin 4 0.00% vieira 4 0.00% von 4 0.00% (nhân 4 0.00% lashkar 4 0.00% vietnamese 4 0.00% du’o’5c 4 0.00% se 4 0.00% tru’o’ng 4 0.00% ddu'5c 4 0.00% ho'1n 4 0.00% (kiên 4 0.00% jim 4 0.00% ddcsvn 4 0.00% 133 4 0.00% mr 4 0.00% lòi 4 0.00% 1977 4 0.00% sét 4 0.00% play 4 0.00% xe5t 4 0.00% chúi 4 0.00% bo'4 4 0.00% tand 4 0.00% lét 4 0.00% su5 4 0.00% gallon 4 0.00% go'5n 4 0.00% milan 4 0.00% gàng 4 0.00% superman 4 0.00% (phát 4 0.00% huyên 4 0.00% diê1m 4 0.00% e5p

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 97 4 0.00% junichiro 4 0.00% su'o'2n 4 0.00% vietcombank 4 0.00% noi 4 0.00% nha(4n 4 0.00% ru'o'2m 4 0.00% b61 4 0.00% (loa5i 4 0.00% thuy2 4 0.00% luô5c 4 0.00% cu'a 4 0.00% mnchen 4 0.00% loé 4 0.00% oxytocine 4 0.00% nobuko 4 0.00% múc 4 0.00% chat 4 0.00% hu'4ng 4 0.00% oang 4 0.00% total 4 0.00% na(5n 4 0.00% (pháp 4 0.00% quy2 4 0.00% bhyt 4 0.00% buô1t 4 0.00% 154 4 0.00% chuô2i 4 0.00% (no'i 4 0.00% tím 4 0.00% thu'à 4 0.00% hu'1o'ng 4 0.00% kv1 4 0.00% ttl 4 0.00% mosoco 4 0.00% downer 4 0.00% (ddây 4 0.00% cb 4 0.00% ðông 4 0.00% ferrari 4 0.00% ddu'á 4 0.00% (râ1t 4 0.00% adb 4 0.00% tréo 4 0.00% porsche 4 0.00% bái 4 0.00% latin 4 0.00% (ddà 4 0.00% kaemi 4 0.00% takeshima 4 0.00% bìa 4 0.00% silicon 4 0.00% nhích

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 98 4 0.00% jeep 4 0.00% xuyê1n 4 0.00% 3m 4 0.00% la(1t 4 0.00% stalin 4 0.00% 103 4 0.00% rít 4 0.00% ddái 4 0.00% lincoln 4 0.00% nguo'2i 4 0.00% sgd 4 0.00% tróc 4 0.00% sen 4 0.00% lynnphuong 4 0.00% (thu3 4 0.00% sãi 4 0.00% (gia3i 4 0.00% ariel 4 0.00% 1m 4 0.00% watt 4 0.00% soland 4 0.00% tvdddd 4 0.00% kha(1t 4 0.00% rô4ng 4 0.00% vii 4 0.00% iowa 4 0.00% clateman 4 0.00% khu'3 4 0.00% general 4 0.00% jeffrey 4 0.00% xo3 4 0.00% pha(ng 4 0.00% arbatov 4 0.00% gilardino 4 0.00% héc 4 0.00% úng 4 0.00% bhxh 4 0.00% phèo 4 0.00% grondona 4 0.00% tua 4 0.00% benz 4 0.00% mcnaught 4 0.00% ca5o 4 0.00% tã 4 0.00% graham 4 0.00% uni 4 0.00% tía 4 0.00% tê4 4 0.00% phi3 4 0.00% (mat 4 0.00% (bê1n 4 0.00% mubarak

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 99 4 0.00% hddba 4 0.00% chu'o'1ng 4 0.00% ptsg 4 0.00% 115 4 0.00% (thu5y 4 0.00% sofia 4 0.00% mundhra 4 0.00% one 4 0.00% hassan 4 0.00% lukashenka 4 0.00% ho'2i 4 0.00% 1961 4 0.00% hawaii 4 0.00% luô2n 4 0.00% (châu 4 0.00% tho5c 4 0.00% gót 4 0.00% predator 4 0.00% barthez 4 0.00% who 4 0.00% vó 4 0.00% chris 4 0.00% nguôi 4 0.00% seattle 4 0.00% kyat 4 0.00% gô1m 4 0.00% bu'1t 4 0.00% oil 4 0.00% tro5c 4 0.00% khâ1m 4 0.00% photo 4 0.00% category 4 0.00% or 4 0.00% plo 4 0.00% h5n2 4 0.00% tro’2i 4 0.00% east 4 0.00% ryan 4 0.00% (washington 4 0.00% nhói 4 0.00% little 4 0.00% 380 4 0.00% nhao 4 0.00% wayans 4 0.00% bi5nh 4 0.00% ntk 4 0.00% marine 4 0.00% beckett 4 0.00% hoe 4 0.00% trèm 4 0.00% pervez 4 0.00% love

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 100 4 0.00% stanol 4 0.00% parade 4 0.00% teen 4 0.00% institute 4 0.00% ho3n 4 0.00% bavik 4 0.00% contest 4 0.00% imaging 4 0.00% color 4 0.00% corp 4 0.00% dan 4 0.00% mo’4 4 0.00% leonardo 4 0.00% tròm 4 0.00% ddu’1a 4 0.00% (ddiê2u 4 0.00% khu' 4 0.00% lou 4 0.00% system 4 0.00% real 4 0.00% mu’u 4 0.00% queo 4 0.00% núng 4 0.00% 650 4 0.00% bô2ng 4 0.00% edward 4 0.00% bõ 4 0.00% htv 4 0.00% papua 4 0.00% nho'1t 4 0.00% with 4 0.00% intifada 4 0.00% thìn 4 0.00% 2m 4 0.00% biê1m 4 0.00% nâ1p 4 0.00% râ3y 4 0.00% kiê1t 4 0.00% kurd 4 0.00% bgh 4 0.00% sin 4 0.00% ba5 4 0.00% bâ1p 4 0.00% pei 4 0.00% siniora 4 0.00% kippour 4 0.00% cho’5 4 0.00% no'1t 4 0.00% (bê5nh 4 0.00% asian 4 0.00% khoáy 4 0.00% economic

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 101 4 0.00% 175 4 0.00% (cu3a 4 0.00% (quô1c 4 0.00% kyrgyzstan 4 0.00% ian 4 0.00% foreign 4 0.00% 8g 4 0.00% hách 4 0.00% ma(1m 4 0.00% xinhua 4 0.00% ht 4 0.00% luanda 4 0.00% muô5i 4 0.00% manga 4 0.00% (làm 4 0.00% co'3 4 0.00% trô2 4 0.00% bèn 4 0.00% (finale 4 0.00% sieng 4 0.00% catherine 4 0.00% va5i 4 0.00% thình 4 0.00% fox 4 0.00% (ma(5c 4 0.00% va(1n 4 0.00% ebay 4 0.00% hùa 4 0.00% ddu'o'c 4 0.00% (tô3 4 0.00% dde4 4 0.00% north 4 0.00% shop 4 0.00% lowy 4 0.00% perrotta 4 0.00% tâng 4 0.00% valente 4 0.00% loang 4 0.00% lóe 4 0.00% blue 4 0.00% andrea 4 0.00% alice 4 0.00% liberia 4 0.00% 200m 4 0.00% dâ2m 4 0.00% nê1m 4 0.00% (thôn 4 0.00% mohammed 4 0.00% 4 0.00% kh 4 0.00% chxhcnvn 4 0.00% dermalogica

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 102 4 0.00% cbs 4 0.00% ddác 4 0.00% 1m75 4 0.00% mumbai 4 0.00% zokora 4 0.00% science 4 0.00% ga3 4 0.00% centeno 4 0.00% ca(2m 4 0.00% phàn 4 0.00% boka 4 0.00% family 4 0.00% (2005 4 0.00% (arf 4 0.00% tiago 4 0.00% go'3 4 0.00% quâ1n 4 0.00% trô3i 4 0.00% ontario 4 0.00% (viê5n 4 0.00% yushchenko 4 0.00% 244 4 0.00% madonna 4 0.00% fonseca 4 0.00% robbie 4 0.00% thâ3n 4 0.00% thóc 4 0.00% svtn 4 0.00% islands 4 0.00% sellers 4 0.00% 950 4 0.00% tiê1u 4 0.00% neubrandenburg 4 0.00% alan 4 0.00% carolina 4 0.00% whip 4 0.00% (nhu'ng 4 0.00% ro'm 4 0.00% worldcup 4 0.00% tanh 4 0.00% u’o’ng 4 0.00% kurara 4 0.00% chibana 4 0.00% 210 4 0.00% (world 4 0.00% kiê2n 4 0.00% des 4 0.00% lagerbaeck 4 0.00% franco 4 0.00% doha 4 0.00% firket 4 0.00% schwarzenegger

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 103 4 0.00% 107 4 0.00% mueller 4 0.00% da(5c 4 0.00% 122 4 0.00% gien 4 0.00% wall 4 0.00% (1986 4 0.00% 1m73 4 0.00% leipzig 4 0.00% kiessling 4 0.00% rúc 4 0.00% chùi 4 0.00% kho3an 4 0.00% act 4 0.00% fiorentina 4 0.00% philippine 4 0.00% ga(m 4 0.00% thít 4 0.00% rùm 4 0.00% horacio 4 0.00% phào 4 0.00% ngoa(2n 4 0.00% villa 4 0.00% nhúm 4 0.00% giâ4m 4 0.00% xoài 4 0.00% khxh 4 0.00% chu’4 4 0.00% (bình 4 0.00% manchester 4 0.00% (nhâ1t 4 0.00% kimmitt 4 0.00% du’o’2ng 4 0.00% tampa 4 0.00% camoranesi 4 0.00% ptnt 4 0.00% (1998 4 0.00% bùm 4 0.00% dreamliner 4 0.00% (an 4 0.00% your 4 0.00% glebova 4 0.00% ddo’n 4 0.00% ru’2ng 4 0.00% thê2m 4 0.00% 2018 4 0.00% lom 4 0.00% kho3a 4 0.00% tax 4 0.00% globebeauties 4 0.00% sa5t 4 0.00% uyê3n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 104 4 0.00% distefano 4 0.00% abbondanzieri 4 0.00% kampuchia 4 0.00% luiz 4 0.00% valdez 4 0.00% plaza 4 0.00% kuznets 4 0.00% (public 4 0.00% gamarra 4 0.00% fans 4 0.00% djordjevic 4 0.00% marquez 4 0.00% baldomir 4 0.00% review 4 0.00% caucasus 4 0.00% tm 4 0.00% munoz 4 0.00% bommel 4 0.00% nadj 4 0.00% cút 4 0.00% armi 4 0.00% vanderwall 4 0.00% kì 4 0.00% fabio 4 0.00% trainer 4 0.00% pascal 4 0.00% (beirut 4 0.00% hamburg 4 0.00% 000ha 4 0.00% ibrahimovic 4 0.00% (dc 4 0.00% 1971 4 0.00% 1956 4 0.00% lamy 4 0.00% tý 4 0.00% chip 4 0.00% (dn 4 0.00% mathijsen 4 0.00% grand 4 0.00% (mô4i 4 0.00% ashley 4 0.00% du’o’5t 4 0.00% centre 4 0.00% bronckhorst 4 0.00% chalabi 4 0.00% cho'1i 4 0.00% khe4 4 0.00% ga(1p 4 0.00% roger 4 0.00% (international 4 0.00% bu'o'n 4 0.00% kìa

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 105 4 0.00% inh 4 0.00% cincinnati 4 0.00% bô5n 4 0.00% (ha3i 4 0.00% min 4 0.00% (vi4nh 4 0.00% (california 4 0.00% royal 4 0.00% 1955 4 0.00% srifa 4 0.00% gaø 4 0.00% (sydney 4 0.00% nhe5t 4 0.00% toát 4 0.00% lu5p 4 0.00% icc 4 0.00% sa(1n 4 0.00% ngoai5 4 0.00% kirchner 4 0.00% prudential 4 0.00% el 4 0.00% táp 4 0.00% (tu'5 4 0.00% public 4 0.00% so'4 4 0.00% ngo'1t 4 0.00% bidong 4 0.00% singapour 4 0.00% lai5 4 0.00% gardiner 4 0.00% khâ5p 4 0.00% huntsville 4 0.00% neville 4 0.00% alexei 4 0.00% nhuâ2n 4 0.00% xi3a 4 0.00% a4 4 0.00% b14 4 0.00% wolf 4 0.00% nài 4 0.00% tvc 4 0.00% ayala 4 0.00% toure 4 0.00% bong 4 0.00% ra5c 4 0.00% we 4 0.00% grassley 4 0.00% mu 4 0.00% dâ1p 4 0.00% people 4 0.00% sorin 4 0.00% lâ4m

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 106 4 0.00% huân 4 0.00% cosplay 4 0.00% ia 4 0.00% tho3i 4 0.00% thierry 4 0.00% café 4 0.00% doan 4 0.00% cunningham 4 0.00% nistelrooy 4 0.00% mu3 4 0.00% gavrancic 4 0.00% mississippi 4 0.00% iom 4 0.00% network 4 0.00% systems 4 0.00% ta5t 4 0.00% point 4 0.00% tantillo 4 0.00% ' 4 0.00% (miê2n 4 0.00% bea 4 0.00% elenildo 3 0.00% gov 3 0.00% rày 3 0.00% 465 3 0.00% ddtddd 3 0.00% 143 3 0.00% straits 3 0.00% 460 3 0.00% saleh 3 0.00% (cha 3 0.00% 489 3 0.00% 438 3 0.00% …… 3 0.00% duô4i 3 0.00% gio3 3 0.00% week 3 0.00% vò 3 0.00% lùa 3 0.00% â1t 3 0.00% (83 3 0.00% perth 3 0.00% (trà 3 0.00% crowe 3 0.00% (em 3 0.00% ru'1t 3 0.00% bói 3 0.00% (84 3 0.00% ra5n 3 0.00% strasbourg 3 0.00% kha3m 3 0.00% 5602

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 107 3 0.00% lddsvvntd 3 0.00% cbkt 3 0.00% át 3 0.00% xâu 3 0.00% (u3y 3 0.00% asefi 3 0.00% gilad 3 0.00% valentin 3 0.00% foote 3 0.00% reza 3 0.00% knock 3 0.00% yogyakarta 3 0.00% brownback 3 0.00% ko 3 0.00% nga5nh 3 0.00% augustine 3 0.00% (ha5 3 0.00% air 3 0.00% mel 3 0.00% nu'ã 3 0.00% thuo'ng 3 0.00% fossum 3 0.00% (phâ2n 3 0.00% br 3 0.00% gô 3 0.00% shalit 3 0.00% allen 3 0.00% khua 3 0.00% 730 3 0.00% bu’o’1u 3 0.00% 10h 3 0.00% 1944 3 0.00% rinh 3 0.00% gâ5p 3 0.00% tra5c 3 0.00% (giâ1y 3 0.00% charlie 3 0.00% (câ2n 3 0.00% góa 3 0.00% rr 3 0.00% ge 3 0.00% office 3 0.00% sihanouk 3 0.00% (úc 3 0.00% pa(2ng 3 0.00% (phu'o'2ng 3 0.00% sâm 3 0.00% motor 3 0.00% haniya 3 0.00% gordon 3 0.00% to'3m 3 0.00% elena

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 108 3 0.00% maria 3 0.00% dow 3 0.00% adolf 3 0.00% todd 3 0.00% ddo'2m 3 0.00% 117 3 0.00% congress 3 0.00% blackledge 3 0.00% châ2n 3 0.00% beng 3 0.00% le5t 3 0.00% pennsylvania 3 0.00% ghán 3 0.00% chu'2 3 0.00% vnd 3 0.00% bo5 3 0.00% sê5t 3 0.00% (82 3 0.00% haye 3 0.00% tu5y 3 0.00% requelme 3 0.00% ddtntqcg 3 0.00% my5 3 0.00% argiope 3 0.00% chái 3 0.00% ddùn 3 0.00% brand 3 0.00% váy 3 0.00% qu5y 3 0.00% bloomberg 3 0.00% mfn 3 0.00% jodric 3 0.00% fidel 3 0.00% castro 3 0.00% dodge 3 0.00% (vnpt 3 0.00% ve5t 3 0.00% master 3 0.00% be4 3 0.00% sâ1m 3 0.00% will 3 0.00% chiêng 3 0.00% joseph 3 0.00% nghê4 3 0.00% baker 3 0.00% rmit 3 0.00% 167 3 0.00% piers 3 0.00% 602 3 0.00% 725 3 0.00% japan 3 0.00% tre3o

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 109 3 0.00% ga(ng 3 0.00% dâ4y 3 0.00% khu'o'1c 3 0.00% trâ2y 3 0.00% ottawa 3 0.00% (viettel 3 0.00% ganh 3 0.00% oxy 3 0.00% dó 3 0.00% americans 3 0.00% 192 3 0.00% sa5c 3 0.00% zealand 3 0.00% ted 3 0.00% tcty 3 0.00% condoleeza 3 0.00% serie 3 0.00% thênh 3 0.00% (cùng 3 0.00% (vm 3 0.00% maryland 3 0.00% nhuô5m 3 0.00% 362 3 0.00% 357 3 0.00% asp 3 0.00% ho5ach 3 0.00% thõng 3 0.00% (co' 3 0.00% hâ4ng 3 0.00% 2g 3 0.00% nhoe5t 3 0.00% democracy 3 0.00% nha5nh 3 0.00% dewar 3 0.00% bòn 3 0.00% ca(5n 3 0.00% (ddsq 3 0.00% wu 3 0.00% dopamin 3 0.00% (xin 3 0.00% rupiah 3 0.00% luông 3 0.00% council 3 0.00% 116 3 0.00% aaja 3 0.00% u'2ng 3 0.00% rùa 3 0.00% 1m2 3 0.00% xu5p 3 0.00% nê5m 3 0.00% te5o 3 0.00% anders

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 110 3 0.00% lâ1m 3 0.00% (dù 3 0.00% 1500 3 0.00% sb 3 0.00% bu'5 3 0.00% nang 3 0.00% (saint 3 0.00% 340 3 0.00% harcharik 3 0.00% (bv 3 0.00% lo'5p 3 0.00% intel 3 0.00% 190 3 0.00% xo'3 3 0.00% tiê1m 3 0.00% horta 3 0.00% alkatiri 3 0.00% cu5t 3 0.00% ti5t 3 0.00% du'1a 3 0.00% su'a 3 0.00% (nguyê4n 3 0.00% tuô1t 3 0.00% ac 3 0.00% pô 3 0.00% kalou 3 0.00% viêt 3 0.00% nhuê5 3 0.00% mu'o'1t 3 0.00% ngoa3nh 3 0.00% mourinho 3 0.00% schleck 3 0.00% 270 3 0.00% (ai 3 0.00% 7g30 3 0.00% (2003 3 0.00% andy 3 0.00% palace 3 0.00% house 3 0.00% phà 3 0.00% (pha3i 3 0.00% sê5 3 0.00% 1010 3 0.00% bear 3 0.00% nhoà 3 0.00% (2001 3 0.00% (ta5i 3 0.00% dòn 3 0.00% canaveral 3 0.00% chô2i 3 0.00% cõng 3 0.00% (huyê5n

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 111 3 0.00% yê1m 3 0.00% (tiê3u 3 0.00% dixon 3 0.00% vè 3 0.00% cncnnb 3 0.00% (nguô2n 3 0.00% 5cm 3 0.00% (bí 3 0.00% let 3 0.00% fouad 3 0.00% trác 3 0.00% (ddê2u 3 0.00% ve5o 3 0.00% sám 3 0.00% 187 3 0.00% le3n 3 0.00% copywriter 3 0.00% yemen 3 0.00% hu'á 3 0.00% oa3i 3 0.00% per 3 0.00% clark 3 0.00% go'2m 3 0.00% golan 3 0.00% búi 3 0.00% lu'4ng 3 0.00% vâ2ng 3 0.00% (ban 3 0.00% ru3ng 3 0.00% nuô1i 3 0.00% o'1i 3 0.00% (perth 3 0.00% 1918 3 0.00% edaw 3 0.00% dì 3 0.00% (huê1 3 0.00% arnold 3 0.00% o'2 3 0.00% 303 3 0.00% ho'5i 3 0.00% ong 3 0.00% ngô1 3 0.00% ddo'3 3 0.00% qddd 3 0.00% giri 3 0.00% duê5 3 0.00% razak 3 0.00% 1910 3 0.00% (thành 3 0.00% (ddhqg 3 0.00% shimbun 3 0.00% (bi5

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 112 3 0.00% tóe 3 0.00% chuô2ng 3 0.00% ddu'5o'c 3 0.00% sa(1ng 3 0.00% ru'ng 3 0.00% (viê1t 3 0.00% 200g 3 0.00% hu’1a 3 0.00% (27 3 0.00% (www 3 0.00% mu5 3 0.00% tiê5t 3 0.00% nghiê1n 3 0.00% su3ng 3 0.00% nhe4 3 0.00% patrushev 3 0.00% floyd 3 0.00% lô1t 3 0.00% cu'2u 3 0.00% 7g 3 0.00% mích 3 0.00% phình 3 0.00% king 3 0.00% vous 3 0.00% libya 3 0.00% xoáy 3 0.00% bê5t 3 0.00% trâng 3 0.00% quyê5t 3 0.00% ngu'òi 3 0.00% nh 3 0.00% (ddu'1ng 3 0.00% (san 3 0.00% yang 3 0.00% qu4y 3 0.00% (ddâ2u 3 0.00% (bo3 3 0.00% nokia 3 0.00% 388 3 0.00% há 3 0.00% 179 3 0.00% night 3 0.00% mercury 3 0.00% vck 3 0.00% pereiro 3 0.00% francis 3 0.00% 409 3 0.00% salem 3 0.00% qatar 3 0.00% bo'5m 3 0.00% cisco 3 0.00% aragon

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 113 3 0.00% seafood 3 0.00% (1959 3 0.00% bank 3 0.00% shinawatra 3 0.00% luy4 3 0.00% 211 3 0.00% (internet 3 0.00% nho'3n 3 0.00% châ5p 3 0.00% bali 3 0.00% (dda(5ng 3 0.00% ken 3 0.00% campbell 3 0.00% kilômet 3 0.00% et 3 0.00% giâ1m 3 0.00% turkmenistan 3 0.00% sergei 3 0.00% (na5n 3 0.00% dans 3 0.00% khom 3 0.00% u'2 3 0.00% gáy 3 0.00% (pakistan 3 0.00% lu’o’1i 3 0.00% ljungberg 3 0.00% (40 3 0.00% amsterdam 3 0.00% bosnia 3 0.00% bp 3 0.00% alaska 3 0.00% 559 3 0.00% worl 3 0.00% algeria 3 0.00% sandy 3 0.00% ó 3 0.00% quâ5y 3 0.00% ngoe 3 0.00% metro 3 0.00% dê2 3 0.00% ì 3 0.00% dà 3 0.00% ngoèo 3 0.00% npt 3 0.00% big 3 0.00% bargain 3 0.00% nuno 3 0.00% (plea 3 0.00% index 3 0.00% coast 3 0.00% meira 3 0.00% hu’o’ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 114 3 0.00% lauriane 3 0.00% hanke 3 0.00% (golf 3 0.00% tarud 3 0.00% bu’2a 3 0.00% ames 3 0.00% (thua 3 0.00% apollo 3 0.00% ferdinand 3 0.00% chuô2n 3 0.00% joker 3 0.00% party 3 0.00% xo 3 0.00% pa 3 0.00% (k 3 0.00% zimmer 3 0.00% villar 3 0.00% stagflation 3 0.00% ports 3 0.00% ddo’4 3 0.00% perfumebay 3 0.00% petrodollars 3 0.00% ddn 3 0.00% marco 3 0.00% cocu 3 0.00% capital 3 0.00% bujr 3 0.00% 2500 3 0.00% levis 3 0.00% wipo 3 0.00% golmohammadi 3 0.00% mahdavikia 3 0.00% karimi 3 0.00% sjc 3 0.00% tamil 3 0.00% tddc 3 0.00% jun 3 0.00% bakary 3 0.00% amr 3 0.00% 283 3 0.00% cruyff 3 0.00% bón 3 0.00% gonzales 3 0.00% myco 3 0.00% tmn 3 0.00% pisico 3 0.00% luy5 3 0.00% khu'5ng 3 0.00% amaobi 3 0.00% tostao 3 0.00% heinze 3 0.00% (e

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 115 3 0.00% (tha(1ng 3 0.00% anz 3 0.00% (46 3 0.00% louis 3 0.00% lula 3 0.00% nestor 3 0.00% gatti 3 0.00% canterbury 3 0.00% lithuania 3 0.00% krstajic 3 0.00% ltd 3 0.00% airbus 3 0.00% gillieron 3 0.00% giãi 3 0.00% lòm 3 0.00% (majority 3 0.00% (chu'1 3 0.00% aa 3 0.00% dollar 3 0.00% (email 3 0.00% (tiê1ng 3 0.00% sts 3 0.00% ma3y 3 0.00% otaku 3 0.00% la(3ng 3 0.00% tràm 3 0.00% metallidurans 3 0.00% diê5m 3 0.00% lmvntd 3 0.00% risal 3 0.00% pen 3 0.00% lu5y 3 0.00% caribbean 3 0.00% citizens 3 0.00% nu'o'1ng 3 0.00% ralstonia 3 0.00% (vô1n 3 0.00% yu 3 0.00% (0 3 0.00% 164 3 0.00% 5m 3 0.00% idaho 3 0.00% leu 3 0.00% preston 3 0.00% 1m80 3 0.00% km2 3 0.00% make 3 0.00% lucia 3 0.00% values 3 0.00% mission 3 0.00% aso 3 0.00% thau

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 116 3 0.00% quâ4n 3 0.00% francois 3 0.00% oxana 3 0.00% traditional 3 0.00% hillevi 3 0.00% dô5ng 3 0.00% ph 3 0.00% vo3n 3 0.00% zambia 3 0.00% khu’1 3 0.00% (penalty 3 0.00% inna 3 0.00% winfield 3 0.00% brian 3 0.00% boich 3 0.00% shinya 3 0.00% tê1… 3 0.00% (kê1t 3 0.00% yellowcake 3 0.00% so’n 3 0.00% gerhard 3 0.00% lia 3 0.00% nho’1 3 0.00% conner 3 0.00% sa5 3 0.00% (chi 3 0.00% ponce 3 0.00% ngu’4 3 0.00% philipp 3 0.00% christoph 3 0.00% bernd 3 0.00% (so 3 0.00% committee 3 0.00% nbc 3 0.00% (ba5n 3 0.00% patton 3 0.00% nicotine 3 0.00% chlb 3 0.00% command 3 0.00% (central 3 0.00% virgin 3 0.00% lian 3 0.00% jackson 3 0.00% boggs 3 0.00% lo3i 3 0.00% denis 3 0.00% martinez 3 0.00% panikian 3 0.00% 6m 3 0.00% nê 3 0.00% worldwide 3 0.00% worldcom

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 117 3 0.00% ke5o 3 0.00% mram 3 0.00% abidal 3 0.00% line 3 0.00% schalke 3 0.00% allawi 3 0.00% tncn 3 0.00% chô5i 3 0.00% kuranyi 3 0.00% nha3 3 0.00% bo 3 0.00% truman 3 0.00% (na 3 0.00% c5 3 0.00% gù 3 0.00% lê1ch 3 0.00% ttytdp 3 0.00% nô5m 3 0.00% chu'o'3ng 3 0.00% services 3 0.00% ossetia 3 0.00% cu'4 3 0.00% go'2 3 0.00% osirak 3 0.00% brotherhood 3 0.00% muslim 3 0.00% kcn 3 0.00% josé 3 0.00% mo'1i… 3 0.00% county 3 0.00% (indonesia 3 0.00% valley 3 0.00% thein 3 0.00% orange 3 0.00% jeff 3 0.00% porter 3 0.00% arizona 3 0.00% nhòe 3 0.00% thompson 3 0.00% be5 3 0.00% (âu 3 0.00% dna 3 0.00% vinagame 3 0.00% guimaraes 3 0.00% 'dân 3 0.00% ptth 3 0.00% at 3 0.00% xu'o'1c 3 0.00% ngo5ng 3 0.00% (a 3 0.00% xào 3 0.00% vog

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 118 3 0.00% mottaki 3 0.00% solana 3 0.00% phé 3 0.00% xx 3 0.00% 20000 3 0.00% khamenei 3 0.00% nhâ2y 3 0.00% tyre 3 0.00% (2006 3 0.00% nafta 3 0.00% nghé 3 0.00% eximbank 3 0.00% shimane 3 0.00% 197 3 0.00% brzezinski 3 0.00% 93 3 0.00% calmette 3 0.00% aleksandr 3 0.00% container 3 0.00% pravda 3 0.00% chây 3 0.00% ngô3n 3 0.00% carnegie 3 0.00% project 3 0.00% rand 3 0.00% rô1c 3 0.00% musab 3 0.00% giông 3 0.00% (afp 3 0.00% nho'4 3 0.00% jazeera 3 0.00% xi3n 3 0.00% tajikistan 3 0.00% 330 3 0.00% hosni 3 0.00% xinhuanet 3 0.00% ba(5t 3 0.00% ayman 3 0.00% gen 3 0.00% abdullah 3 0.00% ô1i 3 0.00% guardian 3 0.00% steinberg 3 0.00% xa(m 3 0.00% nghè 3 0.00% thx 3 0.00% aksu 3 0.00% hen 3 0.00% rita 3 0.00% odonkor 3 0.00% tsc 3 0.00% dfb

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 119 3 0.00% call 3 0.00% college 3 0.00% mcclaren 3 0.00% samba 3 0.00% everton 3 0.00% hydrogen 3 0.00% karlsruhe 3 0.00% pokal 3 0.00% becamex 3 0.00% liga 3 0.00% toang 3 0.00% xuýt 3 0.00% hormone 3 0.00% áy 3 0.00% (du'o'1i 3 0.00% ngót 3 0.00% champion 3 0.00% náy 3 0.00% hackett 3 0.00% ttv 3 0.00% kí 3 0.00% ujfalusi 3 0.00% power 3 0.00% gyan 3 0.00% muntari 3 0.00% huw 3 0.00% zandi 3 0.00% this 3 0.00% waynerooney 3 0.00% tu5m 3 0.00% derby 3 0.00% hannover 3 0.00% toanh 3 0.00% 1963 3 0.00% illarionov 3 0.00% (dda 3 0.00% waschtschuk 3 0.00% schowkowski 3 0.00% pope 3 0.00% muhammad 3 0.00% mastroeni 3 0.00% 400dd 3 0.00% mcbride 3 0.00% president 3 0.00% amyloid 3 0.00% beta 3 0.00% pelé 3 0.00% natri 3 0.00% powers 3 0.00% nhuyê4n 3 0.00% gómez 3 0.00% rónald

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 120 3 0.00% nga(1t 3 0.00% (ddi 3 0.00% jiegao 3 0.00% ceo 3 0.00% vê5t 3 0.00% ri4 3 0.00% nylon 3 0.00% vnvnonn 3 0.00% hu5i 3 0.00% chêm 3 0.00% lampart 3 0.00% eisenhower 3 0.00% sa3ng 3 0.00% lóng 3 0.00% princeton 3 0.00% ttn 3 0.00% merk 3 0.00% dragutinovic 3 0.00% zigic 3 0.00% dulles 3 0.00% marín 3 0.00% la5n 3 0.00% estratest 3 0.00% harvard 3 0.00% ammar 3 0.00% casey 3 0.00% nicaragua 3 0.00% martínez 3 0.00% mauricio 3 0.00% hu5c 3 0.00% solís 3 0.00% thuô5t 3 0.00% césar 3 0.00% danny 3 0.00% gonzález 3 0.00% wb 3 0.00% (h 3 0.00% nha5i 3 0.00% sequeira 3 0.00% hão 3 0.00% cáng 3 0.00% (du'5 3 0.00% ubndtp 3 0.00% (tâ1t 3 0.00% óng 3 0.00% brandenburg 3 0.00% (in 3 0.00% staline 3 0.00% 148 3 0.00% korea 3 0.00% (kiê3u 3 0.00% abraham

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 121 3 0.00% bo’2 3 0.00% ubayda 3 0.00% bo5t 3 0.00% soái 3 0.00% 112 3 0.00% (bên 3 0.00% cuô5i 3 0.00% lô2i 3 0.00% truyê1t 3 0.00% choàng 3 0.00% nhàu 3 0.00% qiantang 3 0.00% chey 3 0.00% railpartners 3 0.00% bentley 3 0.00% niger 3 0.00% xèo 3 0.00% châ2m 3 0.00% kilô 3 0.00% diê4u 3 0.00% (gô2m 3 0.00% ghosh 3 0.00% nhu'5t 3 0.00% nua 3 0.00% loren 3 0.00% radar 3 0.00% còm 3 0.00% final 3 0.00% dâ5n 3 0.00% cruz 3 0.00% pool 3 0.00% hu'3ng 3 0.00% techno 3 0.00% ross 3 0.00% lâ1t 3 0.00% qua(5t 3 0.00% ddo5an 3 0.00% museum 3 0.00% destination 3 0.00% pho5t 3 0.00% búp 3 0.00% mâ3n 3 0.00% express 3 0.00% tri5ch 3 0.00% muô2i 3 0.00% (48 3 0.00% waziristan 3 0.00% tha3ng 3 0.00% intelligence 3 0.00% sáo 3 0.00% bô5p 3 0.00% (thê1

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 122 3 0.00% kasuri 3 0.00% xém 3 0.00% 534370 3 0.00% ngo’2 3 0.00% henri 3 0.00% humvee 3 0.00% gông 3 0.00% enjoy 3 0.00% 1964 3 0.00% kiang 3 0.00% chye 3 0.00% tuông 3 0.00% nhô3m 3 0.00% cu’o’1p 3 0.00% hillary 3 0.00% (châ1t 3 0.00% seidel 3 0.00% kílô 3 0.00% spa 3 0.00% tech 3 0.00% (hans 3 0.00% 255 3 0.00% (nus 3 0.00% luâ3n 3 0.00% 256 3 0.00% quai 3 0.00% ho5at 3 0.00% basra 3 0.00% shangri 3 0.00% bremer 3 0.00% lagarde 3 0.00% ro5i 3 0.00% 128 3 0.00% lénine 3 0.00% nhâ1c 3 0.00% 1925 3 0.00% sandal 3 0.00% nus 3 0.00% no' 3 0.00% so5t 3 0.00% ho' 3 0.00% aston 3 0.00% 010 3 0.00% disney 3 0.00% nuô5t 3 0.00% vía 3 0.00% tóan 3 0.00% (nhiê2u 3 0.00% live 3 0.00% 6x 3 0.00% nhái 3 0.00% roed

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 123 3 0.00% golmud 3 0.00% 850 3 0.00% a(4m 3 0.00% petrochina 3 0.00% number 3 0.00% (27dd 3 0.00% sa3o 3 0.00% prize 3 0.00% chetta 3 0.00% (chu'4 3 0.00% driver 3 0.00% gorbatchev 3 0.00% kimbrell 3 0.00% key 3 0.00% (ireland 3 0.00% cha3o 3 0.00% mui 3 0.00% anime 3 0.00% 5x 3 0.00% 9x 3 0.00% xu’1ng 3 0.00% phuo'ng 3 0.00% 8x 3 0.00% synovations 3 0.00% bc 3 0.00% cho'1m 3 0.00% (mnc 3 0.00% vút 3 0.00% (ba(2ng 3 0.00% gt 3 0.00% bosworth 3 0.00% do'i 3 0.00% c6 3 0.00% returns 3 0.00% shafer 3 0.00% (dân 3 0.00% vánh 3 0.00% barbara 3 0.00% traurig 3 0.00% phài 3 0.00% vát 3 0.00% nê1n 3 0.00% cornwall 3 0.00% dâ2y 3 0.00% hê1n 3 0.00% journal 3 0.00% áng 3 0.00% su’o’2n 3 0.00% mu'o'5t 3 0.00% lu'5 3 0.00% santa 3 0.00% newsweek

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 124 3 0.00% iv 3 0.00% mu3i 3 0.00% ricefish 3 0.00% stillwell 3 0.00% sucacnô 3 0.00% táy 3 0.00% isbell 3 0.00% toe 3 0.00% ddu’5ng 3 0.00% du'a5 3 0.00% me5t 3 0.00% armey 3 0.00% toét 3 0.00% gutierrez 3 0.00% faith 3 0.00% 840 3 0.00% bondi 3 0.00% ghiê2n 3 0.00% atr 3 0.00% idecaf 3 0.00% ba(5m 3 0.00% chu’2ng 3 0.00% 430 3 0.00% like 3 0.00% hê 3 0.00% nha(m 3 0.00% tha(1p 3 0.00% mass 3 0.00% mit 3 0.00% vu'o'5n 3 0.00% sean 3 0.00% piero 3 0.00% exmocare 3 0.00% ghe3 3 0.00% zahar 3 0.00% haniyah 3 0.00% sà 3 0.00% toé 3 0.00% cech 3 0.00% nhóc 3 0.00% anan 3 0.00% suyê3n 3 0.00% panahi 3 0.00% liê1c 3 0.00% ngay2 3 0.00% xoa5ch 3 0.00% first 3 0.00% soul 3 0.00% hua 3 0.00% cho3 3 0.00% liveshow 3 0.00% búyt

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 125 3 0.00% dpa 3 0.00% rì 3 0.00% scandal 3 0.00% ddu’o’ng 3 0.00% (bang 3 0.00% ronaldinho 2 0.00% menatep 2 0.00% (nô3i 2 0.00% peres 2 0.00% mu’o’5n 2 0.00% na5 2 0.00% silvio 2 0.00% harbor 2 0.00% nhác 2 0.00% schwab 2 0.00% cùa 2 0.00% nagorno 2 0.00% xiu 2 0.00% binyamin 2 0.00% jihad 2 0.00% pat 2 0.00% kawai 2 0.00% 2500km 2 0.00% (pearl 2 0.00% (lu 2 0.00% rún 2 0.00% shimon 2 0.00% robertson 2 0.00% bachelor 2 0.00% ds 2 0.00% karabakh 2 0.00% (coalition 2 0.00% (khác 2 0.00% (tdv 2 0.00% schumer 2 0.00% (student 2 0.00% iyad 2 0.00% nablus 2 0.00% dth 2 0.00% phai 2 0.00% aachen 2 0.00% murtha 2 0.00% koeln 2 0.00% berlusconi 2 0.00% ddiã 2 0.00% ahmad 2 0.00% issah 2 0.00% woolsey 2 0.00% gore 2 0.00% pranab 2 0.00% webster 2 0.00% uê1

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 126 2 0.00% yudhoyono 2 0.00% (newsweek 2 0.00% colt 2 0.00% transdnistr 2 0.00% en 2 0.00% agni 2 0.00% (gia(ng 2 0.00% nhe 2 0.00% sidon 2 0.00% netayahu 2 0.00% ida 2 0.00% steve 2 0.00% mossad 2 0.00% cents 2 0.00% xo'n 2 0.00% gia5t 2 0.00% mart 2 0.00% nicholas 2 0.00% watergate 2 0.00% mccone 2 0.00% wal 2 0.00% (châm 2 0.00% 149 2 0.00% (503 2 0.00% libby 2 0.00% pastors 2 0.00% lê3 2 0.00% shekel 2 0.00% rubaie 2 0.00% 949 2 0.00% begin 2 0.00% chuâ4n 2 0.00% (khuynh 2 0.00% windows 2 0.00% chi5ch 2 0.00% kho'3i' 2 0.00% worker 2 0.00% 97307 2 0.00% saigòn 2 0.00% rong 2 0.00% (guest 2 0.00% nhoe3n 2 0.00% chiarelli 2 0.00% huynhquocbinh 2 0.00% (mâ1t 2 0.00% ptcs 2 0.00% 8752 2 0.00% (border 2 0.00% gianh 2 0.00% abdul 2 0.00% xb 2 0.00% patrol

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 127 2 0.00% nháo 2 0.00% khê4nh 2 0.00% chevron 2 0.00% 770 2 0.00% vietmatchbiz 2 0.00% khi5 2 0.00% gibson 2 0.00% thé 2 0.00% 'câ2n 2 0.00% shai 2 0.00% winn 2 0.00% oregon 2 0.00% que3 2 0.00% brent 2 0.00% dìn 2 0.00% (cia 2 0.00% rê2 2 0.00% aslan 2 0.00% nyunt 2 0.00% shwe 2 0.00% glickman 2 0.00% kontum 2 0.00% choi 2 0.00% barak 2 0.00% (bs 2 0.00% grant 2 0.00% hartert 2 0.00% riê4u 2 0.00% xu'1c 2 0.00% yong 2 0.00% dwight 2 0.00% calci 2 0.00% verdery 2 0.00% huyênh 2 0.00% bonner 2 0.00% 20361 2 0.00% 50m2 2 0.00% 'ddô2ng 2 0.00% thâ4n 2 0.00% lu'á 2 0.00% muà 2 0.00% (felony 2 0.00% tu3a 2 0.00% uttaradit 2 0.00% feeble 2 0.00% titorenko 2 0.00% cupid 2 0.00% authority 2 0.00% box 2 0.00% petronas 2 0.00% quran 2 0.00% pandora

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 128 2 0.00% darwazeh 2 0.00% marc 2 0.00% management 2 0.00% lo3n 2 0.00% mississipi 2 0.00% feinberg 2 0.00% vasell 2 0.00% (argentina 2 0.00% walsh 2 0.00% ddênh 2 0.00% xuông 2 0.00% rô2 2 0.00% lênh 2 0.00% ngu'o'3ng 2 0.00% gmail 2 0.00% zayat 2 0.00% lyn 2 0.00% cordesman 2 0.00% daklak 2 0.00% phét 2 0.00% essien 2 0.00% scorpion 2 0.00% mariana 2 0.00% chàm 2 0.00% thoã 2 0.00% (uruguay 2 0.00% government 2 0.00% perle 2 0.00% bi3nh 2 0.00% straw 2 0.00% fund 2 0.00% adam 2 0.00% signatures 2 0.00% (khoai 2 0.00% (must 2 0.00% (scotland 2 0.00% kiê1ng 2 0.00% 450 2 0.00% bellaire 2 0.00% hâ3m 2 0.00% (low 2 0.00% sâ1p 2 0.00% independent 2 0.00% fat 2 0.00% hiu 2 0.00% fries 2 0.00% sachs 2 0.00% zinni 2 0.00% (honorariums 2 0.00% allison 2 0.00% lausanne 2 0.00% emmanuel

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 129 2 0.00% voz 2 0.00% cross 2 0.00% 243 2 0.00% boulis 2 0.00% guatemala 2 0.00% simpson 2 0.00% shifter 2 0.00% wmd 2 0.00% ''iran 2 0.00% hi4nh 2 0.00% future 2 0.00% minority 2 0.00% weyrich 2 0.00% puyol 2 0.00% emerson 2 0.00% ren 2 0.00% armstrong 2 0.00% hadley 2 0.00% shandwick 2 0.00% tauzin 2 0.00% marketing 2 0.00% safa 2 0.00% elysees 2 0.00% atomic 2 0.00% georgetown 2 0.00% champs 2 0.00% missiles 2 0.00% (special 2 0.00% service 2 0.00% dobson 2 0.00% daum 2 0.00% ellis 2 0.00% majority 2 0.00% alliance 2 0.00% heu 2 0.00% eddi 2 0.00% bomb 2 0.00% nedved 2 0.00% robin 2 0.00% ncppr 2 0.00% viera 2 0.00% leader 2 0.00% asier 2 0.00% cmnd 2 0.00% republicans 2 0.00% nrcc 2 0.00% rich 2 0.00% tbn 2 0.00% schatz 2 0.00% cirincione 2 0.00% congressional 2 0.00% noble

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 130 2 0.00% leaders 2 0.00% red 2 0.00% priest 2 0.00% q4 2 0.00% lehrman 2 0.00% schlein 2 0.00% grimaldi 2 0.00% siyb 2 0.00% nabarro 2 0.00% aderholt 2 0.00% hunter 2 0.00% stanley 2 0.00% schmidt 2 0.00% gambling 2 0.00% woodward 2 0.00% ro'5n 2 0.00% prohibition 2 0.00% (lobbyist 2 0.00% sida 2 0.00% vietnam's 2 0.00% (republic 2 0.00% 222 2 0.00% azerbaijan 2 0.00% dealing 2 0.00% (plo 2 0.00% 264 2 0.00% westhead 2 0.00% nhú 2 0.00% andrei 2 0.00% acid 2 0.00% hâ5m 2 0.00% nhíu 2 0.00% pho'1i 2 0.00% su'o'5ng 2 0.00% (phía 2 0.00% nhó 2 0.00% xuý 2 0.00% tành 2 0.00% xoe5 2 0.00% toi 2 0.00% (anti 2 0.00% su’4a 2 0.00% chanh 2 0.00% keng 2 0.00% hariri 2 0.00% hám 2 0.00% thiê1c 2 0.00% est 2 0.00% xía 2 0.00% 24k 2 0.00% ro'5p 2 0.00% trát

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 131 2 0.00% schuman 2 0.00% 080 2 0.00% infos 2 0.00% môt 2 0.00% lakeshore 2 0.00% arkansas 2 0.00% su'o'1t 2 0.00% ki 2 0.00% nymex 2 0.00% chít 2 0.00% ftc 2 0.00% mukherjee 2 0.00% cutler 2 0.00% nld 2 0.00% do'5t 2 0.00% analysis 2 0.00% hót 2 0.00% nowakowski 2 0.00% mayflower 2 0.00% blog 2 0.00% vùn 2 0.00% khodorskovsky 2 0.00% loanh 2 0.00% (ít 2 0.00% rõi 2 0.00% bernanke 2 0.00% (bureau 2 0.00% rê2n 2 0.00% ngói 2 0.00% parkinson 2 0.00% mcclellan 2 0.00% tito 2 0.00% hindi 2 0.00% romano 2 0.00% 'ddâ1t 2 0.00% (engagement 2 0.00% bu'1ng 2 0.00% kiê5u 2 0.00% lu'5o'c 2 0.00% tncg 2 0.00% treaty 2 0.00% pot 2 0.00% khabab 2 0.00% phèng 2 0.00% pol 2 0.00% hô1ng 2 0.00% kyodo 2 0.00% kashmiri 2 0.00% sanskrit 2 0.00% union 2 0.00% bo’m 2 0.00% mujahedeen

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 132 2 0.00% dynamics 2 0.00% malayalam 2 0.00% russell 2 0.00% glass 2 0.00% bild 2 0.00% thu'3a 2 0.00% kanh 2 0.00% kunar 2 0.00% libbi 2 0.00% trô1i 2 0.00% dhabi 2 0.00% nghiê4m 2 0.00% bourland 2 0.00% fluor 2 0.00% mút 2 0.00% cities 2 0.00% nano 2 0.00% lo'5n 2 0.00% uae 2 0.00% phâ3y 2 0.00% dpw 2 0.00% lô3 2 0.00% kempner 2 0.00% ngô4ng 2 0.00% ahmanidejad 2 0.00% research 2 0.00% hormuz 2 0.00% dutch 2 0.00% proliferation 2 0.00% sikh 2 0.00% gheit 2 0.00% angel 2 0.00% viktor 2 0.00% biggins 2 0.00% electric 2 0.00% ewell 2 0.00% cuà 2 0.00% associated 2 0.00% phò 2 0.00% lu'a3 2 0.00% nghê4u 2 0.00% nghê5n 2 0.00% shinta 2 0.00% 172 2 0.00% (thích 2 0.00% nhuô1m 2 0.00% uruzgan 2 0.00% (bô1n 2 0.00% snow 2 0.00% (chiê1n 2 0.00% bâ5y 2 0.00% zahlé

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 133 2 0.00% pu 2 0.00% (lao 2 0.00% 536787 2 0.00% dominique 2 0.00% 6a 2 0.00% 1559 2 0.00% hê3 2 0.00% 1911 2 0.00% (gnp 2 0.00% 820 2 0.00% 345 2 0.00% roissy 2 0.00% ibm 2 0.00% 271 2 0.00% 470 2 0.00% lenovo 2 0.00% giê 2 0.00% biêp 2 0.00% ddui 2 0.00% hô5c 2 0.00% 245 2 0.00% assad 2 0.00% (jerusalem 2 0.00% dda3n 2 0.00% afganistan 2 0.00% bét 2 0.00% khênh 2 0.00% (thiên 2 0.00% ngác 2 0.00% qua(2n 2 0.00% rdx 2 0.00% khâ1n 2 0.00% lich 2 0.00% kathmandu 2 0.00% cha(1t 2 0.00% blhs 2 0.00% gu3i 2 0.00% uâ3n 2 0.00% (hê1t 2 0.00% (tuy 2 0.00% la(5c 2 0.00% xê5ch 2 0.00% vêu 2 0.00% dda(4ng 2 0.00% gô3 2 0.00% 319 2 0.00% (bà 2 0.00% gí 2 0.00% 299 2 0.00% è 2 0.00% vijay 2 0.00% pô1t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 134 2 0.00% 178 2 0.00% nambiar 2 0.00% tiê5u 2 0.00% ngu'3i 2 0.00% lahore 2 0.00% ngoáy 2 0.00% va5c 2 0.00% bernardino 2 0.00% cùi 2 0.00% khac 2 0.00% 45b 2 0.00% giô1i 2 0.00% én 2 0.00% lifetime 2 0.00% ccb 2 0.00% award 2 0.00% tbt 2 0.00% ep 2 0.00% pretoria 2 0.00% xòm 2 0.00% 295 2 0.00% (u 2 0.00% ridder 2 0.00% jackie 2 0.00% benyhe 2 0.00% knight 2 0.00% chi5t 2 0.00% sum 2 0.00% (magyar 2 0.00% cha(2ng 2 0.00% québecois 2 0.00% (09 2 0.00% (ddá 2 0.00% istanbul 2 0.00% (1966 2 0.00% schaeuble 2 0.00% gattuso 2 0.00% lippi 2 0.00% yvonne 2 0.00% ribery 2 0.00% alberta 2 0.00% ellen 2 0.00% ôstrâylia 2 0.00% (liberal 2 0.00% hunggary 2 0.00% 24h 2 0.00% lynch 2 0.00% nathaniel 2 0.00% mobi 2 0.00% jános 2 0.00% reutes 2 0.00% 236

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 135 2 0.00% toác 2 0.00% afif 2 0.00% do' 2 0.00% slater 2 0.00% xanana 2 0.00% mari 2 0.00% garzelli 2 0.00% cunego 2 0.00% uighurs 2 0.00% (hezbollah 2 0.00% choueifat 2 0.00% 300km 2 0.00% nadji 2 0.00% (zang 2 0.00% su'4ng 2 0.00% arabiya 2 0.00% taro 2 0.00% mobai 2 0.00% mo5n 2 0.00% (va(1ng 2 0.00% dessel 2 0.00% giõi 2 0.00% ê1m 2 0.00% 94 2 0.00% vùa 2 0.00% (nói 2 0.00% rát 2 0.00% rôm 2 0.00% fillet 2 0.00% vhdd 2 0.00% cyril 2 0.00% lucky 2 0.00% computer 2 0.00% xvi 2 0.00% khiá 2 0.00% ro'1m 2 0.00% 627 2 0.00% moi 2 0.00% cho' 2 0.00% (thâ5t 2 0.00% 885 2 0.00% 337 2 0.00% 733 2 0.00% giai3 2 0.00% (ho5 2 0.00% yes 2 0.00% ré 2 0.00% rennes 2 0.00% rôma 2 0.00% isreal 2 0.00% nhùng 2 0.00% studdert

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 136 2 0.00% 640 2 0.00% 390 2 0.00% examiner 2 0.00% garmser 2 0.00% oda 2 0.00% terje 2 0.00% dông 2 0.00% ngu’2a 2 0.00% 593 2 0.00% sít 2 0.00% (94 2 0.00% aipo 2 0.00% 207 2 0.00% tu'1o'ng 2 0.00% hõan 2 0.00% rockét 2 0.00% 1860 2 0.00% 111 2 0.00% 1628 2 0.00% xiêm 2 0.00% (cái 2 0.00% ðô5 2 0.00% montana 2 0.00% koh 2 0.00% 165 2 0.00% caspers 2 0.00% vncs 2 0.00% (dda(5c 2 0.00% citigroup 2 0.00% seiple 2 0.00% lddldd 2 0.00% lgw 2 0.00% nong 2 0.00% 4084d 2 0.00% naò 2 0.00% kyl 2 0.00% ta5nh 2 0.00% jon 2 0.00% cót 2 0.00% property 2 0.00% (su' 2 0.00% bo'2n 2 0.00% huo'3n 2 0.00% commission 2 0.00% (ihrc 2 0.00% bahrain 2 0.00% tru'2o'ng 2 0.00% offside 2 0.00% l6 2 0.00% by 2 0.00% sói 2 0.00% canadda

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 137 2 0.00% bô1t 2 0.00% (68 2 0.00% ình 2 0.00% chình 2 0.00% (hình 2 0.00% bu3a 2 0.00% drancy 2 0.00% gâ4y 2 0.00% lu’o’1t 2 0.00% róc 2 0.00% (ke3 2 0.00% toái 2 0.00% hy3 2 0.00% suffredini 2 0.00% lobby 2 0.00% rsf 2 0.00% autonomous 2 0.00% fdi 2 0.00% mongolia 2 0.00% tru’2ng 2 0.00% du3 2 0.00% 2550 2 0.00% khóang 2 0.00% (nghi4a 2 0.00% ca(1c 2 0.00% dongfang 2 0.00% khuy3u 2 0.00% chê2 2 0.00% luther 2 0.00% thóa 2 0.00% ngoa3i 2 0.00% cùn 2 0.00% vksnd 2 0.00% paparazzi 2 0.00% 1940 2 0.00% tri4u 2 0.00% margaret 2 0.00% su'o'3i 2 0.00% baó 2 0.00% co5p 2 0.00% váo 2 0.00% ba3 2 0.00% (1945 2 0.00% stasi 2 0.00% xinjiang 2 0.00% khâ1t 2 0.00% ngóai 2 0.00% hariyanto 2 0.00% vén 2 0.00% report 2 0.00% (south 2 0.00% kho3ang

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 138 2 0.00% hu3i 2 0.00% kilômét 2 0.00% (mu'o'2i 2 0.00% thóat 2 0.00% nom 2 0.00% norodom 2 0.00% syri 2 0.00% ho'1 2 0.00% ro'1 2 0.00% nê1t 2 0.00% 441 2 0.00% minxin 2 0.00% tri4 2 0.00% ke 2 0.00% vô1 2 0.00% yuan 2 0.00% ddu'2o'ng 2 0.00% french 2 0.00% damas 2 0.00% yung 2 0.00% aie 2 0.00% phãi 2 0.00% 540 2 0.00% 198 2 0.00% affairs 2 0.00% (70 2 0.00% eastern 2 0.00% truo'2ng 2 0.00% 740 2 0.00% people's 2 0.00% 168 2 0.00% erikson 2 0.00% mo’2i 2 0.00% asley 2 0.00% ghìm 2 0.00% larsson 2 0.00% pavel 2 0.00% vo’5 2 0.00% beenhakker 2 0.00% borgetti 2 0.00% la3 2 0.00% armando 2 0.00% giun 2 0.00% vo'5i 2 0.00% (ddoa5t 2 0.00% ho'5t 2 0.00% delgado 2 0.00% (argentinia 2 0.00% klinsman 2 0.00% pardo 2 0.00% qúi 2 0.00% bankstown

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 139 2 0.00% tru’a 2 0.00% stadt 2 0.00% horst 2 0.00% ghiggia 2 0.00% alexandre 2 0.00% australian 2 0.00% huýt 2 0.00% (ddông 2 0.00% bravo 2 0.00% guillermo 2 0.00% omar 2 0.00% du’4 2 0.00% crystal 2 0.00% (thu'o'5ng 2 0.00% joao 2 0.00% schottland 2 0.00% commerzbank 2 0.00% alfredo 2 0.00% palacio 2 0.00% impossible 2 0.00% jockey 2 0.00% vicente 2 0.00% xìu 2 0.00% karim 2 0.00% maldives 2 0.00% nenad 2 0.00% tru’o’5t 2 0.00% bernhard 2 0.00% afc 2 0.00% komlan 2 0.00% marufshonow 2 0.00% chornidow 2 0.00% global 2 0.00% (joint 2 0.00% (council 2 0.00% umana 2 0.00% xiê3ng 2 0.00% xàng 2 0.00% khoa(1t 2 0.00% (hlv 2 0.00% nha(4ng 2 0.00% núddez 2 0.00% paredes 2 0.00% la(4ng 2 0.00% downing 2 0.00% abbou 2 0.00% shi 2 0.00% sharkawy 2 0.00% habib 2 0.00% zili 2 0.00% thu3i 2 0.00% lu3i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 140 2 0.00% tunis 2 0.00% initiative 2 0.00% aussie 2 0.00% dâ1t 2 0.00% schroeder 2 0.00% (giáo 2 0.00% châ2y 2 0.00% xéo 2 0.00% su'a3 2 0.00% lu5i 2 0.00% me5o 2 0.00% obaid 2 0.00% vo’2i 2 0.00% dos 2 0.00% particular 2 0.00% countries 2 0.00% vái 2 0.00% arab 2 0.00% andrews 2 0.00% (ngân 2 0.00% anc 2 0.00% giâ2y 2 0.00% tanaka 2 0.00% hoon 2 0.00% nha3m 2 0.00% carbin 2 0.00% ishiba 2 0.00% miraucourt 2 0.00% matthews 2 0.00% uùy 2 0.00% hu’1ng 2 0.00% truo'1c 2 0.00% manouchehr 2 0.00% riyadh 2 0.00% ''mô5t 2 0.00% xô1i 2 0.00% (lee 2 0.00% desmond 2 0.00% alexander 2 0.00% lobato 2 0.00% concern 2 0.00% (08 2 0.00% m'gladbach 2 0.00% bor 2 0.00% biê1c 2 0.00% (01 2 0.00% ngo’ 2 0.00% ernst 2 0.00% (02 2 0.00% gerald 2 0.00% (heredia 2 0.00% t4

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 141 2 0.00% mesén 2 0.00% alvaro 2 0.00% salt 2 0.00% vfb 2 0.00% trê2 2 0.00% 240 2 0.00% kurt 2 0.00% (cho'i 2 0.00% mughal 2 0.00% ansar 2 0.00% nha(ng 2 0.00% toiba 2 0.00% tu'3ng 2 0.00% luo'5ng 2 0.00% ngùng 2 0.00% uss 2 0.00% mustin 2 0.00% european 2 0.00% ngo’5i 2 0.00% sâ3y 2 0.00% trây 2 0.00% steinglass 2 0.00% (phu3 2 0.00% tì 2 0.00% nhu’o’2ng 2 0.00% gnp 2 0.00% dean 2 0.00% portugal 2 0.00% viana 2 0.00% gía 2 0.00% tdd 2 0.00% lucio 2 0.00% cantalejo 2 0.00% dida 2 0.00% 279 2 0.00% huth 2 0.00% florenz 2 0.00% dduo'2ng 2 0.00% ect 2 0.00% arne 2 0.00% nhóang 2 0.00% robinson 2 0.00% piro 2 0.00% popovych 2 0.00% (spanien 2 0.00% madelein 2 0.00% hoen 2 0.00% ngo'1 2 0.00% gregory 2 0.00% dana 2 0.00% moreno 2 0.00% burdisso

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 142 2 0.00% ô1 2 0.00% byron 2 0.00% (ca3nh 2 0.00% mô5i 2 0.00% (group 2 0.00% jokers 2 0.00% yeutter 2 0.00% barshefsky 2 0.00% charlene 2 0.00% clayton 2 0.00% carla 2 0.00% kamikawa 2 0.00% steinmeier 2 0.00% internationale 2 0.00% uê3 2 0.00% boat 2 0.00% rouge 2 0.00% marcell 2 0.00% posters 2 0.00% eusebio 2 0.00% nowotny 2 0.00% lazio 2 0.00% wiltord 2 0.00% dâ5t 2 0.00% (argentinien 2 0.00% ting 2 0.00% saha 2 0.00% pays 2 0.00% nos 2 0.00% démocratie 2 0.00% illinois 2 0.00% fernando 2 0.00% laura 2 0.00% vb 2 0.00% ferreira 2 0.00% stamford 2 0.00% ferguson 2 0.00% so’5i 2 0.00% co’3i 2 0.00% hatton 2 0.00% lawrence 2 0.00% lu’2a 2 0.00% jansen 2 0.00% yaroslav 2 0.00% torsten 2 0.00% miroslav 2 0.00% (thi5 2 0.00% bastian 2 0.00% lukas 2 0.00% mascherano 2 0.00% namouchi 2 0.00% kho’i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 143 2 0.00% schewtschenko 2 0.00% ulkrain 2 0.00% jaziri 2 0.00% lõa 2 0.00% kahtani 2 0.00% aziz 2 0.00% pessin 2 0.00% europe 2 0.00% alonso 2 0.00% pace 2 0.00% pablo 2 0.00% xavi 2 0.00% torres 2 0.00% jeserski 2 0.00% assembly 2 0.00% garcia 2 0.00% shield 2 0.00% nhu'o'ng 2 0.00% be 2 0.00% (lo'2i 2 0.00% liê5ng 2 0.00% dvtncs 2 0.00% malta 2 0.00% (bch 2 0.00% bulgarien 2 0.00% gamborgno 2 0.00% boruc 2 0.00% krzynowek 2 0.00% dollars 2 0.00% bolton 2 0.00% zurawski 2 0.00% community 2 0.00% koondoola 2 0.00% sobolewski 2 0.00% dzoanh 2 0.00% keller 2 0.00% cambiasso 2 0.00% messi 2 0.00% rosetti 2 0.00% klissmann 2 0.00% ooijer 2 0.00% dirksen 2 0.00% boulahrouz 2 0.00% sneijder 2 0.00% senate 2 0.00% maxi 2 0.00% vatican 2 0.00% tevez 2 0.00% noí 2 0.00% duljaj 2 0.00% ljuboja 2 0.00% petkovic

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 144 2 0.00% pelosi 2 0.00% jurgen 2 0.00% capitol 2 0.00% nekounam 2 0.00% madanchi 2 0.00% relation 2 0.00% nosrati 2 0.00% ngu’5c 2 0.00% tho5ai 2 0.00% kaiserslautern 2 0.00% otto 2 0.00% (or 2 0.00% kolo 2 0.00% workers 2 0.00% elfenbeinkueste 2 0.00% (vb 2 0.00% meite 2 0.00% arouna 2 0.00% kaebi 2 0.00% yaya 2 0.00% are 2 0.00% bambang 2 0.00% (trú 2 0.00% dda5 2 0.00% 189 2 0.00% 11g30 2 0.00% nâ1n 2 0.00% trâ2u 2 0.00% (l 2 0.00% la(2ng 2 0.00% rose 2 0.00% stc 2 0.00% 533 2 0.00% iso 2 0.00% triê3n… 2 0.00% giâ5m 2 0.00% 500m 2 0.00% wallace 2 0.00% (so'3 2 0.00% wat 2 0.00% phnom 2 0.00% cn 2 0.00% granit 2 0.00% isabel 2 0.00% (vu'o'5t 2 0.00% 600ha 2 0.00% cm2 2 0.00% 6kg 2 0.00% penh 2 0.00% ná 2 0.00% 100ha 2 0.00% (u'1ng

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 145 2 0.00% (p 2 0.00% mót 2 0.00% 349 2 0.00% yunnan 2 0.00% furama 2 0.00% bloc 2 0.00% (nxbgd 2 0.00% thòm 2 0.00% (sgk 2 0.00% kenkyusei 2 0.00% tíu 2 0.00% (ho'n 2 0.00% nasrallah 2 0.00% jds 2 0.00% sepa 2 0.00% what 2 0.00% ran 2 0.00% 2h 2 0.00% (gio'1i 2 0.00% nâ4u 2 0.00% 14g 2 0.00% khóm 2 0.00% che3 2 0.00% zace 2 0.00% bangladesh 2 0.00% (australian 2 0.00% khiê4ng 2 0.00% d1 2 0.00% 356 2 0.00% allah 2 0.00% scholarships 2 0.00% leadership 2 0.00% do5 2 0.00% 4m2 2 0.00% ngoai 2 0.00% dô4i 2 0.00% presley 2 0.00% ddóa 2 0.00% q7 2 0.00% rap 2 0.00% dexamethasone 2 0.00% elvis 2 0.00% syed 2 0.00% albar 2 0.00% loét 2 0.00% (máy 2 0.00% ytdp 2 0.00% 680 2 0.00% ringgit 2 0.00% ro'3m 2 0.00% hummer 2 0.00% culkin

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 146 2 0.00% vanquish 2 0.00% dãn 2 0.00% range 2 0.00% sophia 2 0.00% c4 2 0.00% la5o 2 0.00% (hãng 2 0.00% xê 2 0.00% (phó 2 0.00% ngo'5m 2 0.00% macaulay 2 0.00% stockholm 2 0.00% raul 2 0.00% (phòng 2 0.00% knightley 2 0.00% keira 2 0.00% zuôi 2 0.00% jennifer 2 0.00% (tq 2 0.00% rayban 2 0.00% garner 2 0.00% jolie 2 0.00% rán 2 0.00% ru'1a 2 0.00% angkor 2 0.00% lois 2 0.00% angelina 2 0.00% catwoman 2 0.00% lane 2 0.00% ngu'o' 2 0.00% bêu 2 0.00% ethan 2 0.00% làu 2 0.00% khaled 2 0.00% (n 2 0.00% hawke 2 0.00% bâ4m 2 0.00% cu'o'5c 2 0.00% (biên 2 0.00% (tuô3i 2 0.00% mt 2 0.00% (china 2 0.00% endowment 2 0.00% peace 2 0.00% veo 2 0.00% hu4 2 0.00% (lu'u 2 0.00% 100dd 2 0.00% 522 2 0.00% hsbc 2 0.00% sekong 2 0.00% tmcp

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 147 2 0.00% (thi 2 0.00% 1m77 2 0.00% tambuiforjudge 2 0.00% (là 2 0.00% 250m 2 0.00% (ttxvn 2 0.00% (lhq 2 0.00% 1m74 2 0.00% (mo'1i 2 0.00% (vinaconex 2 0.00% xiii 2 0.00% biê1n… 2 0.00% yersin 2 0.00% (petrolimex 2 0.00% biê2n 2 0.00% y2 2 0.00% lu'a 2 0.00% 9g 2 0.00% chartered 2 0.00% standard 2 0.00% bi5… 2 0.00% vat 2 0.00% 591 2 0.00% ita 2 0.00% ddòng 2 0.00% motors 2 0.00% ra3o 2 0.00% tè 2 0.00% uông 2 0.00% 191 2 0.00% electra 2 0.00% trimquest 2 0.00% xán 2 0.00% shore 2 0.00% magazine 2 0.00% bâ5p 2 0.00% technologies 2 0.00% access 2 0.00% karnataka 2 0.00% daimler 2 0.00% 275 2 0.00% opera 2 0.00% 14h 2 0.00% 400m2 2 0.00% massachusetts 2 0.00% ubqg 2 0.00% island 2 0.00% lance 2 0.00% westminster 2 0.00% johnny 2 0.00% turkey 2 0.00% tskh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 148 2 0.00% turks 2 0.00% caicos 2 0.00% fountain 2 0.00% 721 2 0.00% (hindustan 2 0.00% tictack 2 0.00% inch 2 0.00% borneo 2 0.00% barem 2 0.00% monoxide 2 0.00% armitage 2 0.00% sumatra 2 0.00% konda 2 0.00% nguyê3n 2 0.00% eureka 2 0.00% thuâ1n 2 0.00% helen 2 0.00% anakonda 2 0.00% panama 2 0.00% (100 2 0.00% 20kg 2 0.00% (quê 2 0.00% (vietnam 2 0.00% ddê3… 2 0.00% nghét 2 0.00% (huynh 2 0.00% carol 2 0.00% gu 2 0.00% (bâ1t 2 0.00% paloma 2 0.00% (co 2 0.00% vâ3y 2 0.00% slovak 2 0.00% elcb 2 0.00% rccb 2 0.00% vincent 2 0.00% central 2 0.00% (1901 2 0.00% view 2 0.00% over 2 0.00% (nghê5 2 0.00% (igo 2 0.00% 05' 2 0.00% nhâ3m 2 0.00% (japan 2 0.00% chô5p 2 0.00% ix 2 0.00% disko 2 0.00% cent 2 0.00% mars 2 0.00% khê2u 2 0.00% jovp

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 149 2 0.00% 8h 2 0.00% nature 2 0.00% (1975 2 0.00% pictures 2 0.00% (1954 2 0.00% (ma5nh 2 0.00% 132 2 0.00% apasra 2 0.00% qtkd 2 0.00% (kcn 2 0.00% yolande 2 0.00% 22h30 2 0.00% alô 2 0.00% studios 2 0.00% 8m 2 0.00% (saigon 2 0.00% (gâ2n 2 0.00% albania 2 0.00% giu'o'ng 2 0.00% british 2 0.00% fu 2 0.00% 622 2 0.00% u'o'1p 2 0.00% (cu5c 2 0.00% dracula 2 0.00% frankenstein 2 0.00% thon 2 0.00% phu3i 2 0.00% 000m2 2 0.00% thvn 2 0.00% ddhsp 2 0.00% monica 2 0.00% híp 2 0.00% nhúc 2 0.00% cafe 2 0.00% kiê3ng 2 0.00% 1kg 2 0.00% lo5 2 0.00% shizuoka 2 0.00% vun 2 0.00% titanic 2 0.00% enzym 2 0.00% bellucci 2 0.00% vff 2 0.00% salon 2 0.00% nhâ1m 2 0.00% to'5n 2 0.00% qua(1t 2 0.00% 13g30 2 0.00% 18g 2 0.00% ba(4ng 2 0.00% òa

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 150 2 0.00% vi5n 2 0.00% niu 2 0.00% khuy2nh 2 0.00% 7cm 2 0.00% 15dd 2 0.00% 20km 2 0.00% thinh 2 0.00% 626 2 0.00% ounce 2 0.00% ta(2m 2 0.00% rú 2 0.00% mogan 2 0.00% 609 2 0.00% 666 2 0.00% kasim 2 0.00% gâ2u 2 0.00% berger 2 0.00% jogjakarta 2 0.00% khoa3i 2 0.00% xâ3u 2 0.00% 659 2 0.00% bk 2 0.00% 184 2 0.00% (cuô5c 2 0.00% bkaa12345 2 0.00% tndk 2 0.00% ngáy 2 0.00% adelaide 2 0.00% (fbi 2 0.00% technology 2 0.00% jersey 2 0.00% reith 2 0.00% petersen 2 0.00% brosnan 2 0.00% pierce 2 0.00% tennis 2 0.00% lindsey 2 0.00% landing 2 0.00% 4m 2 0.00% perfect 2 0.00% endeavour 2 0.00% tango 2 0.00% building 2 0.00% ho'1p 2 0.00% telemundo 2 0.00% maggie 2 0.00% queensland 2 0.00% amelia 2 0.00% b727 2 0.00% vega 2 0.00% beethoven 2 0.00% yamanaka

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 151 2 0.00% (tác 2 0.00% heineken 2 0.00% vidéo 2 0.00% matech 2 0.00% chi3n 2 0.00% (ljworld 2 0.00% vaa 2 0.00% 1935 2 0.00% hddqt 2 0.00% ô2m 2 0.00% liê1m 2 0.00% uô5t 2 0.00% mách 2 0.00% ddô4i 2 0.00% 18001567 2 0.00% amazon 2 0.00% nép 2 0.00% xo'i 2 0.00% (vai 2 0.00% blues 2 0.00% (iss 2 0.00% adventures 2 0.00% si 2 0.00% ríu 2 0.00% gamble 2 0.00% procter 2 0.00% packard 2 0.00% mi3 2 0.00% 5km 2 0.00% 159 2 0.00% valse 2 0.00% hewlett 2 0.00% stb 2 0.00% herti 2 0.00% èo 2 0.00% tiê2u 2 0.00% mu'o'2ng 2 0.00% whirlpool 2 0.00% (sacombank 2 0.00% 1a 2 0.00% (ngu5 2 0.00% tnt 2 0.00% (lâ1y 2 0.00% 10g 2 0.00% dj 2 0.00% ivanov 2 0.00% (hbsag 2 0.00% electronics 2 0.00% (ty3 2 0.00% nesta 2 0.00% smart 2 0.00% nikolai

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 152 2 0.00% 225 2 0.00% stress 2 0.00% nê5n 2 0.00% fujiwara 2 0.00% (kích 2 0.00% seiko 2 0.00% võng 2 0.00% (jetro 2 0.00% ddúa 2 0.00% xima(ng 2 0.00% nhún 2 0.00% grozny 2 0.00% clorua 2 0.00% (fsb 2 0.00% protein 2 0.00% reyes 2 0.00% lada 2 0.00% (bhyt 2 0.00% cornell 2 0.00% take 2 0.00% off 2 0.00% n17a 2 0.00% (giám 2 0.00% jba 2 0.00% cbt 2 0.00% brindani 2 0.00% 20m 2 0.00% rover 2 0.00% watts 2 0.00% nude 2 0.00% lamborghini 2 0.00% pha3ng 2 0.00% nghêu 2 0.00% vy4 2 0.00% utah 2 0.00% vilnius 2 0.00% philippineses 2 0.00% scid 2 0.00% hyundai 2 0.00% mg 2 0.00% (ddúng 2 0.00% unicef 2 0.00% hanh 2 0.00% (singapore 2 0.00% software 2 0.00% india 2 0.00% criminal 2 0.00% mói 2 0.00% 911 2 0.00% berkeley 2 0.00% jatusripitak 2 0.00% guevara

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 153 2 0.00% (cpc 2 0.00% dzhokhar 2 0.00% dagestan 2 0.00% 200dd 2 0.00% casino 2 0.00% gruzia 2 0.00% chuck 2 0.00% eirik 2 0.00% shawn 2 0.00% ta3o 2 0.00% beslan 2 0.00% hagel 2 0.00% kerry 2 0.00% dudayev 2 0.00% ne5t 2 0.00% bqldaddtltddqt 2 0.00% ngâ3m 2 0.00% pa3 2 0.00% cu'2 2 0.00% to5t 2 0.00% (23dd 2 0.00% axit 2 0.00% roàng 2 0.00% yêng 2 0.00% (sóc 2 0.00% (xuâ1t 2 0.00% tâ3n 2 0.00% chùm 2 0.00% miê2ng 2 0.00% trê5t 2 0.00% glenne 2 0.00% shopping 2 0.00% (170 2 0.00% masafumi 2 0.00% sanai 2 0.00% (grand 2 0.00% katsui 2 0.00% mitsuo 2 0.00% satoh 2 0.00% ùa 2 0.00% daido 2 0.00% thó 2 0.00% camp 2 0.00% taku 2 0.00% mi5ch 2 0.00% moriyama 2 0.00% nhe5p 2 0.00% nhi3nh 2 0.00% tuaàn 2 0.00% jamaica 2 0.00% dupree 2 0.00% (ts

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 154 2 0.00% mo'n 2 0.00% mo'3n 2 0.00% 258 2 0.00% 273 2 0.00% munoa 2 0.00% buckley 2 0.00% (photography 2 0.00% (graphics 2 0.00% xe5p 2 0.00% (47 2 0.00% huntington 2 0.00% (malaysia 2 0.00% zvonareva 2 0.00% security 2 0.00% (al 2 0.00% babylift 2 0.00% phone 2 0.00% cell 2 0.00% nguyen 2 0.00% rebibbia 2 0.00% xoàng 2 0.00% type 2 0.00% tèo 2 0.00% ri3a 2 0.00% ho'1t 2 0.00% (vòng 2 0.00% link 2 0.00% be5n 2 0.00% ……………………… 2 0.00% (ddiê5n 2 0.00% mu'3a 2 0.00% passport 2 0.00% fashion 2 0.00% ………………… 2 0.00% riê2ng 2 0.00% oliveira 2 0.00% so'3i 2 0.00% (tt 2 0.00% mode 2 0.00% danaan 2 0.00% grai 2 0.00% area 2 0.00% khatoco 2 0.00% tun 2 0.00% 11h 2 0.00% kilner 2 0.00% coming 2 0.00% gabriel 2 0.00% khi3nh 2 0.00% yonhap 2 0.00% (1968 2 0.00% ae

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 155 2 0.00% (va(n 2 0.00% kháu 2 0.00% hajduk 2 0.00% lh 2 0.00% mendoza 2 0.00% uk 2 0.00% rlc 2 0.00% split 2 0.00% donadoni 2 0.00% flatley 2 0.00% dominguez 2 0.00% oxford 2 0.00% nhè 2 0.00% …………… 2 0.00% valerie 2 0.00% xê1ch 2 0.00% hssv 2 0.00% 110mmol 2 0.00% (gâ1p 2 0.00% (bn 2 0.00% tuy5 2 0.00% slam 2 0.00% ngoài… 2 0.00% 52cm 2 0.00% (hoàng 2 0.00% u'1 2 0.00% ga5n 2 0.00% (ngo5c 2 0.00% 214 2 0.00% hu'5u 2 0.00% (lê 2 0.00% (phú 2 0.00% tabare 2 0.00% 34cm 2 0.00% ajax 2 0.00% (smddh 2 0.00% sui 2 0.00% lcd 2 0.00% (ks 2 0.00% fremont 2 0.00% nhành 2 0.00% guy 2 0.00% (nô5i 2 0.00% (ta5m 2 0.00% htv9 2 0.00% nicolas 2 0.00% kha3nh 2 0.00% ritchie 2 0.00% phomát 2 0.00% (philippines 2 0.00% vtv3 2 0.00% duarte

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 156 2 0.00% típ 2 0.00% daryl 2 0.00% tho'1 2 0.00% johan 2 0.00% tania 2 0.00% missouri 2 0.00% ailen 2 0.00% quýt 2 0.00% cu'5a 2 0.00% (37 2 0.00% nành 2 0.00% xoa(n 2 0.00% luz 2 0.00% (ký 2 0.00% (trâ2n 2 0.00% marina 2 0.00% school 2 0.00% zuluaga 2 0.00% nga(1c 2 0.00% loew 2 0.00% zuydendorp 2 0.00% phoenix 2 0.00% (tri5 2 0.00% serena 2 0.00% (clb 2 0.00% mâ5n 2 0.00% lddbddvn 2 0.00% tmncsg 2 0.00% sucre 2 0.00% pete 2 0.00% tsymbalyuk 2 0.00% nha3u 2 0.00% trê 2 0.00% potassium 2 0.00% davila 2 0.00% hói 2 0.00% 147 2 0.00% vip 2 0.00% fellowship 2 0.00% (fluid 2 0.00% judah 2 0.00% jinan 2 0.00% nè 2 0.00% oakland 2 0.00% re3ng 2 0.00% litvinova 2 0.00% pomina 2 0.00% 62m 2 0.00% (1994 2 0.00% glasgow 2 0.00% li3nh 2 0.00% mechanics

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 157 2 0.00% xiê1c 2 0.00% 55m 2 0.00% thiu 2 0.00% (lâ2n 2 0.00% (hàn 2 0.00% slaven 2 0.00% prông 2 0.00% oak 2 0.00% gàu 2 0.00% no'm 2 0.00% sudnicka 2 0.00% canadian 2 0.00% pjico 2 0.00% working 2 0.00% uc 2 0.00% srebotnik 2 0.00% starkville 2 0.00% mikado 2 0.00% premiership 2 0.00% auction 2 0.00% assist 2 0.00% charm 2 0.00% 21h 2 0.00% josephine 2 0.00% alhanko 2 0.00% (di 2 0.00% ilearn 2 0.00% bô 2 0.00% vtv4 2 0.00% osathanond 2 0.00% ana 2 0.00% susilo 2 0.00% commerce 2 0.00% morientes 2 0.00% (thô3 2 0.00% pauly 2 0.00% esso 2 0.00% iev 2 0.00% engineering 2 0.00% dda(2m 2 0.00% aurelio 2 0.00% sarayoot 2 0.00% váp 2 0.00% auctionassist 2 0.00% frazier 2 0.00% cyprus 2 0.00% soler 2 0.00% bdd 2 0.00% sona 2 0.00% park 2 0.00% (iaea 2 0.00% 15g

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 158 2 0.00% dhea 2 0.00% nho'2n 2 0.00% francys 2 0.00% 15h 2 0.00% ca5m 2 0.00% (1993 2 0.00% kha3n 2 0.00% 335 2 0.00% 439 2 0.00% aceh 2 0.00% (gia3ng 2 0.00% (kê3 2 0.00% gam 2 0.00% arevalos 2 0.00% (ti3nh 2 0.00% (202 2 0.00% kenai 2 0.00% móm 2 0.00% atlanta 2 0.00% (1988 2 0.00% 000usd 2 0.00% (tính 2 0.00% phôn 2 0.00% fsh 2 0.00% orleans 2 0.00% khâ1p 2 0.00% lourdes 2 0.00% keith 2 0.00% ki4 1 0.00% warfare 1 0.00% webofficenow 1 0.00% generational 1 0.00% micrsoft 1 0.00% (extremist 1 0.00% temyat 1 0.00% secretary 1 0.00% (misdemeanor 1 0.00% b17 1 0.00% back 1 0.00% running 1 0.00% (lu'o'5c 1 0.00% anything 1 0.00% paletta 1 0.00% cnn's 1 0.00% khariri 1 0.00% mcnamara 1 0.00% câ2u… 1 0.00% delong 1 0.00% dô 1 0.00% 300dd 1 0.00% done 1 0.00% mclellan

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 159 1 0.00% come 1 0.00% ideology 1 0.00% franks 1 0.00% noor 1 0.00% morning 1 0.00% soft 1 0.00% tommy 1 0.00% zied 1 0.00% corera 1 0.00% (people's 1 0.00% schriver 1 0.00% aribia 1 0.00% terror 1 0.00% (su'5 1 0.00% hizb 1 0.00% remains 1 0.00% ut 1 0.00% threat 1 0.00% wenyi 1 0.00% caliphate 1 0.00% seatle 1 0.00% elane 1 0.00% tahrir 1 0.00% you're 1 0.00% (triê5u 1 0.00% hi5ên 1 0.00% (wang 1 0.00% b19 1 0.00% sa(3n 1 0.00% (trô2ng 1 0.00% yaser 1 0.00% qeada 1 0.00% fourth 1 0.00% harassing 1 0.00% taiwan 1 0.00% jihads 1 0.00% threatening 1 0.00% intimidating 1 0.00% willfully 1 0.00% bellamy 1 0.00% coercing 1 0.00% dâ5m 1 0.00% gonzalez 1 0.00% republic 1 0.00% kolbe 1 0.00% span 1 0.00% 800dd 1 0.00% sami 1 0.00% (australien 1 0.00% macdill 1 0.00% tru'o'1c… 1 0.00% (centcom

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 160 1 0.00% cô2 1 0.00% official 1 0.00% kranjcar 1 0.00% xi5a 1 0.00% (wb 1 0.00% zlatko 1 0.00% sidney 1 0.00% hoyt 1 0.00% vanderberd 1 0.00% blance 1 0.00% souers 1 0.00% nsa 1 0.00% kept 1 0.00% secrets 1 0.00% clancy 1 0.00% (cu5 1 0.00% oss 1 0.00% 1941 1 0.00% (director 1 0.00% rohland 1 0.00% klaus 1 0.00% stansfield 1 0.00% ngu'4a 1 0.00% schlesinger 1 0.00% colby 1 0.00% aljosa 1 0.00% seagate 1 0.00% (tiê1n 1 0.00% turner 1 0.00% asanovic 1 0.00% testosterone 1 0.00% telegraph 1 0.00% (bay 1 0.00% roscoe 1 0.00% hillenkoetter 1 0.00% pigs 1 0.00% raborn 1 0.00% vienna 1 0.00% (áo 1 0.00% lyndon 1 0.00% buende 1 0.00% (na(1ng 1 0.00% janice 1 0.00% (la5nh 1 0.00% guest 1 0.00% kephart 1 0.00% xoang 1 0.00% 8000 1 0.00% stewart 1 0.00% (gió 1 0.00% 20g 1 0.00% duck

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 161 1 0.00% (cách 1 0.00% test 1 0.00% lame 1 0.00% (rumani 1 0.00% emila 1 0.00% 30g 1 0.00% lu'õng 1 0.00% (vgsv 1 0.00% holsen 1 0.00% (agency 1 0.00% tamiflu 1 0.00% (muô1i 1 0.00% dpb 1 0.00% briefing 1 0.00% grisham 1 0.00% (daily 1 0.00% presidential 1 0.00% ahle 1 0.00% studies 1 0.00% (bush 1 0.00% krikorian 1 0.00% immigration 1 0.00% gu'2 1 0.00% sessions 1 0.00% ra(2n 1 0.00% (nu'o'1c 1 0.00% buender 1 0.00% troáng 1 0.00% (dâ2u 1 0.00% (crude 1 0.00% (refining 1 0.00% mtbe 1 0.00% andorra 1 0.00% nghieäp 1 0.00% maùi 1 0.00% hoaøng 1 0.00% vòt 1 0.00% lockyer 1 0.00% (cúp 1 0.00% (ftc 1 0.00% bosacki 1 0.00% lewandowski 1 0.00% (tsc 1 0.00% zewlakow 1 0.00% toserco 1 0.00% bak 1 0.00% (gdp 1 0.00% wanderers 1 0.00% pickup 1 0.00% truck 1 0.00% loaïi 1 0.00% jaber

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 162 1 0.00% shah 1 0.00% cánh… 1 0.00% teery 1 0.00% cars 1 0.00% naøy 1 0.00% (ñ 1 0.00% coâng 1 0.00% (tuy2 1 0.00% tröôùc 1 0.00% (bù 1 0.00% (hybrid 1 0.00% halliburton 1 0.00% arabi 1 0.00% federal 1 0.00% goal 1 0.00% smolarek 1 0.00% prevention 1 0.00% (90 1 0.00% estimate 1 0.00% nie 1 0.00% tuâ1 1 0.00% macedonia 1 0.00% jelen 1 0.00% terrorism 1 0.00% kohl 1 0.00% livorno 1 0.00% deutch 1 0.00% chê3nh 1 0.00% hitz 1 0.00% apb 1 0.00% (75 1 0.00% (it's 1 0.00% (intelligence 1 0.00% kilduff 1 0.00% fimat 1 0.00% (nhiên 1 0.00% (apb 1 0.00% radomski 1 0.00% brewery 1 0.00% mobil 1 0.00% mercantile 1 0.00% exchange 1 0.00% estonia 1 0.00% szymkowiak 1 0.00% gouging 1 0.00% nou 1 0.00% (price 1 0.00% (strategic 1 0.00% allan 1 0.00% hubbard 1 0.00% petroleum 1 0.00% reserve

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 163 1 0.00% sergio 1 0.00% (abscam 1 0.00% vasco 1 0.00% (1984 1 0.00% pernía 1 0.00% flake 1 0.00% (mã 1 0.00% tunesia 1 0.00% casillas 1 0.00% gama 1 0.00% so'1t 1 0.00% sul 1 0.00% columbus 1 0.00% tcpv 1 0.00% (break 1 0.00% op 1 0.00% senna 1 0.00% corinthians 1 0.00% guy4 1 0.00% (metastasized 1 0.00% (pr 1 0.00% luo'1i 1 0.00% syndrome 1 0.00% infant 1 0.00% death 1 0.00% calcium 1 0.00% larrionda 1 0.00% diet 1 0.00% vmc 1 0.00% 46000 1 0.00% (knowledge 1 0.00% o5c 1 0.00% ulkrine 1 0.00% arabien 1 0.00% medicare 1 0.00% (sudden 1 0.00% (1987 1 0.00% sids 1 0.00% ayef 1 0.00% chu'n 1 0.00% 637 1 0.00% rio 1 0.00% grande 1 0.00% nguâ3y 1 0.00% chô3ng 1 0.00% (non 1 0.00% (fast 1 0.00% breeder 1 0.00% dehli 1 0.00% (55 1 0.00% rê1 1 0.00% michelle

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 164 1 0.00% russol 1 0.00% (chile 1 0.00% joachim 1 0.00% court 1 0.00% haiti 1 0.00% gringo 1 0.00% (mexico 1 0.00% bachelet 1 0.00% (hoà 1 0.00% ernest 1 0.00% hollins 1 0.00% valeo 1 0.00% fabregas 1 0.00% versus 1 0.00% strom 1 0.00% thurmond 1 0.00% (77 1 0.00% (1974 1 0.00% nghê2… 1 0.00% (containment 1 0.00% brewster 1 0.00% rân 1 0.00% reactor 1 0.00% canberra 1 0.00% share 1 0.00% (tcpv 1 0.00% (fair 1 0.00% maurice 1 0.00% stans 1 0.00% mariannas 1 0.00% iwata 1 0.00% skyboxes 1 0.00% unilever 1 0.00% cola 1 0.00% (435 1 0.00% (incumbents 1 0.00% (lobbyists 1 0.00% coca 1 0.00% zaccardo 1 0.00% ngúm 1 0.00% golden 1 0.00% deficit 1 0.00% merrill 1 0.00% (budget 1 0.00% rutgers 1 0.00% dentsu 1 0.00% suót 1 0.00% bates 1 0.00% cherundolo 1 0.00% vinagimex 1 0.00% (naintraco 1 0.00% (blatant

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 165 1 0.00% amoah 1 0.00% buchanan 1 0.00% sibelius 1 0.00% potomac 1 0.00% (staffer 1 0.00% shiled 1 0.00% neil 1 0.00% noam 1 0.00% levey 1 0.00% tgdd 1 0.00% (challengers 1 0.00% (vaa 1 0.00% groups 1 0.00% strussion 1 0.00% thuo'3ng 1 0.00% roche 1 0.00% interests 1 0.00% spending 1 0.00% convey 1 0.00% dempsey 1 0.00% simon 1 0.00% nasdaq 1 0.00% 1929 1 0.00% flamingo 1 0.00% las 1 0.00% reyna 1 0.00% goldsun 1 0.00% roosevelt 1 0.00% domestic 1 0.00% product 1 0.00% (gross 1 0.00% jorge 1 0.00% mandel 1 0.00% donovan 1 0.00% stormeye 1 0.00% confidence 1 0.00% (index 1 0.00% (consumer 1 0.00% (gô5p 1 0.00% arthur 1 0.00% (misery 1 0.00% (jobless 1 0.00% recovery 1 0.00% jubilo 1 0.00% advertising 1 0.00% (opec 1 0.00% okun 1 0.00% onyewu 1 0.00% bocanegra 1 0.00% bell 1 0.00% siegel 1 0.00% vegas

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 166 1 0.00% bugsy 1 0.00% (software 1 0.00% businessweek 1 0.00% landefeld 1 0.00% labs 1 0.00% transistor 1 0.00% example 1 0.00% jaidi 1 0.00% rare 1 0.00% 564 1 0.00% (it 1 0.00% exist 1 0.00% 209 1 0.00% but 1 0.00% trabelsi 1 0.00% boumnijel 1 0.00% 1660 1 0.00% charle 1 0.00% 1852 1 0.00% botany 1 0.00% 1788 1 0.00% denmark 1 0.00% ttptqdd 1 0.00% einstein 1 0.00% nazi 1 0.00% rô1n 1 0.00% immigrant 1 0.00% rusol 1 0.00% (tddc 1 0.00% arlen 1 0.00% specter 1 0.00% cáu 1 0.00% (schweiz 1 0.00% 530 1 0.00% venezelua 1 0.00% xoi 1 0.00% tancredo 1 0.00% welfare 1 0.00% (47' 1 0.00% 041 1 0.00% tunesien 1 0.00% denver 1 0.00% (arizona 1 0.00% angles 1 0.00% luxemburgo 1 0.00% border 1 0.00% (welsh 1 0.00% 1921 1 0.00% sensenbrenner 1 0.00% dokhi 1 0.00% khuân 1 0.00% 1920

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 167 1 0.00% offense 1 0.00% mahony 1 0.00% zaid 1 0.00% (pha5t 1 0.00% (civil 1 0.00% (property 1 0.00% (khách 1 0.00% truân 1 0.00% centcom 1 0.00% teixeira 1 0.00% montashari 1 0.00% fallatah 1 0.00% sulimani 1 0.00% (ghettos 1 0.00% ghamdi 1 0.00% mnari 1 0.00% 538 1 0.00% bouazizi 1 0.00% dornan 1 0.00% villaraigosa 1 0.00% cancun 1 0.00% haggui 1 0.00% jemmali 1 0.00% 568 1 0.00% vincente 1 0.00% vanderlei 1 0.00% (dám 1 0.00% (progressive 1 0.00% chikhaoui 1 0.00% 2014 1 0.00% 469 1 0.00% (ddiê3n 1 0.00% chedli 1 0.00% (amnesty 1 0.00% ba(1tddâ2u 1 0.00% jeddah 1 0.00% xa3n 1 0.00% abullah 1 0.00% emaar 1 0.00% khuê1ch 1 0.00% timoschuk 1 0.00% 364 1 0.00% gussew 1 0.00% (iata 1 0.00% bechtel 1 0.00% xe3n 1 0.00% rabigh 1 0.00% cayman 1 0.00% (dda(5t 1 0.00% rotan 1 0.00% khuyê1ch 1 0.00% schelajew

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 168 1 0.00% broadband 1 0.00% aramco 1 0.00% bourband 1 0.00% saud 1 0.00% faisal 1 0.00% khameini 1 0.00% (incompetent 1 0.00% perionyx 1 0.00% ddô1p 1 0.00% (uae 1 0.00% sx 1 0.00% (accusations 1 0.00% abul 1 0.00% worobej 1 0.00% petrodollar 1 0.00% airways 1 0.00% fibrin 1 0.00% a380 1 0.00% pew 1 0.00% excavatus 1 0.00% ipsos 1 0.00% gussin 1 0.00% nesmatschni 1 0.00% gtvt 1 0.00% airport 1 0.00% feet 1 0.00% consultants 1 0.00% burj 1 0.00% woronin 1 0.00% essex 1 0.00% hemsley 1 0.00% jumeirah 1 0.00% rebrow 1 0.00% csx 1 0.00% (ttptqdd 1 0.00% phâ4m 1 0.00% (stagnation 1 0.00% (inflation 1 0.00% busacca 1 0.00% oriental 1 0.00% naco 1 0.00% peninsular 1 0.00% autuori 1 0.00% (dpw 1 0.00% holding 1 0.00% wind 1 0.00% orascom 1 0.00% hotels 1 0.00% resorts 1 0.00% da3 1 0.00% pfc 1 0.00% financial

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 169 1 0.00% brad 1 0.00% (ca3ng 1 0.00% madame 1 0.00% tussauds 1 0.00% netherlands 1 0.00% sir 1 0.00% drake 1 0.00% fairmont 1 0.00% (64 1 0.00% talal 1 0.00% daimlerchrysler 1 0.00% alwaleed 1 0.00% prasad 1 0.00% routray 1 0.00% bibhu 1 0.00% 'chúng 1 0.00% ngu'o'i' 1 0.00% malegaon 1 0.00% maharashtran 1 0.00% aurangabad 1 0.00% maharashtra 1 0.00% mickey 1 0.00% varanasi 1 0.00% bobadila 1 0.00% nâ4ng 1 0.00% islamic 1 0.00% movemement 1 0.00% ajay 1 0.00% sahni 1 0.00% hyderabad 1 0.00% bangalore 1 0.00% aldo 1 0.00% fitzpatrick 1 0.00% taepodong 1 0.00% hawai 1 0.00% churchgate 1 0.00% teapodong 1 0.00% ttvnol 1 0.00% muabanraovat 1 0.00% zi 1 0.00% gìu'4 1 0.00% forum 1 0.00% mahal 1 0.00% bharatiya 1 0.00% taj 1 0.00% garfield 1 0.00% (taliban 1 0.00% umar 1 0.00% phím 1 0.00% gujarat 1 0.00% janata 1 0.00% (bjp

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 170 1 0.00% (let 1 0.00% commonwealth 1 0.00% mât 1 0.00% diê3n 1 0.00% penang 1 0.00% lúc15h00 1 0.00% u'u'1u 1 0.00% bavìere 1 0.00% 15000 1 0.00% dotrmund 1 0.00% (qtkd 1 0.00% (34t 1 0.00% triê4n 1 0.00% freier 1 0.00% fabian 1 0.00% 101 1 0.00% (feer 1 0.00% (tncn 1 0.00% hwang 1 0.00% bi4nh 1 0.00% balbina 1 0.00% gerard 1 0.00% ddô1ì 1 0.00% swen 1 0.00% konstantin 1 0.00% goeren 1 0.00% srinagar 1 0.00% malaysia(a3nh 1 0.00% perry 1 0.00% tóp 1 0.00% alex 1 0.00% baseyev 1 0.00% (co'4 1 0.00% chenchnya 1 0.00% huo'1ng 1 0.00% grad 1 0.00% bekham 1 0.00% tf1 1 0.00% saniora 1 0.00% (phong 1 0.00% fuad 1 0.00% vinhempich 1 0.00% scotland 1 0.00% wilkinson 1 0.00% ill 1 0.00% bonet 1 0.00% (189 1 0.00% cáceres 1 0.00% tagesspiegel 1 0.00% toledo 1 0.00% uhrlau 1 0.00% mbeki

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 171 1 0.00% cuevas 1 0.00% thabo 1 0.00% bhumibol 1 0.00% acudda 1 0.00% whitlock 1 0.00% lybia 1 0.00% (sepa 1 0.00% sadec 1 0.00% congo 1 0.00% nawaf 1 0.00% assessment 1 0.00% ngùn 1 0.00% mustafa 1 0.00% alani 1 0.00% diyala 1 0.00% qu3y 1 0.00% pound 1 0.00% (li 1 0.00% ngòm 1 0.00% caniza 1 0.00% 976 1 0.00% sunna 1 0.00% guido 1 0.00% (xinhuanet 1 0.00% lat 1 0.00% ngu5t 1 0.00% shiites 1 0.00% arav 1 0.00% bobadilla 1 0.00% sinopec 1 0.00% paramay 1 0.00% d70s 1 0.00% (bo5n 1 0.00% olympus 1 0.00% justo 1 0.00% lukin 1 0.00% nikon 1 0.00% hessen 1 0.00% koch 1 0.00% roland 1 0.00% 28t 1 0.00% caucase 1 0.00% shura 1 0.00% yokosuka 1 0.00% mujahideen 1 0.00% 699 1 0.00% e300 1 0.00% toitim 1 0.00% serguei 1 0.00% lavrov 1 0.00% venezeula 1 0.00% uo'1c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 172 1 0.00% riveros 1 0.00% litani 1 0.00% karzai 1 0.00% adulyadej 1 0.00% dias 1 0.00% euardo 1 0.00% vejjajiva 1 0.00% 760 1 0.00% 48000 1 0.00% franfurt 1 0.00% eltsin 1 0.00% phê1t 1 0.00% vogvn 1 0.00% (mexiko 1 0.00% novosti 1 0.00% kazhakstan 1 0.00% 7m 1 0.00% (sco 1 0.00% owomoyela 1 0.00% ottmar 1 0.00% showmaster 1 0.00% (1978 1 0.00% bern 1 0.00% eckel 1 0.00% allback 1 0.00% sa5ng 1 0.00% stoiber 1 0.00% gottschalk 1 0.00% edmund 1 0.00% claudia 1 0.00% schiffer 1 0.00% main 1 0.00% (bâ1m 1 0.00% (1930 1 0.00% urus 1 0.00% haessler 1 0.00% huflit 1 0.00% pele 1 0.00% alcides 1 0.00% maidin 1 0.00% athen 1 0.00% aek 1 0.00% mành 1 0.00% saprissa 1 0.00% paolo 1 0.00% 29t 1 0.00% calcio 1 0.00% shamsul 1 0.00% brescia 1 0.00% und 1 0.00% guaatemala 1 0.00% mexiko

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 173 1 0.00% tru'1 1 0.00% ddâ5t 1 0.00% glenn 1 0.00% (deportivo 1 0.00% hislop 1 0.00% riaca 1 0.00% coata 1 0.00% (blatter 1 0.00% (cvtv 1 0.00% su'o'5t 1 0.00% mateus 1 0.00% mendez 1 0.00% nuremberg 1 0.00% (vui 1 0.00% xu5 1 0.00% (ddr 1 0.00% micro 1 0.00% goerlitz 1 0.00% perez 1 0.00% gerardo 1 0.00% (trái 1 0.00% daei 1 0.00% mình… 1 0.00% yahya 1 0.00% mario 1 0.00% nhu'3 1 0.00% torrado 1 0.00% volpe 1 0.00% mohammad 1 0.00% slowakei 1 0.00% marwijk 1 0.00% bert 1 0.00% (ffi 1 0.00% hernan 1 0.00% (cdu 1 0.00% ici 1 0.00% 30m 1 0.00% mez 1 0.00% cathay 1 0.00% bsg 1 0.00% botshabelo 1 0.00% chemnitz 1 0.00% jared 1 0.00% freiburg 1 0.00% bortmund 1 0.00% fulda 1 0.00% aliabadi 1 0.00% motro 1 0.00% timo 1 0.00% glad 1 0.00% hildebrand 1 0.00% trít

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 174 1 0.00% meet 1 0.00% bsc 1 0.00% tu3m 1 0.00% hertha 1 0.00% (i'm 1 0.00% ti3m 1 0.00% víctor 1 0.00% ivoira 1 0.00% tu'5c 1 0.00% boladdos 1 0.00% hernández 1 0.00% novotny 1 0.00% su5ng 1 0.00% ddu'o'5cvào 1 0.00% saborío 1 0.00% konnichiwa 1 0.00% ku'ln 1 0.00% jrgen 1 0.00% akale 1 0.00% trâ3y 1 0.00% dindane 1 0.00% aimar 1 0.00% mainz 1 0.00% fsv 1 0.00% keita 1 0.00% enke 1 0.00% wolfsburg 1 0.00% (ddô1i 1 0.00% xii 1 0.00% how 1 0.00% vfl 1 0.00% bleeckere 1 0.00% (03 1 0.00% hello 1 0.00% asamoah 1 0.00% (belgien 1 0.00% katei 1 0.00% mas 1 0.00% ua 1 0.00% vali 1 0.00% ya 1 0.00% alfaro 1 0.00% italien 1 0.00% jervis 1 0.00% tenorio 1 0.00% (brescia 1 0.00% agustin 1 0.00% hon 1 0.00% ô3ng 1 0.00% (kcx 1 0.00% malaga 1 0.00% avery

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 175 1 0.00% cu5i 1 0.00% wardy 1 0.00% ca(5m 1 0.00% eum 1 0.00% hagreves 1 0.00% lake 1 0.00% ca1c 1 0.00% chuy5ê5n 1 0.00% (real 1 0.00% boyfriend 1 0.00% (comunicaciones 1 0.00% cristian 1 0.00% azofeifa 1 0.00% (brujas 1 0.00% randall 1 0.00% mém 1 0.00% fring 1 0.00% shì 1 0.00% pressing 1 0.00% drummond 1 0.00% umadda 1 0.00% badilla 1 0.00% rodríguez 1 0.00% worls 1 0.00% harold 1 0.00% granite 1 0.00% eilliam 1 0.00% 'lên 1 0.00% lùc 1 0.00% c7 1 0.00% 429 1 0.00% (cctv 1 0.00% cao' 1 0.00% 'tha3o 1 0.00% gân' 1 0.00% wìliam 1 0.00% diaa 1 0.00% rashwan 1 0.00% tuy3 1 0.00% (cung 1 0.00% (hùynh 1 0.00% c3 1 0.00% giôn 1 0.00% kampoo 1 0.00% ahram 1 0.00% panna 1 0.00% ice 1 0.00% mri 1 0.00% age 1 0.00% nha(5ng 1 0.00% tu'21 1 0.00% tashkent

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 176 1 0.00% nâ1t 1 0.00% fadli 1 0.00% chòm 1 0.00% xu'o'5c 1 0.00% thuo'2ng 1 0.00% rê5u 1 0.00% (nhiê5m 1 0.00% 'ddô1i 1 0.00% lu'o'5c' 1 0.00% (co'2 1 0.00% gaojun 1 0.00% dòi 1 0.00% ru'3i 1 0.00% ngu5a 1 0.00% (tvdddd 1 0.00% mount 1 0.00% ddr 1 0.00% temple 1 0.00% sabra 1 0.00% chatila 1 0.00% qasa 1 0.00% (ttytdp 1 0.00% carthage 1 0.00% monetegro 1 0.00% francileudo 1 0.00% healthcare 1 0.00% attp 1 0.00% yitzhak 1 0.00% ec 1 0.00% excimer 1 0.00% 771 1 0.00% phalangist 1 0.00% menachem 1 0.00% 551 1 0.00% bashar 1 0.00% (tran 1 0.00% câ4ng 1 0.00% (ttl 1 0.00% thuyê1n 1 0.00% massege 1 0.00% co'… 1 0.00% mustapa 1 0.00% humberto 1 0.00% 8mm 1 0.00% nhâ1t… 1 0.00% samaria 1 0.00% fatehah 1 0.00% judea 1 0.00% rangers 1 0.00% hamed 1 0.00% amir 1 0.00% peretz

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 177 1 0.00% evangelicals 1 0.00% hammouda 1 0.00% rosenlund 1 0.00% 473 1 0.00% 496 1 0.00% 061 1 0.00% tê1' 1 0.00% servette 1 0.00% teneriffa 1 0.00% hòi 1 0.00% tró 1 0.00% 25b 1 0.00% xê3nh 1 0.00% (tô1n 1 0.00% moenchengladbach 1 0.00% tro'i 1 0.00% nhít 1 0.00% florea 1 0.00% genf 1 0.00% 'xiê1t 1 0.00% ma5nh' 1 0.00% locarno 1 0.00% co'4i 1 0.00% lô5n' 1 0.00% rostock 1 0.00% 'vàng 1 0.00% câ2u' 1 0.00% nghén 1 0.00% hbv 1 0.00% (ddi5nh 1 0.00% guard 1 0.00% hansa 1 0.00% cho3m 1 0.00% 'ddôi 1 0.00% lo'5i' 1 0.00% my4' 1 0.00% tô4ng 1 0.00% 'kính 1 0.00% 'khuynh 1 0.00% loát' 1 0.00% tri5' 1 0.00% (bia 1 0.00% 'mo'3 1 0.00% râ1p 1 0.00% 9h 1 0.00% clo 1 0.00% ddu'5oc 1 0.00% (times 1 0.00% luegde 1 0.00% 030 1 0.00% lý' 1 0.00% tình'

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 178 1 0.00% bon 1 0.00% 'biê3u 1 0.00% nha5o 1 0.00% (cá 1 0.00% bergen 1 0.00% câ5p… 1 0.00% musadshon 1 0.00% (cbt 1 0.00% tanimex 1 0.00% nhê1ch 1 0.00% (petro 1 0.00% phâ3u 1 0.00% (appeals 1 0.00% ddiê4n 1 0.00% madalina 1 0.00% assogbavi 1 0.00% oana 1 0.00% (nutraingredients 1 0.00% 7kg 1 0.00% geffen 1 0.00% gioan 1 0.00% phaolô 1 0.00% (thx 1 0.00% (14 1 0.00% ddai' 1 0.00% wangen 1 0.00% 'yêu 1 0.00% benenson 1 0.00% site 1 0.00% (aids 1 0.00% xu'ô1ng 1 0.00% chu5m 1 0.00% (stroke 1 0.00% (wfp 1 0.00% becora 1 0.00% wfp 1 0.00% steyr 1 0.00% glock 1 0.00% gusmão 1 0.00% sukehiro 1 0.00% tolu 1 0.00% fatuahi 1 0.00% tasi 1 0.00% rory 1 0.00% callinan 1 0.00% kazkhstan 1 0.00% sahara 1 0.00% 402 1 0.00% krông 1 0.00% magnum 1 0.00% 577 1 0.00% (ttn

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 179 1 0.00% abile 1 0.00% siem 1 0.00% (svtn 1 0.00% (pho3ng 1 0.00% rogério 1 0.00% reap 1 0.00% xo'1i 1 0.00% kuan 1 0.00% qua… 1 0.00% (pap 1 0.00% capumchia 1 0.00% ru'5a 1 0.00% sites 1 0.00% vision 1 0.00% hasegawa 1 0.00% 1300 1 0.00% maubisse 1 0.00% dare 1 0.00% sihanoukville 1 0.00% ea 1 0.00% kar 1 0.00% uganda 1 0.00% allafrica 1 0.00% lâ2m'' 1 0.00% balan 1 0.00% zarqa 1 0.00% smh 1 0.00% (ohio 1 0.00% ''mô1i 1 0.00% du'1t'' 1 0.00% hu'4a 1 0.00% ''cuô5c 1 0.00% (56 1 0.00% ''ddu'2ng 1 0.00% (145 1 0.00% rohan 1 0.00% gunaratna 1 0.00% kalaylah 1 0.00% (4a 1 0.00% amman 1 0.00% zarqawi'' 1 0.00% zarqo 1 0.00% tôr 1 0.00% ayatollah 1 0.00% (sa3n 1 0.00% schweigsteiger 1 0.00% sirry 1 0.00% châ4m 1 0.00% chãi 1 0.00% (ám 1 0.00% kenya 1 0.00% eritrea

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 180 1 0.00% khalilzad 1 0.00% nuri 1 0.00% zalmay 1 0.00% cao'' 1 0.00% laden'' 1 0.00% khalayleh 1 0.00% qudama 1 0.00% sayel 1 0.00% ma5cworld 1 0.00% maliki 1 0.00% stern 1 0.00% alterman 1 0.00% belgrad 1 0.00% (fta 1 0.00% (lat 1 0.00% (truongbangxdag 1 0.00% milesovic 1 0.00% (243a 1 0.00% rotern 1 0.00% cô5p 1 0.00% (32 1 0.00% (giâ1u 1 0.00% takesada 1 0.00% beck 1 0.00% hideshi 1 0.00% shigeru 1 0.00% (hddba 1 0.00% osasuna 1 0.00% masuda 1 0.00% pamplona 1 0.00% ô1p 1 0.00% predrag 1 0.00% dda( 1 0.00% crane 1 0.00% eximer 1 0.00% burghardt 1 0.00% (t 1 0.00% cxiii 1 0.00% dorothy 1 0.00% dworskin 1 0.00% (phân 1 0.00% khuây 1 0.00% smecta 1 0.00% neopeptine 1 0.00% antibio 1 0.00% sâ5m 1 0.00% connecticut 1 0.00% (43 1 0.00% bu5 1 0.00% vlxd 1 0.00% (mosnews 1 0.00% trum

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 181 1 0.00% samydorai 1 0.00% sinapan 1 0.00% chee 1 0.00% 16h 1 0.00% see 1 0.00% frederick 1 0.00% chiam 1 0.00% (hàng 1 0.00% think 1 0.00% hsien 1 0.00% loong 1 0.00% lém 1 0.00% yew 1 0.00% nhê4 1 0.00% báng 1 0.00% vê2… 1 0.00% chok 1 0.00% aljunied 1 0.00% goh 1 0.00% asahi 1 0.00% nishi 1 0.00% skijder 1 0.00% kuijt 1 0.00% yonsei 1 0.00% snijder 1 0.00% hye 1 0.00% lanzaat 1 0.00% nihon 1 0.00% akihiko 1 0.00% falklands 1 0.00% spratlys 1 0.00% trinida 1 0.00% hectare 1 0.00% (cu'5u 1 0.00% 4000 1 0.00% jaebum 1 0.00% 15min 1 0.00% 1900 1 0.00% heitiga 1 0.00% trillion 1 0.00% (purchasing 1 0.00% nehru 1 0.00% mohandas 1 0.00% jawaharlal 1 0.00% parity 1 0.00% urdu 1 0.00% gujarati 1 0.00% marathi 1 0.00% gengali 1 0.00% telugu 1 0.00% dda5o5 1 0.00% gideon

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 182 1 0.00% (axis 1 0.00% xuâ1i 1 0.00% eagleburger 1 0.00% (utopia 1 0.00% evil 1 0.00% (born 1 0.00% again 1 0.00% goldberg 1 0.00% (pyongyang 1 0.00% tru'o'1ùc 1 0.00% kannada 1 0.00% “dreamliner” 1 0.00% b6 1 0.00% 700km 1 0.00% safta 1 0.00% ny 1 0.00% p3 1 0.00% (gtcc 1 0.00% gio'2… 1 0.00% báy 1 0.00% c17 1 0.00% gtcc 1 0.00% shankar 1 0.00% sindhi 1 0.00% comman 1 0.00% assamese 1 0.00% oriya 1 0.00% punjabi 1 0.00% (members 1 0.00% natural 1 0.00% mani 1 0.00% (liquefied 1 0.00% ttiê5u 1 0.00% lng 1 0.00% 1648 1 0.00% ziad 1 0.00% feith 1 0.00% khouri 1 0.00% bulliet 1 0.00% rami 1 0.00% (review 1 0.00% hughes 1 0.00% wilkerson 1 0.00% karen 1 0.00% if 1 0.00% 2200 1 0.00% (vhtt 1 0.00% jaafari 1 0.00% wildlife 1 0.00% ibrahim 1 0.00% ddóng' 1 0.00% khalizad

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 183 1 0.00% hakim 1 0.00% cpa 1 0.00% mowaffaq 1 0.00% provisional 1 0.00% leavitt 1 0.00% hirsh 1 0.00% ddô3ng 1 0.00% gorbachev 1 0.00% 13h 1 0.00% sharansky 1 0.00% soljenitsyn 1 0.00% natan 1 0.00% (bhxh 1 0.00% cu5ng 1 0.00% wesphalia 1 0.00% bi4 1 0.00% deng 1 0.00% xiaoping 1 0.00% sakharo 1 0.00% woodrow 1 0.00% 1917 1 0.00% 1821 1 0.00% quincy 1 0.00% adams 1 0.00% (america 1 0.00% empire 1 0.00% (détente 1 0.00% (evil 1 0.00% su'3u3 1 0.00% gio'1ùi 1 0.00% ácch 1 0.00% visa… 1 0.00% cao… 1 0.00% price 1 0.00% nathan 1 0.00% (vnvnonn 1 0.00% controls 1 0.00% 39000 1 0.00% control 1 0.00% crain 1 0.00% ngu’ 1 0.00% gan” 1 0.00% 10h45 1 0.00% 6h45 1 0.00% broadway 1 0.00% du’o’4ng” 1 0.00% baseball 1 0.00% trê5ch 1 0.00% collins 1 0.00% “to 1 0.00% nho’3 1 0.00% mcneff

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 184 1 0.00% register 1 0.00% caro 1 0.00% “cha(1c 1 0.00% ho5” 1 0.00% ldm 1 0.00% chrysler 1 0.00% romulus 1 0.00% (dinh 1 0.00% “thông 1 0.00% (norad 1 0.00% geneve 1 0.00% “phía 1 0.00% company 1 0.00% ag 1 0.00% tulane 1 0.00% kilowatt 1 0.00% thousand 1 0.00% detroit 1 0.00% oaks 1 0.00% newcomb 1 0.00% unger 1 0.00% sue 1 0.00% automotive 1 0.00% dearborn 1 0.00% fame 1 0.00% “bô2i 1 0.00% a321 1 0.00% 100m 1 0.00% a320 1 0.00% (hkd 1 0.00% 767 1 0.00% (kéo 1 0.00% weekly 1 0.00% matxco’va 1 0.00% (htv 1 0.00% (nl 1 0.00% 150m 1 0.00% (so'2 1 0.00% tra(5c 1 0.00% ddã… 1 0.00% dwt 1 0.00% 334 1 0.00% 000dwt 1 0.00% (ktm 1 0.00% ru’o’3i 1 0.00% (eximbank 1 0.00% (bank 1 0.00% ddang… 1 0.00% (hsbc 1 0.00% northwest 1 0.00% ðô4 1 0.00% vieteuro

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 185 1 0.00% (horea 1 0.00% sachen 1 0.00% 042 1 0.00% (bavik 1 0.00% 3x4 1 0.00% vô2n 1 0.00% vodka 1 0.00% ogoniok 1 0.00% 19h 1 0.00% anhalt 1 0.00% elliot 1 0.00% mukilteo 1 0.00% info 1 0.00% bedart 1 0.00% 1700 1 0.00% pak 1 0.00% tuô2n 1 0.00% 4595 1 0.00% engle 1 0.00% hyon 1 0.00% mercer 1 0.00% ngu'o'2i' 1 0.00% 219 1 0.00% gm 1 0.00% pity 1 0.00% khruschev 1 0.00% niall 1 0.00% (neo 1 0.00% (vu4 1 0.00% biden 1 0.00% destruction 1 0.00% conservative 1 0.00% (weapons 1 0.00% zbigniew 1 0.00% strike 1 0.00% (bully 1 0.00% emptive 1 0.00% tê4nh 1 0.00% (pre 1 0.00% (wild 1 0.00% tha3nh 1 0.00% tho'i 1 0.00% hopkins 1 0.00% speculation 1 0.00% johns 1 0.00% baron 1 0.00% zdebko 1 0.00% tomanovic 1 0.00% tuýp 1 0.00% nothing 1 0.00% gaza's 1 0.00% gia3o

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 186 1 0.00% serbian 1 0.00% balkans 1 0.00% pha5n 1 0.00% (1991 1 0.00% (1992 1 0.00% paletine 1 0.00% balance 1 0.00% (law 1 0.00% (check 1 0.00% montesquieu 1 0.00% (separation 1 0.00% order 1 0.00% difference 1 0.00% (na(ng 1 0.00% xay 1 0.00% 1863 1 0.00% (difference 1 0.00% rodhan 1 0.00% da5m 1 0.00% moriarty 1 0.00% thinking 1 0.00% nahavandian 1 0.00% (wishful 1 0.00% kantipur 1 0.00% (xuyên 1 0.00% dzo5t 1 0.00% u'3ng 1 0.00% kalandi 1 0.00% minendra 1 0.00% knesset 1 0.00% (defense 1 0.00% board 1 0.00% ddiê3m' 1 0.00% (moscow 1 0.00% 'du'1t 1 0.00% cambone 1 0.00% meir 1 0.00% dagan 1 0.00% galluci 1 0.00% azeri 1 0.00% baluchi 1 0.00% forein 1 0.00% hexafluoride 1 0.00% (cascade 1 0.00% (isotope 1 0.00% 12000 1 0.00% sái 1 0.00% enriched 1 0.00% saeedi 1 0.00% khalid 1 0.00% ve5m 1 0.00% (chiê1c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 187 1 0.00% (high 1 0.00% ba3i 1 0.00% dác 1 0.00% dáo 1 0.00% (la5i 1 0.00% fool 1 0.00% twice 1 0.00% toàu5 1 0.00% ddi4nh 1 0.00% scud 1 0.00% patriot 1 0.00% mé 1 0.00% shabbaz 1 0.00% gulkin 1 0.00% siam 1 0.00% (asean 1 0.00% sihasak 1 0.00% phuangketkaew 1 0.00% methamphetamine 1 0.00% zaw 1 0.00% morris 1 0.00% kyaw 1 0.00% muse 1 0.00% ra5 1 0.00% giãy 1 0.00% daewoo 1 0.00% ongc 1 0.00% (total 1 0.00% burma 1 0.00% schiver 1 0.00% videsh 1 0.00% hideaki 1 0.00% mizukoshi 1 0.00% symon 1 0.00% offshore 1 0.00% cnooc 1 0.00% xô3 1 0.00% (state 1 0.00% (câu 1 0.00% (cu'3 1 0.00% ablonczy 1 0.00% ròi 1 0.00% ddoá 1 0.00% birzeit 1 0.00% risk 1 0.00% aqtash 1 0.00% (muslim 1 0.00% nashat 1 0.00% diane 1 0.00% koji 1 0.00% triumph 1 0.00% ddoái

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 188 1 0.00% kandawgyi 1 0.00% bosf 1 0.00% (bosf 1 0.00% lem 1 0.00% nhem 1 0.00% earthrights 1 0.00% katie 1 0.00% redford 1 0.00% pyinmana 1 0.00% cóng 1 0.00% naftogaz 1 0.00% resort 1 0.00% trie5âu 1 0.00% vu'5o't 1 0.00% trám 1 0.00% catalogue 1 0.00% ekho 1 0.00% valentine 1 0.00% (nato 1 0.00% armenia 1 0.00% essar 1 0.00% honecker 1 0.00% zhikov 1 0.00% rakowski 1 0.00% vsattp 1 0.00% (vsattp 1 0.00% (bulgary 1 0.00% rích 1 0.00% 603 1 0.00% vãng 1 0.00% tuo'3ng 1 0.00% tht 1 0.00% moskvy 1 0.00% havel 1 0.00% sarandon 1 0.00% vaclav 1 0.00% chuô1t 1 0.00% tutu 1 0.00% nengzheng 1 0.00% khin 1 0.00% (phiên 1 0.00% templer 1 0.00% ngô5p 1 0.00% 685 1 0.00% dô5 1 0.00% baltic 1 0.00% lavia 1 0.00% xiét 1 0.00% (kremlin 1 0.00% moldova 1 0.00% jerzy 1 0.00% gasprom

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 189 1 0.00% rosnef 1 0.00% click 1 0.00% marek 1 0.00% lóm 1 0.00% numo 1 0.00% gomes 1 0.00% (ubtvqh 1 0.00% tvqh 1 0.00% bastain 1 0.00% ðào 1 0.00% alessandro 1 0.00% no’5 1 0.00% dormund 1 0.00% quô1ctê1 1 0.00% gottlieb 1 0.00% pathez 1 0.00% zindane 1 0.00% “thi 1 0.00% hô5t 1 0.00% “zidane 1 0.00% “chính 1 0.00% tro’1 1 0.00% “nín 1 0.00% tho’3” 1 0.00% vo’2 1 0.00% co’m 1 0.00% guide 1 0.00% stade 1 0.00% cupper 1 0.00% “câ5u 1 0.00% hector 1 0.00% lionel 1 0.00% bianchi 1 0.00% “la(1m 1 0.00% “hoàng 1 0.00% ddê1” 1 0.00% oán” 1 0.00% tâ5t” 1 0.00% “ân 1 0.00% (jean 1 0.00% luzon 1 0.00% messenger 1 0.00% sampa 1 0.00% “chú 1 0.00% gôloa” 1 0.00% taylor 1 0.00% buenos 1 0.00% aires 1 0.00% “ngay 1 0.00% “khá 1 0.00% julio 1 0.00% xoàn

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 190 1 0.00% stefano 1 0.00% ki3nh 1 0.00% damian 1 0.00% thoa(n 1 0.00% luxembourg 1 0.00% kloden 1 0.00% luciano 1 0.00% moggi 1 0.00% tu’o’1c 1 0.00% kha5ng 1 0.00% khê5nh 1 0.00% hðba 1 0.00% nazareth 1 0.00% róckét 1 0.00% afula 1 0.00% 25000 1 0.00% tripoli 1 0.00% ddô1m 1 0.00% helmang 1 0.00% tra3o 1 0.00% sangin 1 0.00% carbonic 1 0.00% 6000 1 0.00% carraro 1 0.00% circus 1 0.00% maximus 1 0.00% sa(1c” 1 0.00% na(2n 1 0.00% “thành 1 0.00% co’2 1 0.00% lu’o’4ng 1 0.00% lousis 1 0.00% trâ5n” 1 0.00% bryant 1 0.00% “thâ1t 1 0.00% nì 1 0.00% cuaro’ 1 0.00% menchov 1 0.00% dozen 1 0.00% ru’o’4i 1 0.00% (vùng 1 0.00% (nông 1 0.00% farm 1 0.00% ðu’1c 1 0.00% juergen 1 0.00% béziers 1 0.00% montélimar 1 0.00% (bvd 1 0.00% 00961 1 0.00% sheila 1 0.00% hu’3u 1 0.00% aâu

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 191 1 0.00% chili 1 0.00% bo5ng 1 0.00% xu’o’ng 1 0.00% fudan 1 0.00% mali 1 0.00% tho’ 1 0.00% khu’3 1 0.00% monrovia 1 0.00% (bùi 1 0.00% khoa3ng150 1 0.00% do’2i 1 0.00% no’3 1 0.00% cho’1 1 0.00% 228 1 0.00% (tn 1 0.00% 15h20 1 0.00% thút 1 0.00% webcam 1 0.00% ddiê3u 1 0.00% mcalary 1 0.00% heather 1 0.00% phu’1 1 0.00% ningxia 1 0.00% zhongwei 1 0.00% 01m 1 0.00% ðiê2u 1 0.00% medic 1 0.00% latvia 1 0.00% suyê4n 1 0.00% cu’o’4ng 1 0.00% trans 1 0.00% pearson 1 0.00% tufts 1 0.00% “nguyên 1 0.00% jakarka 1 0.00% do’i 1 0.00% khem 1 0.00% transfat 1 0.00% lu’o’2ng 1 0.00% calori 1 0.00% omega 1 0.00% thu’o’1c 1 0.00% thêu 1 0.00% nhirn 1 0.00% pclb 1 0.00% jingpping 1 0.00% tru’4 1 0.00% zheng 1 0.00% wto–liên 1 0.00% ddi5nh” 1 0.00% 57250099 1 0.00% “danh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 192 1 0.00% 070 1 0.00% achin 1 0.00% xu’o’3ng 1 0.00% crepso 1 0.00% (khóa 1 0.00% “nêm” 1 0.00% “ba3n 1 0.00% doanh” 1 0.00% “ddâ1u 1 0.00% masahiro 1 0.00% soo 1 0.00% schearf 1 0.00% ddôi” 1 0.00% frnakfurt 1 0.00% 633 1 0.00% lo’ 1 0.00% lu’3ng 1 0.00% snooker 1 0.00% hydrazine 1 0.00% kiê4m 1 0.00% fossumvà 1 0.00% discovry 1 0.00% hydro 1 0.00% 9h30 1 0.00% billiards 1 0.00% cghu’4a 1 0.00% reiter 1 0.00% khu’5ng 1 0.00% 0000943045 1 0.00% goldman 1 0.00% 57250095 1 0.00% roberts 1 0.00% (vama 1 0.00% camry 1 0.00% 0000943048 1 0.00% toyota 1 0.00% 652 1 0.00% 50000 1 0.00% berirut 1 0.00% phu’o’2ng 1 0.00% lu’o’4i 1 0.00% cu’1ng 1 0.00% (if 1 0.00% mssu 1 0.00% ‘thôi 1 0.00% 22000 1 0.00% “tô3ng 1 0.00% 8500 1 0.00% chu’1c’ 1 0.00% sim 1 0.00% tutor 1 0.00% you'll

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 193 1 0.00% “game” 1 0.00% mô1c”cu3a 1 0.00% 927 1 0.00% “dâ1u 1 0.00% ‘game’ 1 0.00% “kiê1m 1 0.00% thu3” 1 0.00% “swordsman” 1 0.00% bryan 1 0.00% baby 1 0.00% gì” 1 0.00% blogweb 1 0.00% reply 1 0.00% kha3ng 1 0.00% xén” 1 0.00% verison 1 0.00% ranchomirage 1 0.00% mu’a 1 0.00% mccormack 1 0.00% philipin 1 0.00% falls 1 0.00% khê3nh 1 0.00% “biê1u 1 0.00% fibre2fashion 1 0.00% nhu’5a 1 0.00% to’2 1 0.00% lu’o’5t 1 0.00% mu’o’1n 1 0.00% “ban 1 0.00% “có 1 0.00% ddiê3m” 1 0.00% án” 1 0.00% nhu4ng” 1 0.00% “cha5y 1 0.00% 328 1 0.00% u’u 1 0.00% karate 1 0.00% tantilo 1 0.00% (vô 1 0.00% olympics 1 0.00% menu 1 0.00% chu’1 1 0.00% (karate 1 0.00% “trong 1 0.00% wushu… 1 0.00% judo 1 0.00% tate 1 0.00% dda(1n” 1 0.00% dak 1 0.00% “lu’5a 1 0.00% scandasia 1 0.00% (lcci

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 194 1 0.00% lak 1 0.00% boxing 1 0.00% deborah 1 0.00% zimbabue 1 0.00% nu’o’ng 1 0.00% vanuatu 1 0.00% v… 1 0.00% democrat 1 0.00% gazette 1 0.00% “bu’o’1c 1 0.00% doang 1 0.00% hàn… 1 0.00% southern 1 0.00% no’1i 1 0.00% 141 1 0.00% never 1 0.00% level 1 0.00% tro’n 1 0.00% ha5i” 1 0.00% “chô1ng 1 0.00% lo'1n… 1 0.00% bê1n… 1 0.00% plongeon 1 0.00% chuy5ên 1 0.00% mathes 1 0.00% nu'o'1c… 1 0.00% “tê5 1 0.00% soviet 1 0.00% croissiere… 1 0.00% highway 1 0.00% nho’3n 1 0.00% nho’ 1 0.00% khemr 1 0.00% surrey 1 0.00% cu’u5 1 0.00% chi3a 1 0.00% shemona 1 0.00% li3a 1 0.00% kiryat 1 0.00% isael 1 0.00% safed 1 0.00% hornpipe 1 0.00% jigs 1 0.00% co’4 1 0.00% al–bashir 1 0.00% bruxelles 1 0.00% ngoác 1 0.00% treble 1 0.00% nitrate 1 0.00% tdx 1 0.00% ammonium 1 0.00% 182

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 195 1 0.00% dâ4nddâ2u 1 0.00% farouk 1 0.00% trust 1 0.00% (ddêm 1 0.00% sirie 1 0.00% ro’1t 1 0.00% stearns 1 0.00% tuz 1 0.00% berut 1 0.00% awkar 1 0.00% kazzaz 1 0.00% kurmatu 1 0.00% adel 1 0.00% bo’1i 1 0.00% dancing 1 0.00% mahmoudiya 1 0.00% burnaby 1 0.00% sharra 1 0.00% villepin 1 0.00% lu’5u 1 0.00% championships 1 0.00% solo 1 0.00% jordanie 1 0.00% moqtada 1 0.00% sadr 1 0.00% celtic 1 0.00% kufa 1 0.00% tho’2 1 0.00% ðánh 1 0.00% naway 1 0.00% barakzayi 1 0.00% mouwafak 1 0.00% kelowna 1 0.00% bint 1 0.00% thê3” 1 0.00% (women 1 0.00% “nhu’4ng 1 0.00% helmand 1 0.00% lp 1 0.00% baluchistan 1 0.00% woodbridge 1 0.00% shinzo 1 0.00% abe 1 0.00% quetta 1 0.00% schwartzeneggar 1 0.00% need 1 0.00% phuong 1 0.00% mullah 1 0.00% hamdullah 1 0.00% jbeil 1 0.00% 150000 1 0.00% 200000

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 196 1 0.00% 5700 1 0.00% amiri 1 0.00% yogyokarta 1 0.00% riverside 1 0.00% brum 1 0.00% lord 1 0.00% shlomo 1 0.00% steeple 1 0.00% cáøc 1 0.00% haq 1 0.00% western 1 0.00% chad 1 0.00% udi 1 0.00% nahriya 1 0.00% oireachtas 1 0.00% bouchard 1 0.00% pangandara 1 0.00% torng 1 0.00% 23000 1 0.00% hadi 1 0.00% kuswoyo 1 0.00% high 1 0.00% po'1p 1 0.00% ngô4ng' 1 0.00% tru'5 1 0.00% câ2p 1 0.00% ha3ng 1 0.00% (khtn 1 0.00% trei3n 1 0.00% (buô5c 1 0.00% 11dd 1 0.00% (khxh 1 0.00% cu4a 1 0.00% qh10 1 0.00% qùa 1 0.00% nhâ3n 1 0.00% ddóm 1 0.00% aga 1 0.00% cmc 1 0.00% 9333 1 0.00% 780 1 0.00% (tiê1p 1 0.00% làâ 1 0.00% ngu'á 1 0.00% sình 1 0.00% 13dd 1 0.00% licence 1 0.00% maitrise 1 0.00% iut 1 0.00% dea 1 0.00% as 1 0.00% doctorat

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 197 1 0.00% lo'1 1 0.00% i4 1 0.00% re3o 1 0.00% ddãm 1 0.00% diêu 1 0.00% gio3i' 1 0.00% transnational 1 0.00% internationalization 1 0.00% investment 1 0.00% 322 1 0.00% hco 1 0.00% nha(2mmu5c 1 0.00% honolulu 1 0.00% 16dd 1 0.00% pacifichem 1 0.00% ddánhgiá 1 0.00% (innovation 1 0.00% rê1t 1 0.00% darrunta 1 0.00% jalalabad 1 0.00% masri 1 0.00% gall 1 0.00% jehl 1 0.00% chô3i 1 0.00% rahman 1 0.00% maghrebi 1 0.00% abd 1 0.00% misri 1 0.00% shakai 1 0.00% carlotta 1 0.00% (afghanistan 1 0.00% khurseed 1 0.00% vàâ 1 0.00% sheikh 1 0.00% rashid 1 0.00% vecto' 1 0.00% (1801 1 0.00% tncs 1 0.00% msnbc 1 0.00% ddáu 1 0.00% 1865 1 0.00% farraj 1 0.00% sheibani 1 0.00% bartis 1 0.00% ebrahim 1 0.00% ruholah 1 0.00% khomeini 1 0.00% 19001785 1 0.00% perot 1 0.00% depot 1 0.00% lott 1 0.00% iwhrekan

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 198 1 0.00% goldwyn 1 0.00% (chi5 1 0.00% quôc 1 0.00% satan 1 0.00% bakhtur 1 0.00% (gd 1 0.00% bakhthur 1 0.00% (kháng 1 0.00% hoffman 1 0.00% strait 1 0.00% rabia 1 0.00% nhu3i 1 0.00% hamza 1 0.00% xáng 1 0.00% 370 1 0.00% 670 1 0.00% (42 1 0.00% (khóm 1 0.00% têt 1 0.00% 353 1 0.00% mobicard 1 0.00% ngiê5m 1 0.00% quy5t 1 0.00% lèn 1 0.00% 29dd 1 0.00% 7610 1 0.00% (tham 1 0.00% che5n 1 0.00% clamoxyl 1 0.00% adrenalin 1 0.00% 382a 1 0.00% 26dd 1 0.00% 31dd 1 0.00% thân( 1 0.00% hoa3( 1 0.00% 28dd 1 0.00% nuông 1 0.00% í 1 0.00% u5n 1 0.00% ddâ1t' 1 0.00% du'o'5ng 1 0.00% 'rô2ng 1 0.00% cu5p 1 0.00% hên 1 0.00% tich 1 0.00% (na(2m 1 0.00% 492 1 0.00% lâ3u 1 0.00% mai' 1 0.00% lêu 1 0.00% lô3ng 1 0.00% xiêu

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 199 1 0.00% ga(5ng 1 0.00% 42dd 1 0.00% 405 1 0.00% 'ddui' 1 0.00% 'ddô2i 1 0.00% chu3' 1 0.00% ca5y 1 0.00% 'bá 1 0.00% vi3 1 0.00% cha(1m 1 0.00% dda(p 1 0.00% (26dd 1 0.00% kha(m 1 0.00% trâ1u 1 0.00% (31dd 1 0.00% (35 1 0.00% quê5t 1 0.00% tíc 1 0.00% (34 1 0.00% lú 1 0.00% dâ1m 1 0.00% ngô5n 1 0.00% u3n 1 0.00% ngô2n 1 0.00% 14dd 1 0.00% ô1t 1 0.00% i3n 1 0.00% toáng 1 0.00% cu'1a 1 0.00% toáy 1 0.00% eng 1 0.00% éc 1 0.00% láu 1 0.00% nhá 1 0.00% (21dd 1 0.00% bu'o'm 1 0.00% (thâ1y 1 0.00% phô4ng 1 0.00% cho3ng 1 0.00% (44dd 1 0.00% (tru'o'3ng 1 0.00% khô3n 1 0.00% ghiê5m 1 0.00% kìê5n 1 0.00% si3nh 1 0.00% pha3y 1 0.00% do3ng 1 0.00% ca5ch 1 0.00% táu 1 0.00% (36 1 0.00% vâ5p 1 0.00% lt

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 200 1 0.00% (37dd 1 0.00% vts 1 0.00% tktt 1 0.00% choe5t 1 0.00% crocker 1 0.00% iaculor 1 0.00% mm 1 0.00% republican 1 0.00% politics 1 0.00% 407 1 0.00% wthr 1 0.00% lpr 1 0.00% 390m 1 0.00% pips 1 0.00% against 1 0.00% waste 1 0.00% responsive 1 0.00% garn 1 0.00% 410 1 0.00% jake 1 0.00% (cnw 1 0.00% helium 1 0.00% mini 1 0.00% zabel 1 0.00% injection 1 0.00% roth 1 0.00% sherbrooke 1 0.00% schulte 1 0.00% boom 1 0.00% focus 1 0.00% louie 1 0.00% falwell 1 0.00% moral 1 0.00% panasonic 1 0.00% giu4 1 0.00% elot 1 0.00% groveri 1 0.00% ralston 1 0.00% century 1 0.00% strategies 1 0.00% cingular 1 0.00% (perfect 1 0.00% (libertarian 1 0.00% (near 1 0.00% sonic 1 0.00% (speaker 1 0.00% ground 1 0.00% toward 1 0.00% tradition 1 0.00% wireless 1 0.00% ddddst 1 0.00% goodlatte

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 201 1 0.00% nãi 1 0.00% sciences 1 0.00% dòm 1 0.00% (indictment 1 0.00% avenir 1 0.00% hi4 1 0.00% thâ3u 1 0.00% (appropriations 1 0.00% kyoto 1 0.00% kelley 1 0.00% (góp 1 0.00% redlands 1 0.00% lawler 1 0.00% aviation 1 0.00% mahler 1 0.00% fn 1 0.00% dave 1 0.00% bruckner 1 0.00% nigel 1 0.00% ritz 1 0.00% schubert 1 0.00% ambassador's 1 0.00% vaughan 1 0.00% dvorak 1 0.00% (significant 1 0.00% hades 1 0.00% 'tô3 1 0.00% mb 1 0.00% fila 1 0.00% hedge 1 0.00% (vulture 1 0.00% freescale 1 0.00% bdi 1 0.00% protection 1 0.00% (chip 1 0.00% (bankruptcy 1 0.00% alamo 1 0.00% funds 1 0.00% netco 1 0.00% (mutual 1 0.00% role 1 0.00% (political 1 0.00% ram 1 0.00% anchor 1 0.00% rental 1 0.00% lnr 1 0.00% iap 1 0.00% (ram 1 0.00% copley 1 0.00% conrad 1 0.00% vest 1 0.00% employee

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 202 1 0.00% logic 1 0.00% doolittle 1 0.00% (belfast 1 0.00% choctaws 1 0.00% choctaw 1 0.00% reid 1 0.00% (southampton 1 0.00% (contract 1 0.00% marianna 1 0.00% (us 1 0.00% ngoi 1 0.00% cook 1 0.00% saipan 1 0.00% territories 1 0.00% (football 1 0.00% redskins 1 0.00% marianas 1 0.00% (james 1 0.00% (slide 1 0.00% qua3hay 1 0.00% (british 1 0.00% date 1 0.00% abscam 1 0.00% gambino 1 0.00% (maritime 1 0.00% burkanday 1 0.00% mê2n 1 0.00% va3ng 1 0.00% khu'1a 1 0.00% hujra 1 0.00% pashtun 1 0.00% alvarez 1 0.00% talk 1 0.00% alexandria 1 0.00% town 1 0.00% dorgan 1 0.00% rehoboth 1 0.00% coushatta 1 0.00% (cotton 1 0.00% cesar 1 0.00% konstantinos 1 0.00% ring 1 0.00% bu'o'1m 1 0.00% northern 1 0.00% vìu5 1 0.00% tortilla 1 0.00% ru'òm 1 0.00% (suspension 1 0.00% calendar 1 0.00% châ3y 1 0.00% tru5quô1c 1 0.00% (bulk

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 203 1 0.00% lehman 1 0.00% (simple 1 0.00% kathryn 1 0.00% (amendments 1 0.00% bluetooth 1 0.00% denny 1 0.00% pc 1 0.00% bae 1 0.00% farnborough 1 0.00% quarterly 1 0.00% anderson 1 0.00% soyuz 1 0.00% exmovere 1 0.00% jeb 1 0.00% matthew 1 0.00% rate 1 0.00% brandeis 1 0.00% (college 1 0.00% now 1 0.00% corleone 1 0.00% (skyboxes 1 0.00% lafayette 1 0.00% gingrich 1 0.00% diners 1 0.00% newt 1 0.00% hamamatsu 1 0.00% namibia 1 0.00% lynne 1 0.00% madagascar 1 0.00% fantasy 1 0.00% hayes 1 0.00% oklahoma 1 0.00% rogan 1 0.00% sports 1 0.00% (felonies 1 0.00% pioneers 1 0.00% won 1 0.00% good 1 0.00% guys 1 0.00% zeus 1 0.00% giòi 1 0.00% tomdispatch 1 0.00% globalsecurity 1 0.00% engelhardt 1 0.00% (hope 1 0.00% axis 1 0.00% 4x4 1 0.00% ducks 1 0.00% vo5i 1 0.00% (lame 1 0.00% pike 1 0.00% aparisim

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 204 1 0.00% gallardo 1 0.00% pendleton 1 0.00% eldon 1 0.00% bargewell 1 0.00% (collateral 1 0.00% calley 1 0.00% (draftees 1 0.00% donnelly 1 0.00% damage 1 0.00% sally 1 0.00% nghi5u 1 0.00% them 1 0.00% (dead 1 0.00% (bring 1 0.00% a8l 1 0.00% cassano 1 0.00% alive 1 0.00% baath 1 0.00% madeline 1 0.00% loa5n' 1 0.00% thía 1 0.00% 'pha3n 1 0.00% aznar 1 0.00% (vi5 1 0.00% ballistic 1 0.00% navigato 1 0.00% colgate 1 0.00% rwanda 1 0.00% missile 1 0.00% willing 1 0.00% maría 1 0.00% h2 1 0.00% abm 1 0.00% 1411 1 0.00% arnage 1 0.00% husayif 1 0.00% jassim 1 0.00% nahiba 1 0.00% swarzkopt 1 0.00% samarra 1 0.00% faliha 1 0.00% nightline 1 0.00% koppel 1 0.00% dover 1 0.00% ishaqi 1 0.00% enzo 1 0.00% norman 1 0.00% (160 1 0.00% (191 1 0.00% (157 1 0.00% beast 1 0.00% 805

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 205 1 0.00% yogurt 1 0.00% deutsche 1 0.00% mahmud 1 0.00% fe 1 0.00% xie 1 0.00% deshui 1 0.00% zarquawi 1 0.00% a8 1 0.00% hammurabi 1 0.00% taher 1 0.00% ayed 1 0.00% briones 1 0.00% phantom 1 0.00% barry 1 0.00% rolls 1 0.00% aqi 1 0.00% pyjama 1 0.00% ramadi 1 0.00% younis 1 0.00% escalade 1 0.00% euphrates 1 0.00% x5 1 0.00% (incident 1 0.00% colleen 1 0.00% anbar 1 0.00% wuterich 1 0.00% waleed 1 0.00% terrazas 1 0.00% cadillac 1 0.00% sinna 1 0.00% judith 1 0.00% miller 1 0.00% 1 0.00% rather 1 0.00% (paper 1 0.00% xi5ch 1 0.00% dirita 1 0.00% whitman 1 0.00% near 1 0.00% boss 1 0.00% (messianic 1 0.00% pulitzer 1 0.00% titan 1 0.00% crôm 1 0.00% (inconceivable 1 0.00% sh 1 0.00% (chô1ng 1 0.00% (ma5 1 0.00% (plan 1 0.00% attack 1 0.00% bernstein 1 0.00% 2400

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 206 1 0.00% carl 1 0.00% armani 1 0.00% strategic 1 0.00% (berlin 1 0.00% gucci 1 0.00% (arabian 1 0.00% (tactical 1 0.00% (ballistic 1 0.00% staff 1 0.00% low 1 0.00% chiefs 1 0.00% khè 1 0.00% (cruise 1 0.00% reconnaissance 1 0.00% (executing 1 0.00% orders 1 0.00% không' 1 0.00% blitzer 1 0.00% 'ném 1 0.00% (presidential 1 0.00% operations 1 0.00% (black 1 0.00% (covert 1 0.00% finding 1 0.00% (luâ5t 1 0.00% breaking 1 0.00% 2025 1 0.00% 5m2 1 0.00% hôi5 1 0.00% pas 1 0.00% nord 1 0.00% chùng 1 0.00% chemical 1 0.00% (r 1 0.00% yarnell 1 0.00% ddâ2âu 1 0.00% de4o 1 0.00% calais 1 0.00% diesel 1 0.00% na(m1979 1 0.00% chô2n 1 0.00% qaida 1 0.00% guantanamo 1 0.00% tai5 1 0.00% hhà 1 0.00% bengali 1 0.00% (60 1 0.00% davos 1 0.00% ttk 1 0.00% traí 1 0.00% ''tru5c 1 0.00% ác''

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 207 1 0.00% khí'' 1 0.00% baradei 1 0.00% du4a 1 0.00% ''có 1 0.00% wave 1 0.00% maixo 1 0.00% gwozdecky 1 0.00% giá'' 1 0.00% nhân'' 1 0.00% (ba3o 1 0.00% vu'o'1t 1 0.00% 2040 1 0.00% mu'o'ng 1 0.00% derming 1 0.00% golman 1 0.00% 2050 1 0.00% (oecd 1 0.00% khô5ng 1 0.00% quô1âc 1 0.00% mo'i 1 0.00% sa(ùt 1 0.00% beauty 1 0.00% trô1 1 0.00% u5 1 0.00% rs6 1 0.00% alexanrda 1 0.00% mo5p 1 0.00% véo 1 0.00% children's 1 0.00% (adelaide 1 0.00% 'ngày 1 0.00% oà 1 0.00% 312 1 0.00% 599 1 0.00% heo' 1 0.00% 'tâ1n 1 0.00% 'con 1 0.00% co'5t 1 0.00% 155 1 0.00% công' 1 0.00% 'nãy 1 0.00% biê1ng 1 0.00% sia3 1 0.00% gtb 1 0.00% que5o 1 0.00% sàigon 1 0.00% nylong 1 0.00% buà 1 0.00% chô3 1 0.00% lu'à 1 0.00% xôm 1 0.00% e3o

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 208 1 0.00% co5a3ng 1 0.00% tri5â3 1 0.00% 1886 1 0.00% muá 1 0.00% lexus 1 0.00% 1400 1 0.00% marelli 1 0.00% hànôinet 1 0.00% yesterday 1 0.00% rx 1 0.00% siva 1 0.00% nhách 1 0.00% le5o 1 0.00% kê1p 1 0.00% lu'o'n 1 0.00% bus 1 0.00% hanoinet 1 0.00% luâ5n' 1 0.00% (ngay 1 0.00% orchad 1 0.00% (chênh 1 0.00% khcn 1 0.00% ngò 1 0.00% katong 1 0.00% 988 1 0.00% 'tour 1 0.00% ddình… 1 0.00% pearls 1 0.00% (1m79 1 0.00% (1549 1 0.00% mu3n 1 0.00% tuê1 1 0.00% vinh' 1 0.00% (cddsp 1 0.00% buá 1 0.00% niêu 1 0.00% na3 1 0.00% di5t 1 0.00% 5b 1 0.00% xoong 1 0.00% thô5t 1 0.00% 888 1 0.00% thoóc 1 0.00% 'cà 1 0.00% hâ2y 1 0.00% chu'5ng 1 0.00% khú 1 0.00% sex' 1 0.00% hâ5p 1 0.00% 'bình 1 0.00% chu'á 1 0.00% 7h

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 209 1 0.00% (51 1 0.00% chuôi 1 0.00% orchard 1 0.00% 'nàng 1 0.00% myfinances 1 0.00% limousine 1 0.00% webgiadinh 1 0.00% nghiã' 1 0.00% o'1 1 0.00% lòe 1 0.00% no'1p 1 0.00% xia3 1 0.00% (1964 1 0.00% mclaren 1 0.00% 327 1 0.00% slr 1 0.00% tro'5t 1 0.00% ngáp 1 0.00% massage 1 0.00% ddô2m 1 0.00% phu'1t 1 0.00% nnptnt 1 0.00% 'da5 1 0.00% carrera 1 0.00% tu'3u 1 0.00% la3nh 1 0.00% job 1 0.00% hoa(2n 1 0.00% ddo'5 1 0.00% zizou 1 0.00% citroen 1 0.00% ké 1 0.00% nhu4i 1 0.00% lúy 1 0.00% hóp 1 0.00% la3o 1 0.00% 450dd 1 0.00% tra( 1 0.00% (ds 1 0.00% jaguar 1 0.00% trozim 1 0.00% (700 1 0.00% vw 1 0.00% maserati 1 0.00% (tùy 1 0.00% lêniniste 1 0.00% (bvddk 1 0.00% lancia 1 0.00% (ct 1 0.00% pharma 1 0.00% quyê2nâ3 1 0.00% zuellig

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 210 1 0.00% ddôa3c 1 0.00% â5m 1 0.00% lèm 1 0.00% renault 1 0.00% tarcefokyn 1 0.00% cu3aâ3 1 0.00% nhèm 1 0.00% 320usd 1 0.00% continental 1 0.00% mia3 1 0.00% nghiá 1 0.00% vo'5' 1 0.00% 'cuô5c 1 0.00% quô1c' 1 0.00% cha5c 1 0.00% ddâù 1 0.00% nhâ3u 1 0.00% xi5 1 0.00% cls55 1 0.00% lu'2 1 0.00% amg 1 0.00% vietbooks 1 0.00% lu'5i4c 1 0.00% xuât 1 0.00% nck 1 0.00% sl65 1 0.00% didier 1 0.00% choa 1 0.00% xo5ach 1 0.00% (ptth 1 0.00% te3o 1 0.00% nghê 1 0.00% soa5ng 1 0.00% bu'o'i 1 0.00% huyn 1 0.00% hoàng' 1 0.00% parks 1 0.00% 'ông 1 0.00% nhu'n 1 0.00% ddâ5n 1 0.00% huâ5n 1 0.00% classe 1 0.00% saìgon 1 0.00% nu'o'1cviê5t 1 0.00% qualitex 1 0.00% gia5y 1 0.00% 'giá 1 0.00% 645 1 0.00% bi4u 1 0.00% ne3 1 0.00% lu'a5 1 0.00% vdc

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 211 1 0.00% 'qua3n 1 0.00% khiên 1 0.00% li5m 1 0.00% kha5o 1 0.00% appiah 1 0.00% producers 1 0.00% cultural 1 0.00% unions 1 0.00% form 1 0.00% labor 1 0.00% products 1 0.00% beliefs 1 0.00% expressions 1 0.00% (le4 1 0.00% export 1 0.00% (ddâ1t 1 0.00% (vâ5t 1 0.00% principle 1 0.00% fabrics 1 0.00% relevant 1 0.00% few 1 0.00% conditions 1 0.00% sustainable 1 0.00% should 1 0.00% include 1 0.00% amended 1 0.00% relationship 1 0.00% both 1 0.00% other 1 0.00% 160km 1 0.00% bu5p 1 0.00% ore 1 0.00% mont 1 0.00% choa3ng 1 0.00% warbird 1 0.00% scale 1 0.00% (câ1t 1 0.00% 35m 1 0.00% n4120 1 0.00% xoe5t 1 0.00% 700m 1 0.00% òn 1 0.00% file 1 0.00% declaration 1 0.00% basic 1 0.00% un's 1 0.00% a(ngten 1 0.00% (helicopter 1 0.00% (airplane 1 0.00% (methanol 1 0.00% airplane 1 0.00% 250cc

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 212 1 0.00% attach 1 0.00% samy 1 0.00% 683 1 0.00% vellu 1 0.00% 784 1 0.00% (ma5ng 1 0.00% 918 1 0.00% (parliamentary 1 0.00% ling 1 0.00% (countries 1 0.00% sik 1 0.00% liong 1 0.00% fontelles 1 0.00% turkménistan 1 0.00% motjaba 1 0.00% tunisie 1 0.00% saoudite 1 0.00% fong 1 0.00% saminejad 1 0.00% josep 1 0.00% borrell 1 0.00% sharer 1 0.00% (tunisie 1 0.00% (syrie 1 0.00% asiaweek 1 0.00% granting 1 0.00% permanent 1 0.00% consider 1 0.00% district 1 0.00% knowledge 1 0.00% ncgd 1 0.00% ask 1 0.00% amend 1 0.00% would 1 0.00% status 1 0.00% days 1 0.00% voter 1 0.00% ddô3i… 1 0.00% ga(1m 1 0.00% ítn 1 0.00% thuy 1 0.00% quoc 1 0.00% dear 1 0.00% congresswoman 1 0.00% i'm 1 0.00% congressman 1 0.00% senator 1 0.00% ddu3ng 1 0.00% thích… 1 0.00% viê2n 1 0.00% dài… 1 0.00% 203

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 213 1 0.00% laurie 1 0.00% marcom 1 0.00% dduma 1 0.00% jurapa 1 0.00% (tiêu 1 0.00% ho'5 1 0.00% 1735 1 0.00% (rà 1 0.00% sse 1 0.00% (quyê2n 1 0.00% nhoay 1 0.00% larue 1 0.00% nhoáy 1 0.00% singarpore 1 0.00% nanyang 1 0.00% cabramatta 1 0.00% (ntu 1 0.00% vietact 1 0.00% saconnex 1 0.00% (riverside 1 0.00% uptown 1 0.00% 1106 1 0.00% nazran 1 0.00% (ingushetia 1 0.00% (chicago 1 0.00% komsomolskaya 1 0.00% (liège 1 0.00% d'avroy 1 0.00% novosty 1 0.00% yevloyev 1 0.00% nazir 1 0.00% hamburgers 1 0.00% gw 1 0.00% anton 1 0.00% surikov 1 0.00% khattab 1 0.00% janet 1 0.00% tass 1 0.00% shirt 1 0.00% 100kg 1 0.00% (hu'o'1ng 1 0.00% califorina 1 0.00% itar 1 0.00% fall 1 0.00% outfitters 1 0.00% tometa 1 0.00% rorahbacher 1 0.00% 20m2 1 0.00% (jba 1 0.00% pte 1 0.00% kpro 1 0.00% nu'1o'1c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 214 1 0.00% watchguard 1 0.00% consultans 1 0.00% blitz 1 0.00% robina 1 0.00% lúi 1 0.00% (amtac 1 0.00% manufacturing 1 0.00% hoi3 1 0.00% húi 1 0.00% deryx 1 0.00% aò 1 0.00% hai3 1 0.00% taì 1 0.00% creative 1 0.00% lencii 1 0.00% cac 1 0.00% giong 1 0.00% butcher 1 0.00% 420ha 1 0.00% um 1 0.00% mcnulty 1 0.00% ngu5p 1 0.00% journey 1 0.00% from 1 0.00% ddu'o'1c 1 0.00% kearney 1 0.00% he3 1 0.00% brass 1 0.00% 25m2 1 0.00% (ma3ng 1 0.00% réo 1 0.00% goonline 1 0.00% globals 1 0.00% (ddo'n 1 0.00% tùm 1 0.00% challenge 1 0.00% dunnett 1 0.00% rar 1 0.00% 1200 1 0.00% seul 1 0.00% nations 1 0.00% passent 1 0.00% (toutes 1 0.00% idéologies 1 0.00% restent 1 0.00% (ddh4 1 0.00% gtgt 1 0.00% lenduong 1 0.00% 60m2 1 0.00% (cbcc 1 0.00% gaule 1 0.00% 504

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 215 1 0.00% (lu'o'ng 1 0.00% (chú 1 0.00% (ca3m 1 0.00% 455 1 0.00% 384 1 0.00% d'etat 1 0.00% 471 1 0.00% (théorie 1 0.00% malaisia 1 0.00% (nghèo 1 0.00% 267 1 0.00% trung… 1 0.00% cho'5… 1 0.00% ddh3 1 0.00% ddh2 1 0.00% 1388 1 0.00% workshops 1 0.00% amazing 1 0.00% race 1 0.00% tu'… 1 0.00% anniversary 1 0.00% delivery 1 0.00% (1999 1 0.00% ksnd 1 0.00% (ddê2 1 0.00% sunnybrook 1 0.00% 15g30 1 0.00% accent 1 0.00% townhall 1 0.00% (1882 1 0.00% ddh1 1 0.00% prince 1 0.00% 2011 1 0.00% exile 1 0.00% 249 1 0.00% dang_vanhai2001 1 0.00% nhót 1 0.00%(nguoikhungbohientu_121171 1 0.00% ho'5m 1 0.00% (kính 1 0.00% (hong_quyen99 1 0.00% (nv1 1 0.00% (nguyenthithuydung712 1 0.00% (yourangel172 1 0.00% (huybinhmt88 1 0.00% (fareasttravel 1 0.00% tuê1ch 1 0.00% thuâ3n 1 0.00% (dhs 1 0.00% nhu'1ng 1 0.00% tô4 1 0.00% thiê2u

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 216 1 0.00% yê1u… 1 0.00% detention 1 0.00% ra(1p 1 0.00% arbitrary 1 0.00% nghiê5m… 1 0.00% hcm… 1 0.00% unicode 1 0.00% ddàu 1 0.00% (lí 1 0.00% jessy 1 0.00% vu'o'5c 1 0.00% (180 1 0.00% bua 1 0.00% tccn 1 0.00% 247 1 0.00% bich 1 0.00% (nhê1ch 1 0.00% khâ3y 1 0.00% ngi5ch 1 0.00% (chiê1u 1 0.00% cooke 1 0.00% (aei 1 0.00% font 1 0.00% phàm 1 0.00% ra3ng 1 0.00% ãi 1 0.00% awards 1 0.00% bettina 1 0.00% (cddnv 1 0.00% biê1 1 0.00% (commission 1 0.00% ecosoc 1 0.00% (spam 1 0.00% ru´t 1 0.00% digital 1 0.00% itu 1 0.00% statement 1 0.00% expression 1 0.00% investor 1 0.00% undp 1 0.00% ba´c 1 0.00% 272 1 0.00% 373 1 0.00% 661 1 0.00% shaer 1 0.00% 030m3 1 0.00% lel 1 0.00% 600m3 1 0.00% wanadoo 1 0.00% lu´c 1 0.00% tru'o' 1 0.00% 8ha

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 217 1 0.00% vàl 1 0.00% (president 1 0.00% najib 1 0.00% wanneroo 1 0.00% phcg 1 0.00% (btc 1 0.00% ngâu 1 0.00% nordin 1 0.00% ouzbékistan 1 0.00% arabie 1 0.00% népal 1 0.00% onn 1 0.00% libye 1 0.00% buô4i 1 0.00% (wa 1 0.00% la5i… 1 0.00% quiê1t 1 0.00% 2015 1 0.00% (phcg 1 0.00% not 1 0.00% (so5t 1 0.00% hu'o'u 1 0.00% các… 1 0.00% afraid 1 0.00% chiu5 1 0.00% 660m3 1 0.00% lâ5y 1 0.00% (nmnth 1 0.00% (saigontourist 1 0.00% (226 1 0.00% 153 1 0.00% ceausescu 1 0.00% (chcnnb 1 0.00% cón 1 0.00% 3kg 1 0.00% 22g 1 0.00% 18g30 1 0.00% 220m2 1 0.00% ddh5 1 0.00% giàng 1 0.00% year 1 0.00% idols 1 0.00% young 1 0.00% 8215969 1 0.00% 8214444 1 0.00% ddô5… 1 0.00% qua(1p 1 0.00% 8214730 1 0.00% gó 1 0.00% (vài 1 0.00% tolga 1 0.00% (tunisia

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 218 1 0.00% eidhr 1 0.00% suhaili 1 0.00% vô2ng 1 0.00% biê1t10 1 0.00% 389 1 0.00% 381 1 0.00% zo' 1 0.00% (tra3 1 0.00% (hddnd 1 0.00% (european 1 0.00% 960 1 0.00% (100ha 1 0.00% thê1ch 1 0.00% to'3n 1 0.00% (sawaco 1 0.00% súy 1 0.00% osce 1 0.00% kavaratti 1 0.00% (vienna 1 0.00% (trình 1 0.00% (2004 1 0.00% anti 1 0.00% chtaura 1 0.00% litami 1 0.00% ne5p 1 0.00% békaa 1 0.00% masnaa 1 0.00% anjar 1 0.00% tho5t 1 0.00% arménia 1 0.00% jalala 1 0.00% taanayel 1 0.00% vis 1 0.00% neon 1 0.00% (cap 1 0.00% jbail 1 0.00% siniori 1 0.00% aitaroune 1 0.00% xòa 1 0.00% ddi5u 1 0.00% k'ho 1 0.00% atlantis 1 0.00% (nasa 1 0.00% 1ngu'o'2i 1 0.00% ilizarov 1 0.00% (los 1 0.00% caifornia 1 0.00% stralsund 1 0.00% bieng 1 0.00% shangye 1 0.00% ddâ2t 1 0.00% ru5i

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 219 1 0.00% (beiruth 1 0.00% yucca 1 0.00% joshua 1 0.00% tree 1 0.00% mòn'' 1 0.00% shyam 1 0.00% saran 1 0.00% taiba 1 0.00% (tân 1 0.00% 12g 1 0.00% chandrasekharan 1 0.00% farnboough 1 0.00% ''trong 1 0.00% (seatlle 1 0.00% talat 1 0.00% masood 1 0.00% faoud 1 0.00% búng 1 0.00% (djakarta 1 0.00% ddu'o'2ng… 1 0.00% gú 1 0.00% là… 1 0.00% to3i 1 0.00% gmt 1 0.00% sunda 1 0.00% 05g00 1 0.00% o3m 1 0.00% nurdina 1 0.00% que5t 1 0.00% lomonosov 1 0.00% 24gio'2 1 0.00% mick 1 0.00% (tro'1 1 0.00% gusmao 1 0.00% eke 1 0.00% (muô1n 1 0.00% o'1n 1 0.00% vedeno 1 0.00% imam 1 0.00% ichkeria 1 0.00% djakarta 1 0.00% junaidi 1 0.00% nu'1a 1 0.00% dudi 1 0.00% jeltsin 1 0.00% (pangandaran 1 0.00% cilacap 1 0.00% (kho'3i 1 0.00% villepine 1 0.00% (bqldaddtltddqt 1 0.00% dody 1 0.00% sumantiawan

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 220 1 0.00% kirton 1 0.00% hzbollah 1 0.00% bachar 1 0.00% freidman 1 0.00% alon 1 0.00% friedman 1 0.00% 30kg 1 0.00% mccomack 1 0.00% malachenko 1 0.00% (palestine 1 0.00% 500m2 1 0.00% ghei 1 0.00% still 1 0.00% deeply 1 0.00% tomorrow 1 0.00% face 1 0.00% difficulties 1 0.00% rooted 1 0.00% meaning 1 0.00% its 1 0.00% true 1 0.00% rise 1 0.00% up 1 0.00% though 1 0.00% sùi 1 0.00% pleikrông 1 0.00% sâ2n 1 0.00% sòm 1 0.00% tro3 1 0.00% yaly 1 0.00% 1g 1 0.00% even 1 0.00% thu'4ng 1 0.00% tuabin 1 0.00% nu'5ng 1 0.00% creed 1 0.00% cu'óp 1 0.00% du'4ng 1 0.00% luyn(dâ2u 1 0.00% fulrô 1 0.00% 475 1 0.00% made 1 0.00% gìo' 1 0.00% ri3nh 1 0.00% nu'óc 1 0.00% vi3nh 1 0.00% cuo'1p 1 0.00% ru'1c 1 0.00% self 1 0.00% evident 1 0.00% truths 1 0.00% hold

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 221 1 0.00% these 1 0.00% all 1 0.00% 735 1 0.00% manifesto 1 0.00% ke5 1 0.00% created 1 0.00% equal 1 0.00% lumpua 1 0.00% siddiqui 1 0.00% vu5c 1 0.00% muhaddin 1 0.00% chimma 1 0.00% aftab 1 0.00% ra(5t 1 0.00% fayyaz 1 0.00% bót 1 0.00% zulfeqar 1 0.00% sayyad 1 0.00% zabiuddin 1 0.00% gulam 1 0.00% karachi 1 0.00% mohan 1 0.00% gô5i 1 0.00% gút 1 0.00% (kathmandu 1 0.00% bhattari 1 0.00% bahadur 1 0.00% karki 1 0.00% dhak 1 0.00% ngâ2u 1 0.00% nepak 1 0.00% tayyaba 1 0.00% (bi3 1 0.00% mandelson 1 0.00% brussel 1 0.00% danlentieng 1 0.00% (geneva 1 0.00% hiê2m 1 0.00% 2t5 1 0.00% (amm 1 0.00% 1gio'2 1 0.00% co3n 1 0.00% ______1 0.00% (toàn 1 0.00% pham 1 0.00% tandtc 1 0.00% gi 1 0.00% âmthâ2m 1 0.00% (làng 1 0.00% me5c 1 0.00% khong 1 0.00% phau

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 222 1 0.00% ptdcvn 1 0.00% xa(1p 1 0.00% vksndtc 1 0.00% (hùng 1 0.00% (tin 1 0.00% (ldd 1 0.00% (mlnqvn 1 0.00% (cddvn 1 0.00% gi3a 1 0.00% stephany 1 0.00% dawson 1 0.00% cu'ú 1 0.00% gìo'1i 1 0.00% ghê1ch 1 0.00% thiê2ng 1 0.00% ronie 1 0.00% guyer 1 0.00% ddoì 1 0.00% ekazhevo 1 0.00% hôp 1 0.00% lorreta 1 0.00% (gddpt 1 0.00% fry 1 0.00% toòng 1 0.00% 5g 1 0.00% (hòa 1 0.00% so'2n 1 0.00% weston 1 0.00% derek 1 0.00% tonks 1 0.00% calgary 1 0.00% 200kg 1 0.00% (scarborough 1 0.00% networks 1 0.00% (oasis 1 0.00% television 1 0.00% montréal 1 0.00% parliamentary 1 0.00% (conservative 1 0.00% choviê5t 1 0.00% (tuyên 1 0.00% luô1ng 1 0.00% rúm 1 0.00% ngoe5o 1 0.00% 0g 1 0.00% bô4 1 0.00% rob 1 0.00% (ndp 1 0.00% (centre 1 0.00% block 1 0.00% saigonforsaigon 1 0.00% tarkhan

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 223 1 0.00% interfax 1 0.00% ganizhev 1 0.00% souvenir 1 0.00% l'exode 1 0.00% monde 1 0.00% belgique 1 0.00% d'accueil 1 0.00% remercient 1 0.00% réfugiés 1 0.00% vietnamiens 1 0.00% giu'1p 1 0.00% urals 1 0.00% démocratique 1 0.00% vremya 1 0.00% (monument 1 0.00% résistance 1 0.00% humanisme 1 0.00% solidarité 1 0.00% isa 1 0.00% kushtov 1 0.00% caritas 1 0.00% croix 1 0.00% nous 1 0.00% rahir 1 0.00% liege 1 0.00% maaskroon 1 0.00% (tín 1 0.00% allard 1 0.00% calif 1 0.00% nãn 1 0.00% ddu'o'1i 1 0.00% inches 1 0.00% (little 1 0.00% akhmed 1 0.00% coeurs 1 0.00% cet 1 0.00% espace 1 0.00% vivre 1 0.00% sommes 1 0.00% heureux 1 0.00% paix 1 0.00% restera 1 0.00% jamais 1 0.00% ancêtres 1 0.00% liberté 1 0.00% zakayev 1 0.00% quyê5n 1 0.00% waal 1 0.00% bayside 1 0.00% lofgren 1 0.00% makarkine 1 0.00% budennovsk

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 224 1 0.00% height 1 0.00% linda 1 0.00% trannh 1 0.00% trê3n 1 0.00% sd 1 0.00% carpenter 1 0.00% nù 1 0.00% weisel 1 0.00% representative 1 0.00% nhâ4y 1 0.00% putlizer 1 0.00% ú 1 0.00% hollen 1 0.00% cato 1 0.00% thoang 1 0.00% 75w 1 0.00% sos 1 0.00% vista 1 0.00% barajneh 1 0.00% mineralnye 1 0.00% bourj 1 0.00% (caucasus 1 0.00% vody 1 0.00% khasbulatov 1 0.00% tamarov 1 0.00% supyan 1 0.00% (dili 1 0.00% hôm19 1 0.00% ruslan 1 0.00% ankara 1 0.00% escondido 1 0.00% (thí 1 0.00% mesa 1 0.00% sandwishes 1 0.00% mira 1 0.00% (super 1 0.00% gagra 1 0.00% (ha(1n 1 0.00% benedicto 1 0.00% splitting 1 0.00% (vatican 1 0.00% olmstead 1 0.00% april 1 0.00% gmc 1 0.00% black 1 0.00% 7300 1 0.00% soldier 1 0.00% realty 1 0.00% arlington 1 0.00% (hawaii 1 0.00% galleries 1 0.00% valleyjo

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 225 1 0.00% furniture 1 0.00% orchid 1 0.00% tro'2 1 0.00% vaudreuil 1 0.00% (bloc 1 0.00% meili 1 0.00% faille 1 0.00% soulanges 1 0.00% leyna 1 0.00% dzu4ng 1 0.00% satelitte 1 0.00% québec 1 0.00% mèm 1 0.00% sheraton 1 0.00% gupta 1 0.00% npr 1 0.00% sanjay 1 0.00% matsukawa 1 0.00% tho3m 1 0.00% 6g30 1 0.00% ddìa 1 0.00% diversity 1 0.00% justice 1 0.00% civil 1 0.00% social 1 0.00% lori 1 0.00% le5p 1 0.00% u'5c 1 0.00% journalist 1 0.00% waikiki 1 0.00% (aaja 1 0.00% achievement 1 0.00% mâ2n 1 0.00% kashiwahara 1 0.00% toyata 1 0.00% 23g57 1 0.00% tritia 1 0.00% êkip 1 0.00% obi 1 0.00% yaourt 1 0.00% 50m 1 0.00% gu5 1 0.00% old 1 0.00% calcimex 1 0.00% (vân 1 0.00% claude 1 0.00% trafford 1 0.00% chelea 1 0.00% almeida 1 0.00% (mercosur 1 0.00% (venezuela 1 0.00% zola

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 226 1 0.00% seaman 1 0.00% gianfranco 1 0.00% mòi 1 0.00% ntn 1 0.00% gddt 1 0.00% vasquez 1 0.00% nicanor 1 0.00% (paraguay 1 0.00% andriy 1 0.00% eidur 1 0.00% gudjohnsen 1 0.00% ddi4a… 1 0.00% bridge 1 0.00% (rlc 1 0.00% irvine 1 0.00% under 1 0.00% entrepreneurs 1 0.00% oleguer 1 0.00% fragrances 1 0.00% bronkhorst 1 0.00% 21và 1 0.00% 1x2cm 1 0.00% feyenoord 1 0.00% salomon 1 0.00% shevchenko 1 0.00% gô2 1 0.00% 145mmol 1 0.00% (chuyê3n 1 0.00% urê 1 0.00% achilles 1 0.00% atletico 1 0.00% bilbao 1 0.00% overmars 1 0.00% (du5c 1 0.00% 8g10 1 0.00% sabato 1 0.00% (dehydroepiandrosteron 1 0.00% lillian 1 0.00% 9km 1 0.00% 482 1 0.00% assam 1 0.00% ra5o 1 0.00% 40oc 1 0.00% queens 1 0.00% oxtocin 1 0.00% sôcôla 1 0.00% amphetamin 1 0.00% du5c… 1 0.00% serotonin 1 0.00% nhoài 1 0.00% bull 1 0.00% ocytocin

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 227 1 0.00% matxa 1 0.00% phiê1m 1 0.00% (tu' 1 0.00% phenylethylamine 1 0.00% goldstein 1 0.00% 2051 1 0.00% 437 1 0.00% dietary 1 0.00% yongfeng 1 0.00% allowances 1 0.00% recommended 1 0.00% 63gr 1 0.00% 50gr 1 0.00% rda 1 0.00% cddv 1 0.00% 423 1 0.00% yongjian 1 0.00% 278 1 0.00% ahn 1 0.00% digest 1 0.00% women 1 0.00% brigham 1 0.00% reader's 1 0.00% 1gr 1 0.00% caffeine 1 0.00% cortisol 1 0.00% schweimler 1 0.00% ignacio 1 0.00% cooperation 1 0.00% vcci 1 0.00% development 1 0.00% traivenguon2006 1 0.00% swedish 1 0.00% 705 1 0.00% ubnviet 1 0.00% 9306737 1 0.00% 8225540 1 0.00% chamber 1 0.00% industry 1 0.00% labour 1 0.00% headphone 1 0.00% danna 1 0.00% (3d 1 0.00% inside 1 0.00% savannah 1 0.00% stark 1 0.00% moreau 1 0.00% ilo 1 0.00% improve 1 0.00% hayward 1 0.00% start 1 0.00% 066

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 228 1 0.00% trial 1 0.00% thu3a 1 0.00% suite 1 0.00% cyberscrub 1 0.00% privacy 1 0.00% ……………ngày……… 1 0.00% ngày……… 1 0.00% ……………no'i 1 0.00% ………… 1 0.00% ajoka 1 0.00% khuy 1 0.00% (dùng 1 0.00% kt3 1 0.00% ngày… 1 0.00% …tháng… 1 0.00% 9302127 1 0.00% 23gio'2 1 0.00% 0983326718 1 0.00% ………………fax 1 0.00% ngày…………… 1 0.00% (07 1 0.00%……………………………………… 1 0.00%…………………………ddtddd 1 0.00% asiad 1 0.00% apphich 1 0.00% socceroos 1 0.00% wenger 1 0.00% guillou 1 0.00% arsene 1 0.00% hiddink 1 0.00% safety 1 0.00% broadcasting 1 0.00% ba(ngrôn 1 0.00% mô5t… 1 0.00% hungaria 1 0.00% librex 1 0.00% ab 1 0.00% shibuya 1 0.00% mutu 1 0.00% hottest 1 0.00% 2214 1 0.00% ctv 1 0.00% moravskoslezsko 1 0.00% four 1 0.00% tošenovský 1 0.00% c1 1 0.00% zedník 1 0.00% footspa 1 0.00% hobson 1 0.00% sceaux 1 0.00% osca 1 0.00% 57km

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 229 1 0.00% 1g8'56 1 0.00% antony 1 0.00% (safa 1 0.00% doping 1 0.00% alberto 1 0.00% yurczyszyn 1 0.00% news24 1 0.00% mines 1 0.00% riêng… 1 0.00% (afc 1 0.00% trìu 1 0.00% (ffa 1 0.00% (nails 1 0.00% adn 1 0.00% creusot 1 0.00% montceau 1 0.00% vâ1p( 1 0.00% moore 1 0.00% o'neill 1 0.00% thô5p 1 0.00% ths 1 0.00% gorenberg 1 0.00% sõi 1 0.00% gershom 1 0.00% (tim 1 0.00% maariv 1 0.00% (israel 1 0.00% specific 1 0.00% cobalt 1 0.00% antigen 1 0.00% fv 1 0.00% ly5 1 0.00% ss 1 0.00% gorontlo 1 0.00% huddersfield 1 0.00% (aman 1 0.00% o'5 1 0.00% hagana 1 0.00% gurion 1 0.00% halperin 1 0.00% eichmann 1 0.00% vitebsk 1 0.00% (prostate 1 0.00% 3cm 1 0.00% 25cm 1 0.00% salim 1 0.00% mansour 1 0.00% jamal 1 0.00% 8kg 1 0.00% (5x600 1 0.00% yomiuri 1 0.00% (hô1t

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 230 1 0.00% 250g 1 0.00% 0kg 1 0.00% (jamal 1 0.00% 13cm 1 0.00% 49cm 1 0.00% 48cm 1 0.00% nha(1t 1 0.00% (tlt 1 0.00% 46cm 1 0.00% 12cm 1 0.00% 1cm 1 0.00% bethlehem 1 0.00% 42cm 1 0.00% jenin 1 0.00% (xinhua 1 0.00% baidoa 1 0.00% mogadishu 1 0.00% birmingham 1 0.00% (ddài 1 0.00% tnhk 1 0.00% abdullahi 1 0.00% châng 1 0.00% ndonesia 1 0.00% 596 1 0.00% yusuf 1 0.00% ngu'o'2i… 1 0.00% yanjin 1 0.00% (liba(ng 1 0.00% haret 1 0.00% (bernama 1 0.00% (voa 1 0.00% (iom 1 0.00% hreik 1 0.00% aqsa 1 0.00% aljazeera 1 0.00% newscastle 1 0.00% rô1ckét 1 0.00% shiefield 1 0.00% (dld 1 0.00% ynetnews 1 0.00% môto' 1 0.00% ernesto 1 0.00% qua(3ng 1 0.00% hanan 1 0.00% to'2i 1 0.00% debkafile 1 0.00% intelligencer 1 0.00% nhão 1 0.00% 70x70cm 1 0.00% 15m 1 0.00% 5g40 1 0.00% meesiriphan

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 231 1 0.00% dld 1 0.00% panom 1 0.00% boonriang 1 0.00% chuchaisaengrat 1 0.00% yukol 1 0.00% phán… 1 0.00% (kyodo 1 0.00% (idf 1 0.00% limlamthong 1 0.00% (who 1 0.00% lán 1 0.00% loa(ng 1 0.00% sharia 1 0.00% history 1 0.00% last 1 0.00% albopictus 1 0.00% loews 1 0.00% estée 1 0.00% tisch 1 0.00% aegypti 1 0.00% tomahawk 1 0.00% end 1 0.00% lô3n 1 0.00% roma 1 0.00% nhô3n 1 0.00% erythromycin 1 0.00% haaretz 1 0.00% salvatore 1 0.00% prolactin 1 0.00% fukuyama 1 0.00% testosteron 1 0.00% striano 1 0.00% shevardnadze 1 0.00% lauder 1 0.00% bóng…ra 1 0.00% rondon 1 0.00% kaiser 1 0.00% môtô 1 0.00% harley 1 0.00% mu5t 1 0.00% vasopressine… 1 0.00% norepinephrine 1 0.00% 70m 1 0.00% nhiê3u 1 0.00% mdairej 1 0.00% 580 1 0.00% melinda 1 0.00% (baby 1 0.00% mozart 1 0.00% chocolate 1 0.00% christie's 1 0.00% boomer

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 232 1 0.00% (massachusetts 1 0.00% raisio 1 0.00% rockland 1 0.00% (sxh 1 0.00% valio 1 0.00% nitrofurantoin 1 0.00% arginin 1 0.00% siôn 1 0.00% juin 1 0.00% l'arche 1 0.00% 474 1 0.00% methadon 1 0.00% quinolon 1 0.00% (lê4 1 0.00% cimetidin 1 0.00% (cocain 1 0.00% mig 1 0.00% fructose 1 0.00% degania 1 0.00% tiberia 1 0.00% (kibbutz 1 0.00% (3200 1 0.00% samakh 1 0.00% 1923 1 0.00% akaba 1 0.00% 4kg 1 0.00% uthant 1 0.00% 2kg 1 0.00% nasser 1 0.00% spiramycin 1 0.00% (tuâ2n 1 0.00% l'express 1 0.00% quid 1 0.00% rabinovich 1 0.00% koms 1 0.00% skyhawk 1 0.00% galilê 1 0.00% gentamycin 1 0.00% chlotetracyclin 1 0.00% mirage 1 0.00% sadate 1 0.00% trimoxazol 1 0.00% moshe 1 0.00% dayan 1 0.00% yariv 1 0.00% 16km 1 0.00% aaron 1 0.00% yom 1 0.00% transformed 1 0.00% middle 1 0.00% encounter 1 0.00% kippur

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 233 1 0.00% epic 1 0.00% 60km 1 0.00% sô2m 1 0.00% (hâ2u 1 0.00% stabilimenta 1 0.00% cbc 1 0.00% afar 1 0.00% (judas 1 0.00% iscariot 1 0.00% envisat 1 0.00% bafoeg 1 0.00% (esa 1 0.00% sphecidae 1 0.00% (lddsvvntd 1 0.00% (phu5c 1 0.00% svvn 1 0.00% gps 1 0.00% muenchen 1 0.00% fischer 1 0.00% liê5u… 1 0.00% 000km 1 0.00% buô5t 1 0.00% mtqggpmnvn 1 0.00% (dth 1 0.00% (global 1 0.00% m8 1 0.00% mimosa 1 0.00% (giô1ng 1 0.00% báp 1 0.00% sddnd 1 0.00% ashao 1 0.00% pao 1 0.00% 719 1 0.00% ddakto 1 0.00% khamjei 1 0.00% 015 1 0.00% (globol 1 0.00% amylose 1 0.00% (hàm 1 0.00% râ 1 0.00% floria 1 0.00% passion 1 0.00% (nhe5 1 0.00% bright 1 0.00% gpn 1 0.00% lu´a 1 0.00% hiê 1 0.00% giô´ng 1 0.00% hình… 1 0.00% mastercard 1 0.00% pétersburg 1 0.00% joern

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 234 1 0.00% nielsen 1 0.00% skov 1 0.00% yoshii 1 0.00% (thô1t 1 0.00% (lht 1 0.00% (1970 1 0.00% nogaard 1 0.00% vietsovpetro 1 0.00% lu'o'3i 1 0.00% (biê1t 1 0.00% chê3m 1 0.00% ngâ1p 1 0.00% xnk 1 0.00% nhòm 1 0.00% chê5 1 0.00% ilulissat 1 0.00% hydrocarbure 1 0.00% o'3… 1 0.00% (cpi 1 0.00% hiê5u… 1 0.00% (vn 1 0.00% bonn 1 0.00% kiê1u 1 0.00% 1901 1 0.00% ying 1 0.00% 0001 1 0.00% xê3 1 0.00% sao( 1 0.00% svvnch 1 0.00% (tro'2i 1 0.00% (mô2ng 1 0.00% 4dm 1 0.00% (cospar 1 0.00% ngo'3 1 0.00% salat 1 0.00% (ceerd 1 0.00% ro'1i 1 0.00% nho3m 1 0.00% muô1ng 1 0.00% chu'3 1 0.00% tu'3… 1 0.00% scout 1 0.00% laboratory 1 0.00% ngâ1u 1 0.00% toefl 1 0.00% úùy 1 0.00% ielts 1 0.00% (gv 1 0.00% gmat 1 0.00% u'1dda5i 1 0.00% lovelygirl4687 1 0.00% ddoài

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 235 1 0.00% (càn 1 0.00% phâ1p 1 0.00% (cec 1 0.00% de3 1 0.00% nhipcautinhban20042003 1 0.00% (diem 1 0.00% moa 1 0.00% kv3 1 0.00% dduà 1 0.00% buo'1c 1 0.00% (gdgt 1 0.00% jacket 1 0.00% (ma5o 1 0.00% khêu 1 0.00% ni5t 1 0.00% (kim 1 0.00% quo' 1 0.00% jice 1 0.00% (gakushushoreihi 1 0.00% (eju 1 0.00% (jasso 1 0.00% mext 1 0.00% bu'1u 1 0.00% (nghiên 1 0.00% a41 1 0.00% (jds 1 0.00% (mext 1 0.00% hcmcgj 1 0.00% (xâ1u 1 0.00% 417 1 0.00% (tô1t 1 0.00% (nguy 1 0.00% (kha(1c 1 0.00% 8225314 1 0.00% jp 1 0.00% emb 1 0.00% tô5n 1 0.00% su'3u 1 0.00% (ddt 1 0.00% (ddào 1 0.00% clor 1 0.00% endosulfan 1 0.00% retro 1 0.00% scène 1 0.00% (brightness 1 0.00% (para 1 0.00% toujours 1 0.00% thàn 1 0.00% para 1 0.00% niclosamite 1 0.00% jour 1 0.00% (contrast

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 236 1 0.00% sintef 1 0.00% ddãy 1 0.00% bars 1 0.00% vie 1 0.00% zdnets 1 0.00% volume 1 0.00% don 1 0.00% metteur 1 0.00% (color 1 0.00% clarinette 1 0.00% ruby 1 0.00% châ3u 1 0.00% (minh 1 0.00% (hongtrung1987hnd 1 0.00% exciter 1 0.00% hu5 1 0.00% ddu'o5c 1 0.00% kv2 1 0.00% (ddiê3m 1 0.00% (akhuong1967 1 0.00% (esc_kenvin_matnick 1 0.00% (kv2 1 0.00% lùm 1 0.00% pha(1c 1 0.00% giu'ã 1 0.00% nhô5t 1 0.00% ke4m 1 0.00% (bourbo 1 0.00% phia 1 0.00% (caporal 1 0.00% rubinokia 1 0.00% la5ch 1 0.00% (sergent 1 0.00% (retroactive 1 0.00% effect 1 0.00% ddâ5u' 1 0.00% (mòng 1 0.00% lb 1 0.00% vladimirovich 1 0.00% vladimia 1 0.00% 'giâ5t 1 0.00% 'tu'2 1 0.00% du'4' 1 0.00% châu' 1 0.00% thi' 1 0.00% tòng 1 0.00% 'lanh 1 0.00% qua(5n 1 0.00% nhín 1 0.00% thu'o'3 1 0.00% ventures 1 0.00% (innovative

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 237 1 0.00% suleco 1 0.00% vôn 1 0.00% va(3ng 1 0.00% avenal 1 0.00% lddtbxh 1 0.00% jos 1 0.00% vích 1 0.00% 620 1 0.00% tntqcg 1 0.00% (vl 1 0.00% (qddd 1 0.00% pttn 1 0.00% nhim 1 0.00% ipp 1 0.00% 150mw 1 0.00% bot 1 0.00% hinh 1 0.00% (cph 1 0.00% 342ha 1 0.00% andhra 1 0.00% vizag 1 0.00% pradesh 1 0.00% xéc 1 0.00% newswires 1 0.00% (xác 1 0.00% 747ha 1 0.00% 268 1 0.00% 144ha 1 0.00% (ddtntqcg 1 0.00% tuô1n 1 0.00% when 1 0.00% 8682902 1 0.00% kimngannguthien 1 0.00% 35f8 1 0.00% (wikimedia 1 0.00% mfa 1 0.00% pm 1 0.00% 3421 1 0.00% intouch 1 0.00% 823 1 0.00% cls 1 0.00% 6928 1 0.00% mofa 1 0.00% …………………… 1……………………………………………………………………… 0.00% 1 0.00% ddiêng 1 0.00% máclê 1…………………………………………………………………… 0.00% 1 0.00% dda5p… 1 0.00% 31cp 1 0.00% 56ndd 1 0.00% cairô

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 238 1 0.00% quo'1i 1 0.00% 45' 1 0.00% vnaemb 1 0.00% stone 1 0.00% olivier 1 0.00% toa5i 1 0.00% 100000 1 0.00% 90000 1 0.00% (già 1 0.00% heaven 1 0.00% (thâ5m 1 0.00% earth 1 0.00% place 1 0.00% changed 1 0.00% thúng 1 0.00% xoà 1 0.00% 0953359738 1 0.00% 336 1 0.00% 8612 1 0.00% to3ng 1 0.00% thanhniendanchuvn 1 0.00% (zoning 1 0.00% mu3ng 1 0.00% buddhist 1 0.00% 1189 1 0.00% (tâm 1 0.00% 717 1 0.00% 830 1 0.00% 600dd 1 0.00% thong 1 0.00% dong 1 0.00% bathurst 1 0.00% pib 1 0.00% lu5n 1 0.00% (lào 1 0.00% (bidina 1 0.00% bidiphar 1 0.00% 900dd 1 0.00% button 1 0.00% mississauga 1 0.00% donut 1 0.00% 'nu'o'1c 1 0.00% du4ng' 1 0.00% (prubf1 1 0.00% ngái 1 0.00% mongthong 1 0.00% (projector 1 0.00% quít 1 0.00% giava 1 0.00% champasak 1 0.00% ghiê1c 1 0.00% 608

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 239 1 0.00% 513 1 0.00% (rô 1 0.00% 414 1 0.00% 176 1 0.00% (ê 1 0.00% 862 1 0.00% 064 1 0.00% 897 1 0.00% 983 1 0.00% ttgdck 1 0.00% bonjour 1 0.00% (bddhq 1 0.00% ho'i… 1 0.00% cbf 1 0.00% kém… 1 0.00% (vietcombank 1 0.00% (ita 1 0.00% hn 1 0.00% hqb 1 0.00% kìn 1 0.00% hiê5 1 0.00% country 1 0.00% voltaire 1 0.00% helmut 1 0.00% (hiê1n 1 0.00% panke 1 0.00% (báo 1 0.00% chu'3ng 1 0.00% que 1 0.00% norbert 1 0.00% parce 1 0.00% (on 1 0.00% reithofer 1 0.00% 440usd 1 0.00% 839 1 0.00% 1913 1 0.00% cu'4u 1 0.00% 300mw 1 0.00% 916 1 0.00% (chu'2ng 1 0.00% (cif 1 0.00% (gorky 1 0.00% 400usd 1 0.00% giuse 1 0.00% 415 1 0.00% mettez 1 0.00% u'1a 1 0.00% ddu'o'5c' 1 0.00% 992 1 0.00% radjasa 1 0.00% hatta 1 0.00% 'bông

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 240 1 0.00% otc 1 0.00% our 1 0.00% least 1 0.00% flyer 1 0.00% parking 1 0.00% ddo'm 1 0.00% droitement 1 0.00% marat 1 0.00% levez 1 0.00% genoux 1 0.00% citoyens 1 0.00% kytô 1 0.00% (ddtddd 1 0.00% (tsunami 1 0.00% shao 1 0.00% qiwei 1 0.00% lãn 1 0.00% nhôm 1 0.00% (67 1 0.00% 463 1 0.00% (mâ4u 1 0.00% jerris 1 0.00% 2030 1 0.00% kwong 1 0.00% samurai 1 0.00% (fdi 1 0.00% (2050 1 0.00% (lenovo 1 0.00% sóat 1 0.00% 000km2 1 0.00% sanxia 1 0.00% (danjiangkou 1 0.00% alternative 1 0.00% 782km 1 0.00% (shibaozhai 1 0.00% ha5p 1 0.00% 700m3 1 0.00% tào 1 0.00% (huangling 1 0.00% 208 1 0.00% gini 1 0.00% (600 1 0.00% lo'4n 1 0.00% (350 1 0.00% dúa 1 0.00% (2467 1 0.00% xuezhong 1 0.00% kapila 1 0.00% napoléon 1 0.00% subhash 1 0.00% 443 1 0.00% do'3m

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 241 1 0.00% rombin 1 0.00% elit 1 0.00% (globebeauties 1 0.00% nanh 1 0.00% lo'3n 1 0.00% vo'3n 1 0.00% margarita 1 0.00% irene 1 0.00% di3nh 1 0.00% sáez 1 0.00% rui 1 0.00% chacao 1 0.00% (sars 1 0.00% 479 1 0.00% (468 1 0.00% (551 1 0.00% (1012 1 0.00% 1068 1 0.00% 376 1 0.00% 286 1 0.00% (298 1 0.00% (369 1 0.00% (372 1 0.00% 289 1 0.00% 425 1 0.00% altai 1 0.00% karakorum 1 0.00% 806 1 0.00% boutan 1 0.00% 541 1 0.00% qomolangma 1 0.00% (amua 1 0.00% judita 1 0.00% thiê3m 1 0.00% 848 1 0.00% 464 1 0.00% (280 1 0.00% go'2n 1 0.00% cheong 1 0.00% 028 1 0.00% 1100 1 0.00% ta(2n 1 0.00% thúe 1 0.00% (viê5c 1 0.00% nada 1 0.00% milinic 1 0.00% 1m82 1 0.00% cnocc 1 0.00% 1650 1 0.00% (1911 1 0.00% nghiêu 1 0.00% foon

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 242 1 0.00% 233 1 0.00% (1644 1 0.00% trach 1 0.00% yim 1 0.00% 1325 1 0.00% 1799 1 0.00% (ba(1t 1 0.00% (1711 1 0.00% tíê1c 1 0.00% ho5e 1 0.00% truy5 1 0.00% airline 1 0.00% ngoa 1 0.00% ga(p 1 0.00% 22h15 1 0.00% 1m72 1 0.00% tiê3ng 1 0.00% 19h30 1 0.00% 13h30 1 0.00% pum18 1 0.00% nguoivietonline 1 0.00% vietnamexodus 1 0.00% danchimviet 1 0.00% goa5i 1 0.00% hacker 1 0.00% vietland 1 0.00% seaprodex 1 0.00% vietsopetro 1 0.00% riê1u 1 0.00% doithoai 1 0.00% ykien 1 0.00% chisenga 1 0.00% pctt 1 0.00% 1m69 1 0.00% tgcp 1 0.00% moss 1 0.00% 310 1 0.00% ngút 1 0.00% (trích 1 0.00% cerge 1 0.00% (thông 1 0.00% cu4i 1 0.00% miê1t 1 0.00% 7h30 1 0.00% vu'3a 1 0.00% tre5 1 0.00% 1m85 1 0.00% mofya 1 0.00% dôi 1 0.00% toài 1 0.00% nhô1 1 0.00% 1m67

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 243 1 0.00% gâ4u 1 0.00% ddu4ng 1 0.00% jictzad 1 0.00% nauy 1 0.00% sirikit 1 0.00% hongsakula 1 0.00% lóa 1 0.00% ma(t 1 0.00% dô5c 1 0.00% (sa(1c 1 0.00% ghpgvnt 1 0.00% ttpgqt 1 0.00% to3n 1 0.00% irland 1 0.00% thu'o5ng 1 0.00% cashmir 1 0.00% fedorova 1 0.00% khóat 1 0.00% okinawa 1 0.00% mearsheimer 1 0.00% monroe 1 0.00% amparo 1 0.00% vê3 1 0.00% philipine 1 0.00% xói 1 0.00% pakisstan 1 0.00% lu'5oc 1 0.00% louise 1 0.00% vladdi 1 0.00% vôstô1c 1 0.00% betbeze 1 0.00% (ca5nh 1 0.00% rio'2ng 1 0.00% (hhhv 1 0.00% hotmail 1 0.00% hushmail 1 0.00% loè 1 0.00% 100usd 1 0.00% gém 1 0.00% (495 1 0.00% kuusela 1 0.00% xoan 1 0.00% (vu'2a 1 0.00% arbour 1 0.00% chu5i 1 0.00% mills 1 0.00% trành 1 0.00% (thu'2a 1 0.00% (bê2n 1 0.00% 4gio'2 1 0.00% (cu'1 1 0.00% 323

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 244 1 0.00% ddbqh 1 0.00% (1767 1 0.00% lo5i 1 0.00% chu'ong 1 0.00% tphn 1 0.00% (thu' 1 0.00% 2700002 1 0.00% hddndtp 1 0.00% làthu'o'ng 1 0.00% (qh 1 0.00% 298 1 0.00% oh 1 0.00% thè 1 0.00% kiss 1 0.00% crazy 1 0.00% happy 1 0.00% way 1 0.00% canto 1 0.00% 123 1 0.00% do'5 1 0.00% rù 1 0.00% mõm 1 0.00% nghía 1 0.00% alba 1 0.00% jessica 1 0.00% alessandrie 1 0.00% tre5o 1 0.00% inddonêxia 1 0.00% suraibaia 1 0.00% xa(2ng 1 0.00% manhattan 1 0.00% (oda 1 0.00% inddônêxia 1 0.00% htx 1 0.00% halle 1 0.00% tha3y 1 0.00% pgqt 1 0.00% 1m83 1 0.00% racism 1 0.00% (di4 1 0.00% pirates 1 0.00% (hâ5u 1 0.00% gô5t 1 0.00% berry 1 0.00% mrs 1 0.00% tbd 1 0.00% du'o'1í 1 0.00% kháo 1 0.00% '70s 1 0.00% cung( 1 0.00% mila 1 0.00% nhom

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 245 1 0.00% qua5i 1 0.00% (6m2 1 0.00% 1200cm3 1 0.00% bâ1n 1 0.00% mõ 1 0.00% (cu' 1 0.00% rê 1 0.00% xê2nh 1 0.00% (rô5ng 1 0.00% female 1 0.00% bô2m 1 0.00% kyushu 1 0.00% naomi 1 0.00% su'1a 1 0.00% kunis 1 0.00% luô1t 1 0.00% penelope 1 0.00% ói 1 0.00% cm3 1 0.00% hard 1 0.00% spears 1 0.00% ty2 1 0.00% núm 1 0.00% dô3i 1 0.00% britney 1 0.00% ha(5c 1 0.00% diêm 1 0.00% lõm 1 0.00% soco 1 0.00% hddxx 1 0.00% (la5ng 1 0.00% vòm 1 0.00% rain 1 0.00% tâu 1 0.00% vô1c 1 0.00% niê1t 1 0.00% cu'1t 1 0.00% li5a 1 0.00% traviêncu3a 1 0.00% cover 1 0.00% (ddâ2y 1 0.00% si5n 1 0.00% rãnh 1 0.00% (xa3y 1 0.00% háp 1 0.00% ternet 1 0.00% vo 1 0.00% viêc 1 0.00% si5t 1 0.00% vãy 1 0.00% bê5ch 1 0.00% (ddánh

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 246 1 0.00% (dày 1 0.00% tiên(19 1 0.00% diê1n 1 0.00% 72003 1 0.00% a(5c 1 0.00% gisella 1 0.00% a(2ng 1 0.00% hàon 1 0.00% bec 1 0.00% nhau… 1 0.00% sascha 1 0.00% ddê1ch 1 0.00% rodney 1 0.00% fernandez 1 0.00% (giam 1 0.00% (ddàn 1 0.00% thuli 1 0.00% sithole 1 0.00% ngoãn 1 0.00% pinoza 1 0.00% hrubyova 1 0.00% sikkim 1 0.00% tadjikistan 1 0.00% natasa 1 0.00% kirghizstan 1 0.00% (im 1 0.00% jacqueline 1 0.00% elisabeth 1 0.00% tu'1cthích 1 0.00% (tu'2ng 1 0.00% khoé 1 0.00% (xì 1 0.00% gích 1 0.00% (nha(2m 1 0.00% _bu'2a 1 0.00% kha(1c… 1 0.00% _lô1i 1 0.00% cu'a3 1 0.00% (kìm 1 0.00% fiction 1 0.00% (thâ1p 1 0.00% (vu5 1 0.00% (tra3i 1 0.00% liê5m 1 0.00% heroine 1 0.00% hiê1m( 1 0.00% pulp 1 0.00% huyndai 1 0.00% bay( 1 0.00% kill 1 0.00% nalthan 1 0.00% lòa

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 247 1 0.00% be3o 1 0.00% (luke 1 0.00% tê3nh 1 0.00% (mùa 1 0.00% 822 1 0.00% (tám 1 0.00% prime 1 0.00% luy 1 0.00% saunders 1 0.00% (1973 1 0.00% ad 1 0.00% liê1p 1 0.00% shivern 1 0.00% (ma(5t 1 0.00% peters 1 0.00% plat 1 0.00% tâ1y 1 0.00% buô5cpha3i 1 0.00% vù 1 0.00% flat 1 0.00% hilliman 1 0.00% ve3o 1 0.00% va(ntthu'o'ng 1 0.00% (bút 1 0.00% mia 1 0.00% (1951 1 0.00% so'n( 1 0.00% (ddi5a 1 0.00% (u'u 1 0.00% xa( 1 0.00% herman 1 0.00% belmar 1 0.00% 135 1 0.00% ga(5y 1 0.00% henrik 1 0.00% tép 1 0.00% 17h00 1 0.00% vâ3n 1 0.00% xa(1t 1 0.00% lars 1 0.00% re5t 1 0.00% nghen 1 0.00% (larson 1 0.00% kallstroem 1 0.00% hùm 1 0.00% 10m 1 0.00% medina 1 0.00% kaka 1 0.00% ze 1 0.00% juninho 1 0.00% shangdong 1 0.00% wang

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 248 1 0.00% oách 1 0.00% bei 1 0.00% cafú 1 0.00% lúc17h00 1 0.00% duesseldorf 1 0.00% ping 1 0.00% sepp 1 0.00% warning 1 0.00% (ddoan 1 0.00% early 1 0.00% massimo 1 0.00% barkey 1 0.00% strecker 1 0.00% herren 1 0.00% santis 1 0.00% andreas 1 0.00% (sweden 1 0.00% (fc 1 0.00% go3ng 1 0.00% (arsenal 1 0.00% zlatan 1 0.00% turin 1 0.00% senegal 1 0.00% (aston 1 0.00% (vua 1 0.00% mellberg 1 0.00% isaksson 1 0.00% olaf 1 0.00% (isro 1 0.00% nghi5t 1 0.00% (21t 1 0.00% sàm 1 0.00% vô3 1 0.00% nomez 1 0.00% (20t 1 0.00% tõm 1 0.00% maastricht 1 0.00% hit 1 0.00% (22t 1 0.00% 25t 1 0.00% zdf 1 0.00% turbay 1 0.00% paola 1 0.00% costinho 1 0.00% betancourt 1 0.00% paula 1 0.00% sabrosa 1 0.00% nghi5… 1 0.00% enoch 1 0.00% lenbanon 1 0.00% santomauro 1 0.00% universee

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 249 1 0.00% iaquinta 1 0.00% wolrd 1 0.00% bakker 1 0.00% (62 1 0.00% brigette 1 0.00% gamespot 1 0.00% postiga 1 0.00% murkowski 1 0.00% cafu 1 0.00% hudson 1 0.00% garragher 1 0.00% felipe 1 0.00% koblenz 1 0.00% hollande 1 0.00% mitterrand 1 0.00% capelle 1 0.00% ijssel 1 0.00% aan 1 0.00% musique 1 0.00% sofa 1 0.00% xõa 1 0.00% elysee 1 0.00% fete 1 0.00% (nghi3 1 0.00% cruise 1 0.00% bbc2 1 0.00% tomcruise 1 0.00% ruud 1 0.00% (ba3ng 1 0.00% bakhtiarizadeh 1 0.00% taymoorian 1 0.00% spike 1 0.00% spikelee 1 0.00% rezaei 1 0.00% bbc1 1 0.00% cô5te 1 0.00% kenneth 1 0.00% romaric 1 0.00% (tro'5 1 0.00% tét 1 0.00% kennet 1 0.00% fa 1 0.00% ruiz 1 0.00% (kolumbien 1 0.00% merseyside 1 0.00% 23h20 1 0.00% 17h 1 0.00% (wipo 1 0.00% plasil 1 0.00% lokvenc 1 0.00% rosicky 1 0.00% galasek

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 250 1 0.00% poborsky 1 0.00% kingston 1 0.00% bernabeu 1 0.00% addo 1 0.00% illiasu 1 0.00% paintsil 1 0.00% mensah 1 0.00% rozehnal 1 0.00% (frankreich 1 0.00% ivankovic 1 0.00% poulat 1 0.00% khatibi 1 0.00% hashemian 1 0.00% ferydoon 1 0.00% grygera 1 0.00% jankulovski 1 0.00% tschechien 1 0.00% sandfield 1 0.00% superstar 1 0.00% nwankwo 1 0.00% lo'i 1 0.00% (bdd 1 0.00% cambassio 1 0.00% aniekan 1 0.00% 5m50 1 0.00% messy 1 0.00% giovanna 1 0.00% sport 1 0.00% dudic 1 0.00% eurowindow 1 0.00% melandri 1 0.00% ekaphan 1 0.00% sicily 1 0.00% abbey 1 0.00% rudebox74 1 0.00% giuô5c 1 0.00% ddu'u'5c 1 0.00% llona 1 0.00% almeyda 1 0.00% mitsustar 1 0.00% 16m50 1 0.00% trê3 1 0.00% trâ5u 1 0.00% dello 1 0.00% (78 1 0.00% (88 1 0.00% (41 1 0.00% argentia 1 0.00% (31 1 0.00% vera 1 0.00% rijkaard 1 0.00% kanu

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 251 1 0.00% inacio 1 0.00% guô1c 1 0.00% vazquez 1 0.00% (italien 1 0.00% arturo 1 0.00% slovakia 1 0.00% wbc 1 0.00% gazzetta 1 0.00% zab 1 0.00% ergic 1 0.00% schnyder 1 0.00% patty 1 0.00% vukic 1 0.00% (50 1 0.00% katarina 1 0.00% scher 1 0.00% elijah 1 0.00% chiristophe 1 0.00% santoro 1 0.00% chez 1 0.00% cummigs 1 0.00% voice 1 0.00% vietbao 1 0.00% amor 1 0.00% mtv 1 0.00% matsco'va 1 0.00% sato 1 0.00% carmen 1 0.00% dupuis 1 0.00% moskva 1 0.00% to'n 1 0.00% pais 1 0.00% kovaliev 1 0.00% idol 1 0.00% snelling 1 0.00% luvoo 1 0.00% lutmila 1 0.00% alekseeva 1 0.00% 144 1 0.00% droits 1 0.00% l'homme 1 0.00% comité 1 0.00% 0904 1 0.00% 665 1 0.00% 048 1 0.00% nay(30 1 0.00% sê2nh 1 0.00% entourage 1 0.00% 091 1 0.00% 7336 1 0.00% sêri 1 0.00% sara

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 252 1 0.00% whyatt 1 0.00% magyar 1 0.00% equya 1 0.00% ddor 1 0.00% scot 1 0.00% cou'5 1 0.00% gopydtbcct2006 1 0.00% myu'2 1 0.00% marciel 1 0.00% hbo 1 0.00% oa 1 0.00% kirazli 1 0.00% ceyla 1 0.00% xuê 1 0.00% been 1 0.00% shaveena 1 0.00% yoga 1 0.00% singapo 1 0.00% chòi 1 0.00% cu3i 1 0.00% thom 1 0.00% (lúc 1 0.00% 1m63 1 0.00% phôtocopy 1 0.00% jetaime 1 0.00% lo'5m 1 0.00% praha 1 0.00% ly2 1 0.00% fatimih 1 0.00% ry 1 0.00% tsymbaliuk 1 0.00% doherty 1 0.00% sa(m 1 0.00% 16h30 1 0.00% ne 1 0.00% 293 1 0.00% 296 1 0.00% 318 1 0.00% 11x 1 0.00% xu' 1 0.00% 297 1 0.00% grôve 1 0.00% (houston 1 0.00% inter 1 0.00% hrw 1 0.00% cjp 1 0.00% quo3 1 0.00% litte 1 0.00% lg 1 0.00% rfa 1 0.00% kenisha 1 0.00% 50c

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 253 1 0.00% lau'1 1 0.00% nho'u'1 1 0.00% tînh 1 0.00% ballet 1 0.00% onwarin 1 0.00% prilly 1 0.00% kha´ 1 0.00% magali 1 0.00% ti´ch 1 0.00% tha´i 1 0.00% co´ 1 0.00% kapur 1 0.00% (dfb 1 0.00% adrienn 1 0.00% bende 1 0.00% neha 1 0.00% 1938 1 0.00% pha´p 1 0.00% kha´c 1 0.00% ca´ 1 0.00% (23t 1 0.00% zinedane 1 0.00% rivery 1 0.00% ti´nh 1 0.00% no´i 1 0.00% kê´t 1 0.00% ddâ´u 1 0.00% romitelli 1 0.00% ly´ 1 0.00% mayer 1 0.00% caribe 1 0.00% na5m 1 0.00% scheinsteiger 1 0.00% 21h30 1 0.00% asare 1 0.00% (puerto 1 0.00% adriana 1 0.00% daza 1 0.00% shakira 1 0.00% (bor 1 0.00% mönchengladbach 1 0.00% betina 1 0.00% (bô2 1 0.00% schumacher 1 0.00% hungary 1 0.00% vorfelder 1 0.00% platini 1 0.00% klinmann 1 0.00% dina 1 0.00% faurbiye 1 0.00% fekadu 1 0.00% zanella

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 254 1 0.00% rafaella 1 0.00% (hq 1 0.00% nxy 1 0.00% cncs 1 0.00% súp 1 0.00% thoán 1 0.00% piven 1 0.00% (nghi5 1 0.00% ki3 1 0.00% (1948 1 0.00% giào 1 0.00% (ngo 1 0.00% booth… 1 0.00% ddùm 1 0.00% su4ng 1 0.00% si4nh 1 0.00% máo 1 0.00% sê5ch 1 0.00% mê1u 1 0.00% xâ1c 1 0.00% tra3y 1 0.00% a(1ng 1 0.00% tru'o'1c(5 1 0.00% xiên 1 0.00% dângtàu 1 0.00% daredevil 1 0.00% torre 1 0.00% (hddnt 1 0.00% flower 1 0.00% avenue 1 0.00% 6th 1 0.00% gic 1 0.00% piereo 1 0.00% (thô 1 0.00% meterazzi 1 0.00% htv7 1 0.00% havard 1 0.00% joaquin 1 0.00% foxx 1 0.00% jamie 1 0.00% (ngoa5i 1 0.00% (right 1 0.00% farrell 1 0.00% transporter 1 0.00% jason 1 0.00% walk 1 0.00% statham 1 0.00% olympiastadion 1 0.00% 11mét 1 0.00% most 1 0.00% favored 1 0.00% bta

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 255 1 0.00% bilateral 1 0.00% agreement 1 0.00% stevens 1 0.00% stevie 1 0.00% quyê2 1 0.00% wonder 1 0.00% schlect 1 0.00% cat 1 0.00% kinks 1 0.00% ddaò 1 0.00% vaò 1 0.00% education 1 0.00% dr 1 0.00% certificate 1 0.00% cnet 1 0.00% 215 1 0.00% se3… 1 0.00% nay2 1 0.00% sony 1 0.00% broccoli 1 0.00% redding 1 0.00% churches 1 0.00% (written 1 0.00% tier 1 0.00% (human 1 0.00% trafficking 1 0.00% testimonies 1 0.00% riff 1 0.00% guitar 1 0.00% (lau 1 0.00% (cd 1 0.00% xbmdd 1 0.00% administration 1 0.00% ipr 1 0.00% intellectual 1 0.00% (ta(1t 1 0.00% otis 1 0.00% green 1 0.00% demo 1 0.00% (food 1 0.00% drug 1 0.00% fda 1 0.00% 301 1 0.00% 'catfish' 1 0.00% ddaì 1 0.00% tonight 1 0.00% (vna 1 0.00% (pw 1 0.00% gospel 1 0.00% nhk 1 0.00% 75m 1 0.00% priscila

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 256 1 0.00% hotel 1 0.00% 200er 1 0.00% erin 1 0.00% perales 1 0.00% full 1 0.00% 711 1 0.00% arraras 1 0.00% tiatanic 1 0.00% móp 1 0.00% ho3ang 1 0.00% celeste 1 0.00% goá 1 0.00% (igfm 1 0.00% vanh 1 0.00% dominica 1 0.00% bu'o'ng 1 0.00% flint 1 0.00% opendocument 1 0.00% kkt 1 0.00%783b2f92a5b73014c12571ae005592ca 1 0.00% uniflashes 1 0.00% nsf 1 0.00% another 1 0.00% martini 1 0.00% connery 1 0.00% thu'5o'ng 1 0.00% die 1 0.00% ngoaì 1 0.00% (link 1 0.00% qua(5p 1 0.00% audio 1 0.00% hocova 1 0.00% 325 1 0.00% wilshire 1 0.00% 674m2 1 0.00% vttm 1 0.00% (vo'3 1 0.00% tráp 1 0.00% bql 1 0.00% ngoaí 1 0.00% kung 1 0.00% (bruce 1 0.00% nho'1p 1 0.00% tràosau 1 0.00% sunanda 1 0.00% cheo 1 0.00% (myamar 1 0.00% zimbabwe 1 0.00% dda(5tc 1 0.00% hoác 1 0.00% (north 1 0.00% sticker

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 257 1 0.00% rajiv 1 0.00% tru'o'2n 1 0.00% rasheed 1 0.00% viê5cthành 1 0.00% 428 1 0.00% nhu5a 1 0.00% manohar 1 0.00% murli 1 0.00% vddck 1 0.00% jagmohan 1 0.00% (vddck 1 0.00% nigieria 1 0.00% bonneur 1 0.00% (delayed 1 0.00% (radio 1 0.00% du'1o'i 1 0.00% seok 1 0.00% replay 1 0.00% tru'1o'1c 1 0.00% lôn 1 0.00% home 1 0.00% môtip 1 0.00% 1931 1 0.00% xiê2ng 1 0.00% xô2m 1 0.00% nhám 1 0.00% thi5ch 1 0.00% (khánh 1 0.00% petrus 1 0.00% nhúa 1 0.00% (truyê2n 1 0.00% liê2m 1 0.00% thóang 1 0.00% thcand 1 0.00% nia 1 0.00% (hxhcnvn 1 0.00% ihrc 1 0.00% give 1 0.00% múôn 1 0.00% undiscovered 1 0.00% something 1 0.00% lo5at 1 0.00% balloon 1 0.00% circle 1 0.00% white 1 0.00% jafar 1 0.00% merriman 1 0.00% indonesian 1 0.00% chxnchvn 1 0.00% itunes 1 0.00% teenpop 1 0.00% warwickshire

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 258 1 0.00% rugby 1 0.00% kelly 1 0.00% f50 1 0.00% tunit 1 0.00% hot 1 0.00% dent 1 0.00% panarub 1 0.00% (silver 1 0.00% mung 1 0.00% súyt 1 0.00% mánh 1 0.00% mary 1 0.00% ddo5at 1 0.00% kenvin 1 0.00% roller 1 0.00% kidwai 1 0.00% (ptqd 1 0.00% coaster 1 0.00% sòn 1 0.00% cap 1 0.00% pizza 1 0.00% 'the 1 0.00% gold' 1 0.00% 'offside' 1 0.00% 'crimson 1 0.00% circle' 1 0.00% winsread 1 0.00% elizabeth 1 0.00% farmani 1 0.00% (2000 1 0.00% (golnaz 1 0.00% lên… 1 0.00% haryanto 1 0.00% antara 1 0.00% (papua 1 0.00% ngoan… 1 0.00% con… 1 0.00% roi… 1 0.00% ddâ1t… 1 0.00% lu'4a 1 0.00% ratna 1 0.00% sentani 1 0.00% va5y 1 0.00% 914 1 0.00% 797 1 0.00% simcard 1 0.00% shiu 1 0.00% kee 1 0.00% 980 1 0.00% ha(1t 1 0.00% ghém 1 0.00% 964

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 259 1 0.00% 326 1 0.00% 166 1 0.00% (trùng 1 0.00% ddéc 1 0.00% (khoa 1 0.00% lâ3m 1 0.00% chê1ch 1 0.00% sô1i 1 0.00% (ta5p 1 0.00% dalai 1 0.00% íck 1 0.00% lama 1 0.00% 'trô5m 1 0.00% giang' 1 0.00% ddu'o'5m 1 0.00% (drd 1 0.00% nu4ng 1 0.00% (nha 1 0.00% 431 1 0.00% (ho5c 1 0.00% nhe3o 1 0.00% 920 1 0.00% 377 1 0.00% 234 1 0.00% nhõng 1 0.00% (m 1 0.00% (nó 1 0.00% co'i 1 0.00% (acb 1 0.00% room 1 0.00% (ntk 1 0.00% (room 1 0.00% (techcombank 1 0.00% mím 1 0.00% phêng 1 0.00% nhu5t 1 0.00% (dda(1c 1 0.00% phên 1 0.00% gáo 1 0.00% vu'o'4ng 1 0.00% nga5 1 0.00% pl 1 0.00% bolzano 1 0.00% size 1 0.00% huyê4n 1 0.00% plds 1 0.00% hp 1 0.00% 524 1 0.00% hoa(2ng 1 0.00% nga3nh 1 0.00% pheng 1 0.00% 032

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 260 1 0.00% (1997 1 0.00% (tnhh 1 0.00% (vdc 1 0.00% 174 1 0.00% xa5m 1 0.00% 195 1 0.00% hóavâ4n 1 0.00% ru'o'1i 1 0.00% 778 1 0.00% 722 1 0.00% isp 1 0.00% (bài 1 0.00% (chô2ng 1 0.00% 520198 1 0.00% ama 1 0.00% luyn 1 0.00% cu'òi 1 0.00% ftp 1 0.00% telnet 1 0.00% websites 1 0.00% mo5 1 0.00% (isp 1 0.00% vogel 1 0.00% (pghh 1 0.00% elena(mia 1 0.00% 30cm 1 0.00% ghim 1 0.00% 395 1 0.00% (emmy 1 0.00% jenifer 1 0.00% rossum 1 0.00% lpdd 1 0.00% christian(mike 1 0.00% maestro 1 0.00% wolfgang 1 0.00% health 1 0.00% quy0t 1 0.00% chùc 1 0.00% ch0 1 0.00% (fredy 1 0.00% dìa 1 0.00% (â1p 1 0.00% dy 1 0.00% (mu4i 1 0.00% mâ3y 1 0.00% (ca 1 0.00% lucas 1 0.00% ucraina 1 0.00% 2142 1 0.00% na(m1981 1 0.00% (dành 1 0.00% 2150

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 261 1 0.00% twddcsvn 1 0.00% (josh 1 0.00% (cuô1n 1 0.00% thtndc 1 0.00% (giu'4a 1 0.00% (jacinda 1 0.00% lình 1 0.00% bennett 1 0.00% vàm 1 0.00% russel 1 0.00% (kurt 1 0.00% (jimmy 1 0.00% conor 1 0.00% barrett 1 0.00% stamp 1 0.00% 'chu'a 1 0.00% rubber 1 0.00% 0l 1 0.00% emma 1 0.00% djik 1 0.00% gorelik 1 0.00% mehdi 1 0.00% (636 1 0.00% shakespeare 1 0.00% (ca(1t 1 0.00% lo5an 1 0.00% ddung 1 0.00% eiu 1 0.00% 400kg 1 0.00% ghadimi 1 0.00% meregue 1 0.00% jive 1 0.00% 480 1 0.00% 7p 1 0.00% vinaconex 1 0.00% krasne 1 0.00% te 1 0.00% lucille 1 0.00% (dssgdd 1 0.00% ''gio'1i 1 0.00% tre3'' 1 0.00% táhi 1 0.00% 520 1 0.00% 193 1 0.00% qua5ch 1 0.00% troy 1 0.00% force 1 0.00% outbreak 1 0.00% aventure 1 0.00% 022459955 1 0.00% neame 1 0.00% fire

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 262 1 0.00% irwin 1 0.00% dower 1 0.00% 136 1 0.00% mún 1 0.00% thâ2h 1 0.00% protosevich 1 0.00% aid 1 0.00% hòanh 1 0.00% storm 1 0.00% (ec 1 0.00% 570 1 0.00% 488 1 0.00% opaque 1 0.00% mile 1 0.00% aei 1 0.00% ddâ5i 1 0.00% wollenberg 1 0.00% nán 1 0.00% enterprise 1 0.00% westbam 1 0.00% mundial 1 0.00% 1250 1 0.00% tiesto 1 0.00% ngênh 1 0.00% (quy 1 0.00% ddõ 1 0.00% phu'ong 1 0.00% chopin 1 0.00% dale 1 0.00% vinci 1 0.00% doàn 1 0.00% (danh 1 0.00% mu'o'1c 1 0.00% nicole 1 0.00% mackeno 1 0.00% kitô 1 0.00% hi5ch 1 0.00% (rfa 1 0.00% (love 1 0.00% mí 1 0.00% baò 1 0.00% rám 1 0.00% koa 1 0.00% varsovie 1 0.00% sàch 1 0.00% ddo'4… 1 0.00% nhâ 1 0.00% bo'3i… 1 0.00% cô5c 1 0.00% beijing 1 0.00% workers' 1 0.00% traò

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 263 1 0.00% dyk 1 0.00% u'1o'c 1 0.00% federation 1 0.00% short 1 0.00% bu'ã 1 0.00% tiergarten 1 0.00% (bwaf 1 0.00% bwaf 1 0.00% fortune 1 0.00% invites 1 0.00% firms 1 0.00% locomotive 1 0.00% nortel 1 0.00% sifang 1 0.00% join 1 0.00% brenda 1 0.00% cherry 1 0.00% macalister 1 0.00% invasion 1 0.00% tibet 1 0.00% bombardier 1 0.00% kazakhs 1 0.00% manchuria 1 0.00% 919 1 0.00% uyghur 1 0.00% (uy 1 0.00% (inner 1 0.00% 995 1 0.00% (power 1 0.00% (mongol 1 0.00% region 1 0.00% 586 1 0.00% zhang 1 0.00% brookhart 1 0.00% qinghai 1 0.00% josh 1 0.00% sq 1 0.00% ft 1 0.00% haas 1 0.00% abrahm 1 0.00% lustgarten 1 0.00% shigatze 1 0.00% harwick 1 0.00% ngu'2 1 0.00% tangula 1 0.00% foot 1 0.00% hòai 1 0.00% zero 1 0.00% macarthur 1 0.00% ra(5ng 1 0.00% thermosiphon 1 0.00% xining

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 264 1 0.00% railway 1 0.00% 930 1 0.00% (feet 1 0.00% ammonia 1 0.00% liê3ng 1 0.00% kho3ng 1 0.00% nho'4n 1 0.00% investigation 1 0.00% (mãi 1 0.00% (xâ1p 1 0.00% anbani 1 0.00% indonêxia 1 0.00% pho'4n 1 0.00% (1990 1 0.00% lady 1 0.00% trie3n 1 0.00% dcch 1 0.00% ubmttq 1 0.00% berenguel 1 0.00% bye 1 0.00% (lý 1 0.00% sun 1 0.00% (nòng 1 0.00% (cô3 1 0.00% razzi 1 0.00% ma(2n 1 0.00% papatacci 1 0.00% da3ng 1 0.00% hungari 1 0.00% 1001 1 0.00% petofi 1 0.00% suý 1 0.00% pont 1 0.00% (phóng 1 0.00% fukuoka 1 0.00% engiô2 1 0.00% kato 1 0.00% akutagawa 1 0.00% sarkae 1 0.00% d'alma 1 0.00% thie3u 1 0.00% caradec'h 1 0.00% nguòi 1 0.00% tu'o3ng 1 0.00% 67b 1 0.00% ddie3m 1 0.00% fayed 1 0.00% dodi 1 0.00% dô4ng 1 0.00% umberto 1 0.00% to5c 1 0.00% mosad

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 265 1 0.00% bèo 1 0.00% hê5n 1 0.00% ddô5ng( 1 0.00% tô5 1 0.00% vinaxad 1 0.00% odessa 1 0.00% dda3ng'' 1 0.00% _ha5n 1 0.00% ''xét 1 0.00% nghành 1 0.00% vàng… 1 0.00% qúai 1 0.00% kléber 1 0.00% ha(nh 1 0.00% boa(n 1 0.00% phòng…vv 1 0.00% (mô 1 0.00% chu'i3 1 0.00% ''quô1c 1 0.00% na5n'' 1 0.00% (cu'3a 1 0.00% ''tiê1p 1 0.00% dân'' 1 0.00% xúi 1 0.00% châ1y 1 0.00% râ5n 1 0.00% (cha(3ng 1 0.00% …mo5c 1 0.00% (ddô1t 1 0.00% thóp 1 0.00% deuxième 1 0.00% bureaux 1 0.00% cha(mpa 1 0.00% xoe 1 0.00% t'ru'ng 1 0.00% 6h30' 1 0.00% vê4nh 1 0.00% lo3a 1 0.00% 40m2 1 0.00% 180ô3 1 0.00% …và 1 0.00% ngoa(1c 1 0.00% nguyê4nva(n 1 0.00% báothanh 1 0.00% vông 1 0.00% (ra 1 0.00% viê3n 1 0.00% veulent 1 0.00% hommage 1 0.00% députés 1 0.00% guillaume 1 0.00% perrault

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 266 1 0.00% aux 1 0.00% ghe5o 1 0.00% califoocnia 1 0.00% (toronto 1 0.00% victimes 1 0.00% communisme 1 0.00% figaro 1 0.00% pour 1 0.00% française 1 0.00% (union 1 0.00% cha(4n 1 0.00% udf 1 0.00% j'ai 1 0.00% car 1 0.00% criminel 1 0.00% rue 1 0.00% changé 1 0.00% tru 1 0.00% pa(1c 1 0.00% showdown 1 0.00% why 1 0.00% pdn 1 0.00% jesse 1 0.00% lujan 1 0.00% wants 1 0.00% babbin 1 0.00% publications 1 0.00% jed 1 0.00% states 1 0.00% (cha5m 1 0.00% u'o'2n 1 0.00% su'5c 1 0.00% xo'1 1 0.00% flash 1 0.00% chày 1 0.00% cho'2i 1 0.00% nính 1 0.00% mcavoy 1 0.00% mo3m 1 0.00% audrey 1 0.00% éo 1 0.00% iiss 1 0.00% christophe 1 0.00% 1787 1 0.00% 1790 1 0.00% 1731 1 0.00% (funan 1 0.00% 1698 1 0.00% 1887 1 0.00% 253 1 0.00% 519 1 0.00% élysée

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 267 1 0.00% (gouverneur 1 0.00% général 1 0.00% 1623 1 0.00% (cai 1 0.00% 1613 1 0.00% 1306 1 0.00% mân 1 0.00% sính 1 0.00% 1635 1 0.00% 1618 1 0.00% agus 1 0.00% waluyo 1 0.00% 1620 1 0.00% shiva 1 0.00% tho 1 0.00% tiê3n 1 0.00% gallery 1 0.00% tru'2u 1 0.00% 1232 1 0.00% 41m 1 0.00% trin 1 0.00% (ha5t 1 0.00% seine 1 0.00% stalingrad 1 0.00% volgograd 1 0.00% leningrad 1 0.00% (1258 1 0.00% (thuâ5n 1 0.00% dove 1 0.00% lever 1 0.00% diva 1 0.00% amato' 1 0.00% 1874 1 0.00% 17m 1 0.00% 1264 1 0.00% 882 1 0.00% (tphcm 1 0.00% 974 1 0.00% marlon 1 0.00% guizhou 1 0.00% communes 1 0.00% 092 1 0.00% 704 1 0.00% sichuan 1 0.00% open 1 0.00% elections 1 0.00% semi 1 0.00% according 1 0.00% ivory 1 0.00% devil 1 0.00% wears 1 0.00% gulags

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 268 1 0.00% prada 1 0.00% (212 1 0.00% 183 1 0.00% 026 1 0.00% lãu3nh 1 0.00% (thô1ng 1 0.00% repressio 1 0.00% repression 1 0.00% selective 1 0.00% keenen 1 0.00% turkestan 1 0.00% inner 1 0.00% ru'o'ng 1 0.00% amdo 1 0.00% (qinghai 1 0.00% wikimedia 1 0.00% (xpcc 1 0.00% 630 1 0.00% corps 1 0.00% production 1 0.00% construction 1 0.00% 1980s 1 0.00% 6300 1 0.00% trô1t 1 0.00% (380 1 0.00% ru5c 1 0.00% ri5ch 1 0.00% nemo 1 0.00% depp 1 0.00% 1960s 1 0.00% walt 1 0.00% (232 1 0.00% californie 1 0.00% 483 1 0.00% (spratly 1 0.00% (origami 1 0.00% yellow 1 0.00% cosply 1 0.00% brunei 1 0.00% constructive 1 0.00% tàm 1 0.00% (hô2 1 0.00% 22g30 1 0.00% engagement 1 0.00% poster 1 0.00% thu'2o'ng 1 0.00% 502 1 0.00% daylights 1 0.00% publishing 1 0.00% timperlake 1 0.00% regnery 1 0.00% living

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 269 1 0.00% royale 1 0.00% 1908 1 0.00% po 1 0.00% wen 1 0.00% octopussy 1 0.00% khàn 1 0.00% far 1 0.00% composer 1 0.00% npc 1 0.00% berklee 1 0.00% vhnt 1 0.00% (nsu't 1 0.00% (chung 1 0.00% (387 1 0.00% friends 1 0.00% tíê1p 1 0.00% vât 1 0.00% rockband 1 0.00% (tài 1 0.00% (1982 1 0.00% csth 1 0.00% duo'1i 1 0.00% giâ5p 1 0.00% (1996 1 0.00% brezhnev 1 0.00% princelings 1 0.00% (huy 1 0.00% nsnd 1 0.00% rocker

Pham, Kohnert, Carney 2008, Corpora of Vietnamese Texts, Page 270