The Corpora of Vietnamese Texts was completed by Giang Pham (formerly Giang Tang) under the supervision of Kathryn Kohnert, Ph.D., CCC-SLP. Funding was provided by the Graduate Research Partnership Program in the Department of Speech-Language-Hearing Sciences at the University of Minnesota.

** When printing, please note that this document is nearly 300 pages. **

Please cite this work using the following reference:

Pham, G., Kohnert, K., & Carney, E. (2008). Corpora of Vietnamese Texts: Lexical Effects of Intended Audience and Publication Place. Behavior Research Methods, 40, 154-163.

Corpora of Vietnamese Texts
(Combined newspaper and children's literature corpora)
Word frequency out of 1,055,617 total words

# Occurrence Percent Word
13076 1.30% và
12313 1.22% cu3a
10587 1.05% mô5t
10488 1.04% có
10303 1.02% là
8451 0.84% không
8387 0.83% cho
8383 0.83% các
8149 0.81% trong
7585 0.75% ddã
6620 0.66% ddu'o'5c
6434 0.64% ngu'o'2i
6065 0.60% nhu'4ng
5396 0.53% vo'1i
4984 0.49% ddê3
4881 0.48% ra
4685 0.46% con
4645 0.46% ddê1n
4548 0.45% vào
4403 0.44% này
4224 0.42% ông
4210 0.42% công
4088 0.40% nhu'
4068 0.40% cu4ng
4025 0.40% vê2
4005 0.40% o'3
3942 0.39% nhà
3890 0.39% khi
3811 0.38% dân
3806 0.38% la5i
3762 0.37% làm
3724 0.37% ddó
3637 0.36% pha3i
3484 0.35% tôi
3413 0.34% chính
3360 0.33% na(m
3290 0.33% ddi
3268 0.32% se4
3218 0.32% bi5

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 1
3195 0.32% tu'2
3176 0.31% nu'o'1c
3166 0.31% thê1
3139 0.31% quô1c
3105 0.31% ta5i
3032 0.30% thê3
3007 0.30% nói
2991 0.30% trên
2941 0.29% thì
2899 0.29% thành
2895 0.29% nhu'ng
2890 0.29% nhiê2u
2861 0.28% ngày
2847 0.28% còn
2842 0.28% chi3
2810 0.28% lên
2759 0.27% nam
2709 0.27% su'5
2705 0.27% mà
2656 0.26% ddâ2u
2571 0.25% sau
2528 0.25% ca3
2504 0.25% nhân
2476 0.25% sô1
2459 0.24% viê5c
2419 0.24% gia
2399 0.24% theo
2386 0.24% vì
2371 0.23% anh
2370 0.23% viê5t
2244 0.22% chúng
2216 0.22% chu3
2211 0.22% mình
2205 0.22% hai
2189 0.22% hô5i
2181 0.22% ho5c
2176 0.22% ddô2ng
2131 0.21% quan
2130 0.21% do
2118 0.21% ddang
2100 0.21% ta
2095 0.21% biê1t
2078 0.21% quyê2n
2075 0.21% ho'n
2048 0.20% ddô5ng
2045 0.20% hoa
2040 0.20% thâ1y
2009 0.20% qua
1998 0.20% trung
1991 0.20% ho5
1940 0.19% ky2
1925 0.19% ddây

1908 0.19% râ1t
1883 0.19% nào
1875 0.19% hiê5n
1857 0.18% tru'o'2ng
1818 0.18% tru'o'1c
1808 0.18% bô5
1807 0.18% viên
1797 0.18% cuô5c
1786 0.18% mo'1i
1766 0.17% rô2i
1729 0.17% ra(2ng
1705 0.17% ddiê2u
1681 0.17% nhâ1t
1672 0.17% cách
1668 0.17% hàng
1665 0.16% bà
1659 0.16% khác
1652 0.16% hay
1641 0.16% nay
1626 0.16% sinh
1620 0.16% ddi5nh
1609 0.16% tê1
1588 0.16% cùng
1549 0.15% cô
1539 0.15% vu5
1538 0.15% nên
1531 0.15% cái
1502 0.15% my4
1491 0.15% gio'1i
1481 0.15% vâ4n
1469 0.15% dda5i
1438 0.14% hành
1437 0.14% nhâ5n
1428 0.14% tu'5
1419 0.14% thu3
1414 0.14% ba
1413 0.14% ddô1i
1395 0.14% tháng
1388 0.14% me5
1386 0.14% tình
1368 0.14% tho'2i
1363 0.14% chu'1c
1343 0.13% phát
1330 0.13% va(n
1329 0.13% cao
1324 0.13% tiê1p
1317 0.13% ba5n
1317 0.13% báo
1314 0.13% gì
1310 0.13% em
1307 0.13% a(n
1298 0.13% kinh

1296 0.13% lý
1292 0.13% thu'1
1283 0.13% co'
1279 0.13% lo'1n
1276 0.13% 2006
1268 0.13% pháp
1263 0.13% nó
1263 0.13% quân
1259 0.12% tu'
1255 0.12% lúc
1252 0.12% nô5i
1251 0.12% tin
1248 0.12% ba(2ng
1233 0.12% ddâ1t
1230 0.12% ddê2
1221 0.12% ba(1t
1218 0.12% ý
1211 0.12% kê1t
1196 0.12% tiê2n
1195 0.12% thông
1178 0.12% tiê1ng
1172 0.12% to'1i
1169 0.12% chiê1n
1163 0.12% hình
1159 0.11% cô5ng
1159 0.11% nê1u
1159 0.11% 1
1152 0.11% gio'2
1146 0.11% gia3i
1144 0.11% ddu'o'2ng
1138 0.11% trình
1137 0.11% lo'2i
1136 0.11% ba3o
1133 0.11% liên
1133 0.11% giá
1130 0.11% lâ2n
1129 0.11% sa3n
1127 0.11% 2
1124 0.11% câ2u
1124 0.11% bình
1123 0.11% ddô5
1108 0.11% vu'2a
1103 0.11% tro'3
1101 0.11% muô1n
1100 0.11% ba3n
1099 0.11% an
1087 0.11% giáo
1086 0.11% quyê1t
1085 0.11% quá
1083 0.11% toàn
1079 0.11% thu'5c
1072 0.11% qua3

1071 0.11% ho'5p
1071 0.11% ddô5i
1066 0.11% thi
1058 0.10% nhiên
1054 0.10% chu'a
1052 0.10% tô3
1049 0.10% bé
1049 0.10% ma(5t
1043 0.10% ai
1043 0.10% tri5
1035 0.10% tâm
1033 0.10% câ2n
1029 0.10% hôm
1019 0.10% tay
1015 0.10% tài
1013 0.10% ngoài
1010 0.10% vâ5y
1000 0.10% phâ2n
998 0.10% dda3ng
991 0.10% tranh
988 0.10% thâ5t
985 0.10% nu'4a
984 0.10% du'5
984 0.10% dda5o
981 0.10% tìm
981 0.10% xuâ1t
977 0.10% vi5
975 0.10% lu'5c
968 0.10% tô3ng
946 0.09% nhau
946 0.09% sô1ng
940 0.09% thu'o'2ng
935 0.09% ddê2u
930 0.09% ngay
917 0.09% sách
914 0.09% sao
913 0.09% tên
907 0.09% chê1
907 0.09% hà
905 0.09% 3
905 0.09% thô1ng
899 0.09% tro5ng
883 0.09% thu'o'ng
882 0.09% gian
879 0.09% câ1p
878 0.09% nguyê4n
877 0.09% lâ5p
874 0.09% ddu'a
874 0.09% thi5
872 0.09% hê1t
868 0.09% tham
866 0.09% xuô1ng

864 0.09% ma5nh
864 0.09% ddình
864 0.09% xe
858 0.09% ta(ng
855 0.08% ho3i
854 0.08% si4
854 0.08% ddiê3m
844 0.08% luâ5t
827 0.08% bâ1t
823 0.08% lâ1y
822 0.08% lãnh
821 0.08% thu'1c
821 0.08% bên
818 0.08% chuyê5n
817 0.08% mo5i
813 0.08% bác
813 0.08% chú
809 0.08% thanh
806 0.08% ddu'1c
799 0.08% ban
798 0.08% bô1
795 0.08% gâ2n
792 0.08% nhìn
791 0.08% 7
791 0.08% bao
790 0.08% minh
785 0.08% ddá
783 0.08% phu'o'ng
779 0.08% cây
779 0.08% go5i
778 0.08% ddâ1u
774 0.08% bo3
771 0.08% tác
771 0.08% trâ5n
770 0.08% ngoa5i
770 0.08% tra3
769 0.08% ddánh
768 0.08% tâ1t
767 0.08% nguyên
759 0.08% no'i
758 0.08% giu'4a
756 0.07% â1y
756 0.07% tiên
755 0.07% nghi5
753 0.07% vâ1n
751 0.07% cu'1
742 0.07% phòng
741 0.07% tre3
740 0.07% nghe
737 0.07% hê5
733 0.07% tính
733 0.07% ty

728 0.07% na(ng
728 0.07% hóa
726 0.07% xã
725 0.07% tuy
724 0.07% 5
723 0.07% lu'o'5ng
719 0.07% triê5u
718 0.07% ddông
717 0.07% nghiê5p
716 0.07% chiê1c
715 0.07% du5ng
714 0.07% tiêu
712 0.07% tiê1n
711 0.07% thêm
709 0.07% ma5ng
702 0.07% tu5c
700 0.07% bê5nh
697 0.07% bán
696 0.07% tru'o'3ng
694 0.07% cho'i
691 0.07% cáo
689 0.07% giúp
687 0.07% sáng
686 0.07% thay
685 0.07% xem
682 0.07% cha5y
680 0.07% ddâu
680 0.07% chí
678 0.07% biê3u
678 0.07% cu'3
676 0.07% tuô3i
676 0.07% nho3
674 0.07% ca3m
672 0.07% ddiê5n
672 0.07% thái
670 0.07% 6
666 0.07% vâ5t
664 0.07% bay
664 0.07% tra5ng
662 0.07% diê4n
660 0.07% mô4i
659 0.07% thích
658 0.07% kê3
658 0.07% khó
657 0.07% án
656 0.07% ca3nh
656 0.07% vn
651 0.06% triê3n
650 0.06% giao
647 0.06% xin
642 0.06% dù
639 0.06% lo'5i

635 0.06% vô
634 0.06% chi5
632 0.06% 4
632 0.06% cá
631 0.06% thâ2n
629 0.06% yêu
628 0.06% a3nh
626 0.06% phô1
625 0.06% bóng
624 0.06% phu3
623 0.06% khoa3ng
622 0.06% loa5i
620 0.06% mâ1t
619 0.06% ddô5c
619 0.06% máy
618 0.06% bài
617 0.06% su'1c
616 0.06% hoàng
615 0.06% ddi5a
613 0.06% cuô1i
609 0.06% thân
604 0.06% ddô3i
603 0.06% tu'2ng
601 0.06% ta5o
601 0.06% khu
599 0.06% thu
598 0.06% chô1ng
598 0.06% ti3nh
598 0.06% viê5n
594 0.06% châu
594 0.06% mang
593 0.06% càng
590 0.06% hô2
589 0.06% dde5p
589 0.06% lòng
586 0.06% ba(1c
583 0.06% pha5m
581 0.06% áp
580 0.06% vùng
579 0.06% ga(5p
579 0.06% hoàn
577 0.06% gây
575 0.06% tâ1n
573 0.06% chê1t
572 0.06% tâ5p
569 0.06% du'o'1i
568 0.06% tu'3
565 0.06% vàng
559 0.06% nghi4
558 0.06% mua
558 0.06% chung
552 0.05% dài

551 0.05% khách
549 0.05% vua
549 0.05% bàn
547 0.05% mu'1c
546 0.05% sát
545 0.05% cu'3a
544 0.05% tô1t
543 0.05% kiê1n
541 0.05% ma(1t
540 0.05% ddoàn
540 0.05% ddu'1ng
538 0.05% tra
537 0.05% luâ5n
536 0.05% 10
536 0.05% chi
535 0.05% kho3i
535 0.05% phi
535 0.05% ddúng
535 0.05% hiê5u
534 0.05% sang
533 0.05% mo'3
532 0.05% ký
531 0.05% to
530 0.05% câu
530 0.05% nghi4a
530 0.05% cha(3ng
529 0.05% biê5t
529 0.05% nhâ5p
529 0.05% chân
529 0.05% khai
528 0.05% 0
528 0.05% gia3
527 0.05% di5ch
526 0.05% so'3
525 0.05% tô5i
525 0.05% tro'2i
524 0.05% chuyên
524 0.05% cu'1u
523 0.05% dâ4n
523 0.05% ddu3
523 0.05% tu'1c
521 0.05% kê1
519 0.05% hô5
514 0.05% chi5u
508 0.05% tu'o'1ng
508 0.05% dùng
508 0.05% cô1
507 0.05% dda(5c
506 0.05% nhiê5m
505 0.05% doanh
504 0.05% vu4
504 0.05% hoa5t

503 0.05% ddo'2i
499 0.05% ddàn
499 0.05% hoa(5c
498 0.05% su'3
498 0.05% tích
494 0.05% nô3i
493 0.05% phu5
492 0.05% truyê2n
487 0.05% xây
486 0.05% phía
485 0.05% tuyê3n
483 0.05% xét
482 0.05% nhâ5t
481 0.05% ít
480 0.05% tho3
479 0.05% diê5n
477 0.05% giu'4
477 0.05% xa
476 0.05% thiê1t
476 0.05% ddâ2y
475 0.05% lâu
475 0.05% cô3
474 0.05% thu'
474 0.05% gâ1u
474 0.05% ddáng
474 0.05% su'
474 0.05% châ1t
473 0.05% rõ
473 0.05% vu'5c
472 0.05% cung
471 0.05% biê1n
470 0.05% tinh
470 0.05% câ2m
469 0.05% nga
467 0.05% hô2i
467 0.05% luôn
465 0.05% du'5ng
463 0.05% lan
463 0.05% dda
463 0.05% biê3n
462 0.05% vâ5n
462 0.05% mu5c
461 0.05% ha3i
459 0.05% dda(5t
458 0.05% hâ5u
457 0.05% tây
457 0.05% thuô5c
455 0.05% bush
455 0.05% hãy
454 0.05% gái
453 0.04% bo'3i
452 0.04% trang

452 0.04% dâ2u
451 0.04% tô1i
450 0.04% vo'5
448 0.04% vo5ng
448 0.04% csvn
448 0.04% kêu
448 0.04% chu'1ng
445 0.04% vòng
443 0.04% trâ2n
441 0.04% viê1t
441 0.04% khiê1n
440 0.04% li5ch
440 0.04% câ5u
439 0.04% tu'o'3ng
439 0.04% hòa
438 0.04% chàng
434 0.04% lo
431 0.04% cu5
431 0.04% ha5
430 0.04% nàng
429 0.04% na(2m
427 0.04% niên
426 0.04% vê5
425 0.04% danh
425 0.04% khí
424 0.04% y
424 0.04% vui
424 0.04% kéo
424 0.04% tro'5
423 0.04% mâ1y
416 0.04% xa3y
415 0.04% vi
414 0.04% tha(1ng
413 0.04% sân
412 0.04% so'5
412 0.04% tuâ2n
411 0.04% nhóm
411 0.04% chu'o'ng
409 0.04% ddô
408 0.04% trách
407 0.04% chim
405 0.04% di
404 0.04% na5n
403 0.04% hoa5ch
402 0.04% hiê5p
402 0.04% ti5ch
402 0.04% phút
400 0.04% tu'o'ng
398 0.04% ngô2i
398 0.04% ddo'n
397 0.04% phong
397 0.04% riêng

396 0.04% ddêm
396 0.04% trò
396 0.04% chô2ng
395 0.04% trí
395 0.04% khoa
395 0.04% tu'o'5ng
395 0.04% kiê5n
392 0.04% 8
391 0.04% â1n
391 0.04% gia3m
390 0.04% phâ3m
389 0.04% vài
389 0.04% chuyê3n
389 0.04% âu
388 0.04% hiê3u
388 0.04% phu5c
387 0.04% xu'3
387 0.04% mai
384 0.04% ddào
382 0.04% thuyê2n
382 0.04% chúa
382 0.04% tuyên
380 0.04% 20
380 0.04% cánh
380 0.04% ddòi
380 0.04% cha
379 0.04% kim
379 0.04% pha3n
379 0.04% phá
378 0.04% la(1m
378 0.04% tôn
378 0.04% trái
377 0.04% thuâ5t
377 0.04% ddô2
375 0.04% u'1ng
373 0.04% thôi
373 0.04% kha3
373 0.04% lao
372 0.04% ddâ1y
371 0.04% la5c
371 0.04% ca
371 0.04% ngu3
370 0.04% làng
370 0.04% nu'4
370 0.04% liê2n
369 0.04% khá
369 0.04% lê
369 0.04% giám
369 0.04% ma(5c
369 0.04% áo
368 0.04% le4
367 0.04% cha(1c

367 0.04% nhanh
365 0.04% hu'o'3ng
365 0.04% lê5
364 0.04% phép
364 0.04% hô3
364 0.04% cháu
363 0.04% ca(n
363 0.04% ngành
362 0.04% quy
360 0.04% chuô5t
356 0.04% nông
356 0.04% liê5u
356 0.04% phí
356 0.04% phân
354 0.04% khu3ng
354 0.04% ddáp
351 0.03% chô4
351 0.03% hâ2u
351 0.03% ddài
351 0.03% ddóng
351 0.03% châ1p
350 0.03% kiê3m
348 0.03% du5c
348 0.03% ddem
347 0.03% cho5n
345 0.03% thiê1u
345 0.03% ngo5c
343 0.03% ha5i
343 0.03% ky4
342 0.03% phóng
341 0.03% ru'2ng
340 0.03% du'o'ng
339 0.03% già
337 0.03% yê1u
337 0.03% hu'o'1ng
336 0.03% khô1i
336 0.03% miê2n
335 0.03% chuâ3n
334 0.03% màu
333 0.03% phó
333 0.03% ke3
333 0.03% ddô1c
333 0.03% tp
332 0.03% ca5nh
330 0.03% bu'1c
330 0.03% xác
329 0.03% nha3y
327 0.03% thuô1c
327 0.03% tô1
327 0.03% bâ2u
326 0.03% môn
324 0.03% 7

323 0.03% ha5n
321 0.03% thuâ5n
321 0.03% lo'1p
320 0.03% hu'4u
319 0.03% nguy
317 0.03% tiê1t
317 0.03% rô5ng
317 0.03% nê2n
317 0.03% nho'2
317 0.03% sai
315 0.03% iraq
315 0.03% qua3n
314 0.03% tt
314 0.03% to3
313 0.03% xong
312 0.03% ngu’o’2i
312 0.03% vô1n
312 0.03% u3ng
311 0.03% tra(1ng
311 0.03% na(5ng
311 0.03% kiê1m
310 0.03% chu'1
308 0.03% giang
306 0.03% chiê2u
306 0.03% ty3
306 0.03% mèo
306 0.03% so'n
305 0.03% thiên
305 0.03% dê4
304 0.03% ti3
304 0.03% ca3i
304 0.03% ddo3
304 0.03% sông
303 0.03% pha5t
303 0.03% bu'o'1c
302 0.03% nghê5
302 0.03% ngàn
302 0.03% o'i
302 0.03% nhu’4ng
301 0.03% á
301 0.03% 9
300 0.03% ngôi
300 0.03% nghèo
299 0.03% quy2nh
299 0.03% biên
299 0.03% sáu
298 0.03% 11
297 0.03% sâu
296 0.03% hàn
296 0.03% dda5t
296 0.03% tiê3u
295 0.03% qua3ng

294 0.03% cu'o'2i
294 0.03% lá
294 0.03% kia
294 0.03% tân
293 0.03% gà
293 0.03% ddô3
293 0.03% nho'1
293 0.03% coi
292 0.03% trai
292 0.03% cho'2
291 0.03% thu'o'5ng
291 0.03% suô1t
291 0.03% cóc
290 0.03% 30
290 0.03% ddo5c
288 0.03% cup
287 0.03% du
287 0.03% world
286 0.03% ba(ng
285 0.03% dda3o
285 0.03% nghiên
285 0.03% xanh
285 0.03% ddà
284 0.03% so
284 0.03% ba3y
284 0.03% tô5c
284 0.03% ninh
283 0.03% kha(1p
283 0.03% mãi
282 0.03% cu'5c
281 0.03% vô5i
281 0.03% thí
281 0.03% gu'3i
280 0.03% ho5p
280 0.03% binh
277 0.03% o’3
277 0.03% vu'o'ng
277 0.03% buô5c
276 0.03% sa(1p
276 0.03% ma5i
276 0.03% nguô2n
275 0.03% u'o'1c
275 0.03% bo5n
274 0.03% thiê5n
274 0.03% chia
273 0.03% tha(m
273 0.03% thú
273 0.03% kha(n
273 0.03% mong
272 0.03% bây
272 0.03% buô3i
272 0.03% chó

272 0.03% israel
272 0.03% giâ1y
272 0.03% v
271 0.03% mùa
271 0.03% mo'2i
271 0.03% ddôi
270 0.03% tai
270 0.03% (lên
270 0.03% thâ1t
269 0.03% sài
269 0.03% ngân
269 0.03% giô1ng
268 0.03% gòn
268 0.03% ky3
268 0.03% ngu'o'5c
268 0.03% hcm
267 0.03% nghiê5m
267 0.03% hô2ng
266 0.03% bang
265 0.03% so'
264 0.03% quay
263 0.03% chiê1m
262 0.03% long
262 0.03% lu'u
262 0.03% 12
260 0.03% tha(3ng
260 0.03% co3
260 0.03% sa(1c
260 0.03% bí
259 0.03% ghi
258 0.03% ngo'2
258 0.03% 2005
257 0.03% lô5
257 0.03% góp
256 0.03% thâ2y
256 0.03% tâ1m
256 0.03% ddau
255 0.03% hoà
254 0.03% ê1ch
254 0.03% ddu’o’5c
253 0.03% lê5nh
253 0.03% hy
253 0.03% tín
252 0.03% duy
252 0.03% hiê3m
251 0.02% quanh
251 0.02% ddâ3y
251 0.02% rút
250 0.02% bánh
249 0.02% ve3
248 0.02% tù
247 0.02% bô4ng

246 0.02% lê4
246 0.02% dung
242 0.02% tru'5c
241 0.02% thiê5t
241 0.02% lu'o'ng
241 0.02% may
240 0.02% 15
240 0.02% linh
239 0.02% gô2m
239 0.02% thu3y
239 0.02% huyê5n
239 0.02% núi
238 0.02% mô1i
237 0.02% tu'o'2ng
237 0.02% quê
237 0.02% mu'2ng
235 0.02% la
235 0.02% ddu'2ng
234 0.02% ddoa5n
234 0.02% bo'2
234 0.02% phú
234 0.02% thu'3
233 0.02% nghê2
233 0.02% so'1m
232 0.02% tâ5n
232 0.02% tra(m
232 0.02% 22
232 0.02% lai
232 0.02% iran
232 0.02% dda(ng
231 0.02% tha3o
231 0.02% xuyên
231 0.02% buô2n
230 0.02% ddu'1a
230 0.02% bào
230 0.02% nha(2m
229 0.02% to'2
228 0.02% phe
228 0.02% cu'
228 0.02% toán
228 0.02% phúc
228 0.02% khóc
227 0.02% máu
226 0.02% thoa5i
226 0.02% bò
225 0.02% hùng
225 0.02% cán
225 0.02% lu4
224 0.02% cu'o'2ng
224 0.02% du4ng
223 0.02% tu5
223 0.02% xuân

222 0.02% suy
221 0.02% phim
221 0.02% la5
220 0.02% nha5c
220 0.02% thi5t
220 0.02% gô1c
219 0.02% biê5n
219 0.02% uô1ng
219 0.02% trông
218 0.02% cu'o'1p
218 0.02% thúc
217 0.02% mâ5t
217 0.02% 23
217 0.02% 100
217 0.02% dâ5y
216 0.02% ngôn
215 0.02% câ5p
213 0.02% ha5t
213 0.02% hãng
213 0.02% ddích
213 0.02% vo’1i
212 0.02% nga5i
212 0.02% tránh
212 0.02% huy
211 0.02% lão
211 0.02% quý
211 0.02% bày
210 0.02% vai
210 0.02% ddói
208 0.02% 0
208 0.02% khô3
207 0.02% dâ2n
207 0.02% ba5c
207 0.02% yên
207 0.02% tung
207 0.02% ra(1n
206 0.02% ba5o
206 0.02% quang
205 0.02% dâ1u
205 0.02% in
205 0.02% o'n
205 0.02% thoát
205 0.02% ác
204 0.02% trào
204 0.02% nha
204 0.02% món
204 0.02% tru'o'ng
203 0.02% vu'o'2n
202 0.02% xúc
202 0.02% ta5m
201 0.02% thu5
201 0.02% t

201 0.02% thâ1p
201 0.02% ddi5ch
201 0.02% lu'o'1i
200 0.02% lành
200 0.02% quán
200 0.02% da5y
198 0.02% nô3
198 0.02% ddô5t
198 0.02% khâ3u
198 0.02% quâ2n
198 0.02% nhu4ng
198 0.02% tê5
197 0.02% cha(1n
197 0.02% cha(5t
197 0.02% vu'o'5t
197 0.02% to'1
197 0.02% nga(n
196 0.02% tàu
196 0.02% cu'5u
196 0.02% mo'
196 0.02% dành
196 0.02% gio’1i
196 0.02% lu'3a
195 0.02% ddo'4
194 0.02% lâ4n
194 0.02% dduô3i
194 0.02% ddo'5i
194 0.02% sa(4n
193 0.02% tiê5n
193 0.02% 2004
193 0.02% ma(1c
193 0.02% tra5i
191 0.02% qui
191 0.02% usd
191 0.02% âm
191 0.02% phiê1u
191 0.02% na(1m
190 0.02% câ1m
190 0.02% nuôi
190 0.02% dám
190 0.02% hu'o'ng
190 0.02% thuyê1t
189 0.02% lô4i
189 0.02% ca3ng
189 0.02% a
189 0.02% ddám
188 0.02% kích
188 0.02% hoá
188 0.02% châ1m
188 0.02% u3y
187 0.02% lúa
187 0.02% dden

187 0.02% ha5nh
187 0.02% chuyê1n
187 0.02% nhiêu
186 0.02% 21
186 0.02% ba(1n
186 0.02% tàn
185 0.02% thuê1
185 0.02% 50
184 0.02% cu5c
184 0.02% thu'o'3ng
184 0.02% nghi
183 0.02% luyê5n
183 0.02% chu’1c
183 0.02% du'1t
183 0.02% 6
183 0.02% chu5c
182 0.02% xâ1u
182 0.02% song
182 0.02% bu5ng
181 0.02% kém
181 0.02% su'1
181 0.02% chùa
181 0.02% kiê3u
181 0.02% cu4
180 0.02% tòa
180 0.02% mu'a
179 0.02% ro'i
178 0.02% môi
178 0.02% du'
178 0.02% quên
178 0.02% ddh
178 0.02% cho'5
178 0.02% trao
177 0.02% gió
177 0.02% chu'4
177 0.02% nghi3
177 0.02% co'2
176 0.02% sóng
176 0.02% giâ5n
176 0.02% ô
176 0.02% nô4i
176 0.02% giành
175 0.02% thâ5m
175 0.02% 17
175 0.02% can
175 0.02% nhiê4m
175 0.02% lính
175 0.02% kha(1c
175 0.02% cuô1n
174 0.02% tu’2
174 0.02% mô
174 0.02% triê2u

173 0.02% ly
173 0.02% thâ5p
173 0.02% kính
172 0.02% dây
172 0.02% bom
171 0.02% quen
170 0.02% quâ5n
170 0.02% dda3m
170 0.02% loan
170 0.02% bô1n
169 0.02% lu'o'5c
169 0.02% tuyê5t
169 0.02% kê
169 0.02% wto
169 0.02% giác
169 0.02% 18
168 0.02% thua
168 0.02% phô1i
168 0.02% liê5t
168 0.02% xu'a
168 0.02% cha(m
167 0.02% hô4
167 0.02% ba5i
167 0.02% ngài
167 0.02% giam
167 0.02% loài
167 0.02% trô1ng
167 0.02% hang
167 0.02% ddoán
167 0.02% toà
166 0.02% trì
166 0.02% lang
166 0.02% hát
165 0.02% cam
165 0.02% trích
165 0.02% bô2
165 0.02% chào
164 0.02% ho'i
164 0.02% nóng
163 0.02% miê5ng
162 0.02% nhé
162 0.02% a5
162 0.02% nhu
162 0.02% gio3i
161 0.02% nha(1c
161 0.02% sóc
161 0.02% mu4i
161 0.02% giê1t
161 0.02% ba5ch
160 0.02% 2003
160 0.02% 25
160 0.02% xung

159 0.02% bubu
159 0.02% phiên
158 0.02% giai
158 0.02% hê2
158 0.02% ràng
158 0.02% voi
158 0.02% lô1i
157 0.02% nguyê5n
157 0.02% dòng
157 0.02% co'm
157 0.02% abramoff
156 0.02% palestine
156 0.02% lâm
156 0.02% bè
156 0.02% tho'
155 0.02% ta(5ng
154 0.02% da
154 0.02% nhe5
154 0.02% xê1p
154 0.02% du'4
153 0.02% tuâ1n
153 0.02% al
153 0.02% thu’1
153 0.02% nu'3a
153 0.02% ngu'5a
153 0.02% hu'1a
152 0.02% niê5m
152 0.02% ba3ng
152 0.02% hezbollah
152 0.02% dê1
152 0.02% 19
152 0.02% diê5t
151 0.02% hút
151 0.02% heo
151 0.02% dê
151 0.02% chút
150 0.01% â1m
150 0.01% co'n
150 0.01% ddê2n
150 0.01% cà
150 0.01% soát
150 0.01% mày
150 0.01% phán
149 0.01% hoa3ng
149 0.01% 16
149 0.01% ánh
149 0.01% tri
149 0.01% internet
149 0.01% phê
149 0.01% túi
148 0.01% úc
148 0.01% c

148 0.01% tru'2
148 0.01% viê4n
148 0.01% khán
148 0.01% ha(1n
147 0.01% nu’o’1c
147 0.01% ích
146 0.01% chóng
146 0.01% mau
146 0.01% ngang
145 0.01% kho3e
145 0.01% thiê5u
145 0.01% ngon
144 0.01% sgk
144 0.01% lu'ng
144 0.01% gan
144 0.01% la5nh
144 0.01% ca(1t
143 0.01% tô2n
143 0.01% nghiêm
143 0.01% siêu
143 0.01% hè
142 0.01% ôm
142 0.01% lo'5n
142 0.01% lu'2a
142 0.01% kiên
142 0.01% dduôi
141 0.01% xa(ng
141 0.01% chu'4a
141 0.01% sa5ch
141 0.01% bãi
141 0.01% phô3
140 0.01% thù
140 0.01% tái
140 0.01% 24
140 0.01% tuyê1n
139 0.01% kiê2u
139 0.01% ngu4
139 0.01% ddành
139 0.01% chu'2ng
138 0.01% lu5c
138 0.01% 13
138 0.01% câ1t
138 0.01% bông
137 0.01% nga5c
137 0.01% gio5ng
137 0.01% cú
137 0.01% ddàm
137 0.01% u'o'ng
137 0.01% phái
136 0.01% thôn
136 0.01% ô1ng
136 0.01% khám

136 0.01% ma5c
136 0.01% chiê1u
136 0.01% loa5t
136 0.01% xu'o'ng
135 0.01% phâ5t
135 0.01% li
135 0.01% ddón
134 0.01% thu'a
134 0.01% thác
133 0.01% la(1ng
133 0.01% ngu'4
132 0.01% buôn
132 0.01% ve4
132 0.01% sô3
131 0.01% mu'o'2i
131 0.01% phù
131 0.01% li4nh
131 0.01% kho'3i
131 0.01% gô4
131 0.01% 14
131 0.01% uy
130 0.01% ghê1
130 0.01% ki5p
130 0.01% ga(1ng
129 0.01% ha(3n
129 0.01% (vnn
129 0.01% ngã
129 0.01% ddu'o'ng
129 0.01% phâ5n
129 0.01% di5p
128 0.01% khi3
128 0.01% ca(ng
128 0.01% sàng
128 0.01% da5
128 0.01% lâ2m
128 0.01% la(5ng
128 0.01% ro'2i
128 0.01% tha
127 0.01% thi5nh
127 0.01% nâ2y
127 0.01% thâ5n
126 0.01% ddoa5t
126 0.01% mái
126 0.01% trùn
126 0.01% lô2
126 0.01% se3
126 0.01% quy4
126 0.01% ddô1t
125 0.01% rô1i
125 0.01% qua5
125 0.01% tâ2m
125 0.01% ddâ5p

125 0.01% du'5a
125 0.01% dõi
125 0.01% cho'3
124 0.01% khô3ng
124 0.01% u'u
124 0.01% nô1i
124 0.01% dàng
124 0.01% cu'o'ng
123 0.01% nga(1n
123 0.01% me4
123 0.01% huâ1n
123 0.01% tô1n
123 0.01% khoa3n
123 0.01% mâ4u
122 0.01% chánh
122 0.01% thùng
122 0.01% tha3m
122 0.01% tru5
122 0.01% ô3n
122 0.01% nâng
122 0.01% cháy
121 0.01% di4
121 0.01% 40
121 0.01% kha(3ng
121 0.01% gia(5c
121 0.01% bo'1t
121 0.01% thâ3m
120 0.01% vu'4ng
120 0.01% xinh
119 0.01% tan
119 0.01% vinh
119 0.01% tim
119 0.01% che
119 0.01% nhiê5t
119 0.01% nô5p
119 0.01% ha5ng
119 0.01% ôi
119 0.01% ddo'5t
119 0.01% trâu
118 0.01% giàu
118 0.01% hô1i
118 0.01% bão
118 0.01% lông
118 0.01% gâ1p
118 0.01% thu'2a
118 0.01% túc
118 0.01% tha3
118 0.01% trúng
118 0.01% bê1n
118 0.01% ddt
117 0.01% 200
117 0.01% tho'5

116 0.01% hào
116 0.01% ddi3nh
116 0.01% tá
116 0.01% vang
116 0.01% lái
116 0.01% gd
116 0.01% ho5a
116 0.01% hô
116 0.01% hu'o'u
116 0.01% thuê
115 0.01% cành
114 0.01% lu'5a
114 0.01% khích
114 0.01% tí
114 0.01% mu'u
113 0.01% bu'4a
113 0.01% dda5n
113 0.01% vi5t
113 0.01% lo5t
113 0.01% ddèn
113 0.01% tha(2ng
113 0.01% ta(1c
112 0.01% su'3a
112 0.01% châ5m
112 0.01% liban
112 0.01% tao
112 0.01% trô1n
111 0.01% ta3i
111 0.01% vnn
111 0.01% gio'2i
111 0.01% cân
111 0.01% than
111 0.01% quà
111 0.01% rùa
110 0.01% quái
110 0.01% sa
110 0.01% giu'o'2ng
110 0.01% bâ1y
110 0.01% mác
110 0.01% su’5
109 0.01% treo
109 0.01% ddoan
109 0.01% trô5m
109 0.01% tròn
108 0.01% hôn
108 0.01% nha(1m
108 0.01% no'5
108 0.01% hiê1n
108 0.01% ho3a
108 0.01% tho3a
108 0.01% chi3nh
108 0.01% leo

107 0.01% mê
107 0.01% ma
107 0.01% dda(1c
107 0.01% câ5n
107 0.01% oan
107 0.01% tru’o’1c
107 0.01% loa5n
107 0.01% b
107 0.01% ra(ng
106 0.01% tu'o'i
106 0.01% hoan
106 0.01% sa(n
106 0.01% thang
105 0.01% hoa3
105 0.01% he5n
105 0.01% chu
105 0.01% ong
105 0.01% thu’5c
105 0.01% niê2m
104 0.01% du5
104 0.01% kho
104 0.01% hiê2n
104 0.01% ddô4
104 0.01% na(1ng
104 0.01% sói
104 0.01% cha(5n
104 0.01% 2002
103 0.01% 2001
103 0.01% quí
103 0.01% non
103 0.01% nêu
102 0.01% tra3i
102 0.01% nô4
102 0.01% chô1i
102 0.01% lâ5u
102 0.01% ruô5ng
102 0.01% mê5t
102 0.01% xô
102 0.01% lu'o'5t
102 0.01% tru'1ng
101 0.01% ddu5ng
101 0.01% thô
101 0.01% bù
101 0.01% ngoan
101 0.01% kha3o
101 0.01% râ5p
101 0.01% vi4nh
101 0.01% banh
101 0.01% võ
101 0.01% m
101 0.01% du'2ng
100 0.01% bô3

100 0.01% tru'2ng
100 0.01% bút
100 0.01% hài
100 0.01% trô2ng
100 0.01% tha5ch
100 0.01% ta3
100 0.01% ha(2ng
99 0.01% miê1n
99 0.01% dda5p
99 0.01% xóm
99 0.01% ngu'2ng
99 0.01% ddiê3n
99 0.01% ép
99 0.01% n
99 0.01% tra(ng
98 0.01% tâ2ng
98 0.01% h
98 0.01% co’
98 0.01% do5a
98 0.01% cu'1ng
98 0.01% bâ5t
97 0.01% thâ2m
97 0.01% hu'ng
97 0.01% mã
97 0.01% tóc
97 0.01% ha5ch
97 0.01% tho'm
97 0.01% hamas
97 0.01% putin
97 0.01% xu'1
96 0.01% pinôkiô
96 0.01% góc
96 0.01% ao
96 0.01% hung
95 0.01% ô1c
95 0.01% phan
95 0.01% la5t
95 0.01% ho’n
95 0.01% bia
95 0.01% sung
94 0.01% tru’o’3ng
94 0.01% re3
94 0.01% út
94 0.01% mô5
94 0.01% tra5m
93 0.01% ngu'o'i
93 0.01% ddua
93 0.01% hâ1p
93 0.01% dao
93 0.01% say
93 0.01% sàn
93 0.01% miê4n

93 0.01% tho'3
93 0.01% khiê1u
93 0.01% uy3
93 0.01% ta(1m
93 0.01% ta5p
93 0.01% bàng
93 0.01% dde
93 0.01% nhu’
93 0.01% kg
93 0.01% ga5o
93 0.01% ta3ng
92 0.01% miê1ng
92 0.01% va5n
91 0.01% bo'i
91 0.01% ngoái
91 0.01% ddô1n
91 0.01% lôi
91 0.01% tiê1c
90 0.01% khóa
90 0.01% rô2ng
90 0.01% à
90 0.01% ga(1n
90 0.01% thô3i
90 0.01% giâ1c
89 0.01% ha
89 0.01% ngo5n
89 0.01% truy
89 0.01% dã
89 0.01% thô3
89 0.01% giâ1u
89 0.01% kháng
89 0.01% 60
89 0.01% gia3ng
88 0.01% hu3y
88 0.01% tru'ng
88 0.01% go'4
88 0.01% ddê5
88 0.01% tán
88 0.01% tô
88 0.01% canh
88 0.01% tru'a
88 0.01% mo5c
88 0.01% rice
88 0.01% to’1i
87 0.01% na(4ng
87 0.01% sô1t
87 0.01% cãi
87 0.01% chín
87 0.01% tôm
87 0.01% soa5n
87 0.01% ân
86 0.01% nê2

86 0.01% (theo
86 0.01% dda(3ng
86 0.01% ca3n
86 0.01% su'o'1ng
86 0.01% nxbgd
86 0.01% sanh
86 0.01% su'4a
86 0.01% ru'o'5u
86 0.01% tràn
85 0.01% bâ2y
85 0.01% gánh
85 0.01% 80
85 0.01% gia3n
85 0.01% ki5ch
85 0.01% tu
85 0.01% chui
85 0.01% hái
84 0.01% chu5p
84 0.01% châ1n
84 0.01% tô1c
84 0.01% cu3
84 0.01% 28
84 0.01% bi
84 0.01% cho'5t
84 0.01% hoang
84 0.01% tùy
83 0.01% da5ng
83 0.01% trú
83 0.01% nâu
83 0.01% no'3
83 0.01% ddo
83 0.01% san
83 0.01% washington
83 0.01% 300
83 0.01% trà
83 0.01% cám
83 0.01% no5
82 0.01% tre
82 0.01% gói
82 0.01% thách
82 0.01% dô5i
81 0.01% da(5n
81 0.01% tho'2
81 0.01% ca(5p
81 0.01% câ3n
81 0.01% nxb
81 0.01% cha(ng
80 0.01% bô1i
80 0.01% mãn
80 0.01% ddiê5u
80 0.01% ddai
80 0.01% mu4

80 0.01% mô3
80 0.01% khúc
80 0.01% màn
80 0.01% vân
80 0.01% bâ5c
80 0.01% the
79 0.01% va
79 0.01% mùi
79 0.01% mát
79 0.01% the3
79 0.01% tóm
79 0.01% ái
79 0.01% quy3
79 0.01% phu'o'2ng
79 0.01% trùng
79 0.01% chôn
79 0.01% huê1
78 0.01% mâ5u
78 0.01% bèn
78 0.01% nhi
78 0.01% u'
78 0.01% liba(ng
78 0.01% khô
78 0.01% q
78 0.01% ha3o
78 0.01% khôn
78 0.01% dda(5ng
78 0.01% hô2n
77 0.01% hô5p
77 0.01% du’5
77 0.01% ta(1t
77 0.01% sâ1u
77 0.01% tam
77 0.01% canada
77 0.01% alice
77 0.01% khuyê1n
77 0.01% (
76 0.01% lu’5c
76 0.01% ti
76 0.01% viêm
76 0.01% trâ1n
76 0.01% 500
76 0.01% giâ5t
76 0.01% discovery
76 0.01% 0
75 0.01% ubnd
75 0.01% do5n
75 0.01% tiê4n
75 0.01% bu5i
75 0.01% triê1t
75 0.01% 2000
74 0.01% nhuâ5n

74 0.01% mây
74 0.01% d
74 0.01% vo3
74 0.01% klinsmann
74 0.01% tho5
74 0.01% 27
74 0.01% suâ1t
74 0.01% pakistan
74 0.01% ddâ5u
74 0.01% vo'4
73 0.01% california
73 0.01% thiê3u
73 0.01% súng
73 0.01% tiê5c
73 0.01% tha(1c
73 0.01% bá
73 0.01% tha(ng
72 0.01% thánh
72 0.01% nhâ1n
72 0.01% nhu’ng
72 0.01% reo
72 0.01% giày
72 0.01% 90
72 0.01% sa(1t
72 0.01% tu5t
72 0.01% kha3i
72 0.01% im
71 0.01% of
71 0.01% van
71 0.01% hãi
71 0.01% a3
71 0.01% rác
71 0.01% cu'o'1i
71 0.01% nga(1m
71 0.01% trùm
71 0.01% thao
71 0.01% qaeda
71 0.01% múa
71 0.01% cs
71 0.01% la(n
71 0.01% ào
71 0.01% cát
70 0.01% do5c
70 0.01% lu'o'4ng
70 0.01% cún
70 0.01% lhq
70 0.01% dde3
70 0.01% dò
70 0.01% cô5t
70 0.01% ô2
70 0.01% thu’o’ng
70 0.01% kín

70 0.01% cdd
70 0.01% ddu'5ng
70 0.01% ddê1
70 0.01% be
69 0.01% du'o'2ng
69 0.01% vê1t
69 0.01% hô1
69 0.01% mô2i
69 0.01% xâm
69 0.01% mét
69 0.01% nâ1u
69 0.01% thoa3
69 0.01% ti4nh
69 0.01% e
69 0.01% g8
69 0.01% rào
69 0.01% nasa
69 0.01% bê1p
69 0.01% ôn
69 0.01% mô2
68 0.01% kèm
68 0.01% u
68 0.01% lò
68 0.01% nhu5c
68 0.01% cu'2u
68 0.01% sót
68 0.01% singapore
68 0.01% va(1ng
68 0.01% dda(2ng
68 0.01% tàng
68 0.01% bs
68 0.01% ngô
68 0.01% chúc
68 0.01% rích
68 0.01% cai
68 0.01% trâ2m
68 0.01% pho3ng
68 0.01% khuôn
68 0.01% 70
67 0.01% ma3nh
67 0.01% oanh
67 0.01% new
67 0.01% tràng
67 0.01% ba(1p
67 0.01% ts
67 0.01% vé
67 0.01% ro'4
67 0.01% giây
67 0.01% 26
67 0.01% khánh
67 0.01% ddan
67 0.01% ma(1n

67 0.01% ballack
67 0.01% indonesia
67 0.01% dính
67 0.01% vo'2i
66 0.01% ddê1m
66 0.01% ddùa
66 0.01% én
66 0.01% sút
66 0.01% tiêm
66 0.01% ddeo
66 0.01% ddô2i
66 0.01% cha3y
66 0.01% mo’1i
66 0.01% lu’o’5ng
66 0.01% gã
65 0.01% lây
65 0.01% lãng
65 0.01% syria
65 0.01% cúm
65 0.01% khuynh
65 0.01% gio’2
65 0.01% da5i
65 0.01% sa5n
65 0.01% du'o'4ng
65 0.01% nha(5t
65 0.01% nghi5ch
65 0.01% quát
65 0.01% muôn
64 0.01% tám
64 0.01% rau
64 0.01% (hà
64 0.01% ali
64 0.01% pha
64 0.01% ngô5
64 0.01% khát
64 0.01% ca(1p
64 0.01% khen
64 0.01% bám
64 0.01% hòn
64 0.01% xì
64 0.01% bê1
64 0.01% ném
64 0.01% tru’o’2ng
64 0.01% dô1i
64 0.01% lô5n
64 0.01% vo5t
64 0.01% hét
63 0.01% tâ5t
63 0.01% ddôla
63 0.01% bô3ng
63 0.01% ddi4a
63 0.01% vong

63 0.01% ruô5t
63 0.01% hâ2m
63 0.01% x
63 0.01% nhái
63 0.01% mê5nh
63 0.01% dê5t
63 0.01% triê5t
63 0.01% sy4
63 0.01% câ3m
63 0.01% nhì
63 0.01% lu'3ng
63 0.01% thiê5p
62 0.01% bó
62 0.01% khiê1p
62 0.01% nát
62 0.01% bê
62 0.01% thép
62 0.01% truyê5n
62 0.01% mê1n
61 0.01% chu'1a
61 0.01% ddáo
61 0.01% mu5
61 0.01% â3n
61 0.01% khu'1
61 0.01% gián
61 0.01% de5p
61 0.01% su5p
61 0.01% mông
61 0.01% nhâ4n
61 0.01% luân
61 0.01% huyê1t
61 0.01% gu'o'ng
61 0.01% thu5y
61 0.01% dáng
61 0.01% bâ5n
61 0.01% trôi
60 0.01% sôi
60 0.01% 000dd
60 0.01% ngo5t
60 0.01% khâu
60 0.01% se
60 0.01% web
60 0.01% da(1t
60 0.01% ma5ch
60 0.01% chu4i
60 0.01% khung
60 0.01% ta3n
60 0.01% re4
60 0.01% thâ2u
60 0.01% ha3
60 0.01% ngâ2m
60 0.01% thuâ4n

60 0.01% làn
60 0.01% dâng
60 0.01% mê2m
59 0.01% tháo
59 0.01% phiê2n
59 0.01% ca(1n
59 0.01% vã
59 0.01% nhu'o'5ng
59 0.01% ddâm
59 0.01% tru'4
59 0.01% lo5
59 0.01% mù
59 0.01% baba
58 0.01% khói
58 0.01% câ1u
58 0.01% zarqawi
58 0.01% bén
58 0.01% lát
58 0.01% ddãi
58 0.01% lui
58 0.01% túng
58 0.01% nét
58 0.01% nhàng
58 0.01% liê2u
58 0.01% trúc
58 0.01% ddu4a
57 0.01% phu'o'5ng
57 0.01% 2007
57 0.01% xá
57 0.01% ngu'ng
57 0.01% co'3i
57 0.01% chuông
57 0.01% chìm
57 0.01% dô2n
57 0.01% gay
57 0.01% tu'1
57 0.01% thai
57 0.01% bu5t
56 0.01% chép
56 0.01% mo3i
56 0.01% gõ
56 0.01% xtô
56 0.01% hiê1u
56 0.01% k
56 0.01% to3a
56 0.01% 1975
56 0.01% ám
56 0.01% cha(n
56 0.01% thoi
55 0.01% lo'
55 0.01% phiê1n
55 0.01% gâ2m

55 0.01% lùng
55 0.01% lô
55 0.01% bách
55 0.01% pho
55 0.01% nghìn
55 0.01% hôi
55 0.01% da5o
55 0.01% mo3
55 0.01% 31
55 0.01% bê2n
55 0.01% mu'o'i
55 0.01% lô4
55 0.01% chuô2n
54 0.01% móc
54 0.01% tháp
54 0.01% cô3ng
54 0.01% hu5t
54 0.01% dày
54 0.01% ve
54 0.01% hâ5n
54 0.01% lido'
53 0.01% ddiê2n
53 0.01% dinh
53 0.01% thâm
53 0.01% quyê3n
53 0.01% dda(1t
53 0.01% khâ3n
53 0.01% so'5i
53 0.01% vâ1t
53 0.01% xóa
53 0.01% vô4
53 0.01% gio'
53 0.01% cô1t
53 0.01% móng
53 0.01% khuyên
53 0.01% nhi3
53 0.01% giáp
53 0.01% thi3nh
53 0.01% khiêm
53 0.01% ddu’a
53 0.01% di5u
53 0.01% chuyê2n
53 0.01% thoa3i
53 0.01% chót
53 0.01% bu'5c
53 0.01% giu’4a
52 0.01% manh
52 0.01% ngâ5p
52 0.01% bu'u
52 0.01% ddê
52 0.01% chu’o’ng
52 0.01% mô5ng

52 0.01% nuô1t
52 0.01% ung
52 0.01% su
52 0.01% la(1p
52 0.01% co
52 0.01% sô
52 0.01% dâu
52 0.01% huyê2n
52 0.01% tto
52 0.01% lâ5t
52 0.01% xu'1ng
52 0.01% ru3
52 0.01% lê5ch
51 0.01% xông
51 0.01% huynh
51 0.01% mu'5c
51 0.01% go5n
51 0.01% vu'o'n
51 0.01% kênh
51 0.01% cia
51 0.01% –
51 0.01% phu'1c
51 0.01% rãi
51 0.01% bô1c
51 0.01% tro’3
51 0.01% ddô1ng
51 0.01% khô1n
51 0.01% phâ1n
51 0.01% u'1c
51 0.01% vi4
51 0.01% mo'2
51 0.01% tiê2m
50 0.01% lem
50 0.01% serbia
50 0.01% lân
50 0.01% mò
50 0.01% vay
50 0.01% na
50 0.01% rumsfeld
50 0.01% lâ5n
50 0.01% thèm
50 0.01% ta5
50 0.01% bu'o'1m
50 0.01% tang
50 0.01% gio5t
50 0.01% kem
50 0.01% toa
50 0.01% bùng
50 0.01% ru'3a
50 0.01% ta5ng
50 0.01% ddu’1c
50 0.01% du'o'5c

49 0.00% pntr 49 0.00% beirut 49 0.00% khiê3n 49 0.00% khoe 49 0.00% bùi 49 0.00% lu'o'4i 49 0.00% va3 49 0.00% vu’2a 49 0.00% km 49 0.00% hu'1ng 49 0.00% 150 49 0.00% 4 49 0.00% lô5t 49 0.00% 29 49 0.00% zidane 49 0.00% ddô2n 49 0.00% hót 49 0.00% john 49 0.00% muô5n 49 0.00% chèo 49 0.00% va3i 49 0.00% ra(1c 49 0.00% tô3n 48 0.00% tê1t 48 0.00% gu'o'm 48 0.00% o 48 0.00% kiê5t 48 0.00% cu 48 0.00% â2m 48 0.00% york 48 0.00% bo5 48 0.00% suô1i 48 0.00% nhi5p 48 0.00% tu’5 48 0.00% ngu 48 0.00% ta5c 48 0.00% ddiê5p 48 0.00% vây 48 0.00% chán 47 0.00% la(1c 47 0.00% james 47 0.00% tú 47 0.00% thoa3ng 47 0.00% ru'5c 47 0.00% lo’2i 47 0.00% khâ1u 47 0.00% rung 47 0.00% xu 47 0.00% nô 47 0.00% 8406 47 0.00% khiêu 47 0.00% 45

47 0.00% chu3ng 47 0.00% cu'o'4ng 47 0.00% hoài 46 0.00% phu 46 0.00% cho5c 46 0.00% thpt 46 0.00% … 46 0.00% ngãi 46 0.00% la5m 46 0.00% che4 46 0.00% ngô4ng 46 0.00% hiê1m 46 0.00% nhâ2m 46 0.00% 600 46 0.00% cha5m 46 0.00% táo 46 0.00% béo 46 0.00% nô2i 46 0.00% chai 46 0.00% tách 46 0.00% su'o'ng 46 0.00% ddè 46 0.00% nâ1y 46 0.00% chuô5c 46 0.00% 56 46 0.00% lánh 46 0.00% tu5ng 45 0.00% ngõ 45 0.00% doa5 45 0.00% lâ1p 45 0.00% bát 45 0.00% 2010 45 0.00% nu'o'ng 45 0.00% l 45 0.00% costa 45 0.00% nhu'o'2ng 45 0.00% lô5i 45 0.00% mu'o'5n 45 0.00% chênh 45 0.00% phâ4u 45 0.00% cu3i 45 0.00% láng 45 0.00% nu'1o'c 45 0.00% cò 45 0.00% lam 45 0.00% ney 45 0.00% tha(1t 45 0.00% ty5 45 0.00% ma3i 45 0.00% g 45 0.00% ddòn 45 0.00% ví

44 0.00% ho’5p 44 0.00% xà 44 0.00% 1 44 0.00% ddàng 44 0.00% duyê5t 44 0.00% vi5nh 44 0.00% ngô1c 44 0.00% mô5c 44 0.00% ngu'5c 44 0.00% tuân 44 0.00% bô2i 44 0.00% thuâ2n 44 0.00% tùng 44 0.00% não 44 0.00% hi 44 0.00% p 44 0.00% lau 44 0.00% tho’2i 44 0.00% brazil 43 0.00% húc 43 0.00% chu’1ng 43 0.00% ngo'5i 43 0.00% co'1 43 0.00% basayev 43 0.00% mo’3 43 0.00% hô1t 43 0.00% má 43 0.00% mu’1c 43 0.00% rô1t 43 0.00% huy2nh 43 0.00% hàm 43 0.00% no 43 0.00% nv2 43 0.00% chô1t 43 0.00% hiv 43 0.00% lùi 43 0.00% la(ng 43 0.00% 1998 42 0.00% lo'4 42 0.00% ga(1t 42 0.00% gai 42 0.00% ha(ng 42 0.00% mi3m 42 0.00% gáy 42 0.00% nai 42 0.00% u'o'1t 42 0.00% ho'3 42 0.00% huô1ng 42 0.00% lu5t 42 0.00% râ2m 42 0.00% argentina 42 0.00% trèo

42 0.00% ki 42 0.00% xí 42 0.00% george 42 0.00% tý 42 0.00% run 42 0.00% khoan 42 0.00% tu’3 42 0.00% diê5u 42 0.00% 5 42 0.00% sa(1m 41 0.00% ddô1m 41 0.00% vu 41 0.00% ddiên 41 0.00% cuô2ng 41 0.00% uranium 41 0.00% thu'o'1c 41 0.00% ii 41 0.00% tâ3y 41 0.00% rô5 41 0.00% (tu'1c 41 0.00% óc 41 0.00% tu5i 41 0.00% duyên 41 0.00% nu5 41 0.00% bóp 41 0.00% rica 41 0.00% khuyê1t 41 0.00% buýt 41 0.00% xám 41 0.00% tâu 41 0.00% dàn 41 0.00% nhánh 41 0.00% quê1 41 0.00% cuô5n 41 0.00% rét 41 0.00% 1980 41 0.00% báu 40 0.00% bambi 40 0.00% gãy 40 0.00% go'3i 40 0.00% lê2u 40 0.00% bi3 40 0.00% â1p 40 0.00% saddam 40 0.00% ho3ng 40 0.00% giê2ng 40 0.00% tuyê1t 40 0.00% bênh 40 0.00% xhcn 40 0.00% khoán 40 0.00% cu3ng 40 0.00% lô5c

40 0.00% giâ2u 40 0.00% bê5 40 0.00% ddinh 40 0.00% 1999 40 0.00% thu3ng 40 0.00% pa(ng 40 0.00% ma5o 40 0.00% ukraine 40 0.00% ddiêu 40 0.00% cu’1u 40 0.00% bô5i 39 0.00% hiê3n 39 0.00% vâ4y 39 0.00% blair 39 0.00% saudi 39 0.00% viê1ng 39 0.00% ddu’o’2ng 39 0.00% 39 0.00% na3y 39 0.00% ma5n 39 0.00% daisy 39 0.00% bo3ng 39 0.00% cu'o'4i 39 0.00% lãm 39 0.00% vâng 39 0.00% cha3 39 0.00% su'2ng 39 0.00% khô1ng 39 0.00% ghé 39 0.00% ô3 39 0.00% lung 39 0.00% khéo 39 0.00% êm 39 0.00% tro’5 39 0.00% tu’ 39 0.00% di5 39 0.00% bê2 39 0.00% phâ4n 39 0.00% hán 39 0.00% khoái 38 0.00% dô4 38 0.00% sharon 38 0.00% rao 38 0.00% diê2u 38 0.00% cúp 38 0.00% texas 38 0.00% ap 38 0.00% 38 0.00% gìn 38 0.00% buông 38 0.00% nga3 38 0.00% chô3m

38 0.00% bv 38 0.00% thu’o’2ng 38 0.00% bo 38 0.00% gác 38 0.00% giu’4 38 0.00% cù 38 0.00% man 38 0.00% lo5c 37 0.00% cua3 37 0.00% xách 37 0.00% cúi 37 0.00% nghiê5n 37 0.00% rách 37 0.00% co'4 37 0.00% vo'3 37 0.00% kho'i 37 0.00% nha(1t 37 0.00% giâ2y 37 0.00% tu3 37 0.00% (1 37 0.00% sánh 37 0.00% tông 37 0.00% khoang 37 0.00% nghiêng 37 0.00% w 37 0.00% xu'o'3ng 37 0.00% mexico 37 0.00% lo’1n 37 0.00% thiê1p 37 0.00% bu'2ng 37 0.00% trô5n 37 0.00% chén 37 0.00% 48 37 0.00% sweden 37 0.00% (tu'2 37 0.00% tuê5 37 0.00% tu'5a 37 0.00% kiê5m 37 0.00% borowski 37 0.00% 55 37 0.00% tu’o’1ng 37 0.00% 400 37 0.00% gâ5t 36 0.00% saigon 36 0.00% lu'o'5n 36 0.00% hu' 36 0.00% muô4i 36 0.00% khoác 36 0.00% peter 36 0.00% liêm 36 0.00% rê3 36 0.00% câ5y

36 0.00% de 36 0.00% lãi 36 0.00% gieo 36 0.00% sâ5p 36 0.00% tê 36 0.00% thóc 36 0.00% chà 36 0.00% hu’4u 36 0.00% bbc 36 0.00% thoái 36 0.00% hô4n 36 0.00% hu 36 0.00% vy4 36 0.00% cúng 36 0.00% giêng 36 0.00% tô2i 36 0.00% cô1m 36 0.00% nho5c 36 0.00% 38 36 0.00% lewis 35 0.00% giàn 35 0.00% ngu5 35 0.00% asean 35 0.00% 32 35 0.00% xoay 35 0.00% xu'ng 35 0.00% 34 35 0.00% hoãn 35 0.00% chuô1i 35 0.00% bo5c 35 0.00% ngo' 35 0.00% no’i 35 0.00% dô1c 35 0.00% thà 35 0.00% mi 35 0.00% 42 35 0.00% giô 35 0.00% trù 35 0.00% nhi5 35 0.00% mu'o'1p 35 0.00% ngón 35 0.00% do'3 35 0.00% trói 35 0.00% châ5t 34 0.00% trân 34 0.00% lu4ng 34 0.00% biê1u 34 0.00% michael 34 0.00% ghê 34 0.00% ddu'1t 34 0.00% 35 34 0.00% ecuador

34 0.00% kìa 34 0.00% rình 34 0.00% hãnh 34 0.00% apec 34 0.00% giê1ng 34 0.00% dumbo 34 0.00% 1995 34 0.00% gâ5y 34 0.00% trâ5t 34 0.00% lúng 34 0.00% vuô1t 34 0.00% ngu'o'4ng 34 0.00% nghiê5t 34 0.00% khôi 34 0.00% william 34 0.00% tâ2n 34 0.00% reuters 34 0.00% i 34 0.00% dâm 34 0.00% sáo 34 0.00% ngo'i 34 0.00% montenegro 33 0.00% tq 33 0.00% dùi 33 0.00% trinh 33 0.00% 1994 33 0.00% thói 33 0.00% ddáy 33 0.00% trá 33 0.00% gào 33 0.00% chô2n 33 0.00% 2 33 0.00% tráng 33 0.00% ngâ1t 33 0.00% thung 33 0.00% mâ2m 33 0.00% miss 33 0.00% lu'o'2ng 33 0.00% mô2m 33 0.00% ô1m 33 0.00% va(ng 33 0.00% mâ5p 33 0.00% mu'o'1n 33 0.00% eu 33 0.00% huê2 33 0.00% chô1c 33 0.00% lót 33 0.00% bành 33 0.00% 33 33 0.00% quyên 33 0.00% cô1i 33 0.00% hoa5

33 0.00% lu'5 32 0.00% lén 32 0.00% pho'i 32 0.00% 1990 32 0.00% xao 32 0.00% chê 32 0.00% milosevic 32 0.00% xé 32 0.00% gô1i 32 0.00% chiêu 32 0.00% clinton 32 0.00% ddâ5m 32 0.00% ven 32 0.00% khô1c 32 0.00% nê1p 32 0.00% lâ1n 32 0.00% su'o'3i 32 0.00% nhu3 32 0.00% la(5n 32 0.00% huy3 32 0.00% thét 32 0.00% yê3m 32 0.00% râ2y 32 0.00% chu’a 32 0.00% cho'1p 32 0.00% khoe3 32 0.00% kiêu 32 0.00% ddu5c 32 0.00% dd 32 0.00% boeing 32 0.00% tha5c 32 0.00% dán 31 0.00% cerberus 31 0.00% xê1 31 0.00% chuô2ng 31 0.00% vòi 31 0.00% côn 31 0.00% mâu 31 0.00% hãn 31 0.00% trêu 31 0.00% gio3 31 0.00% dô2i 31 0.00% cp 31 0.00% bô5c 31 0.00% 36 31 0.00% chechnya 31 0.00% tiê5p 31 0.00% 2008 31 0.00% khoá 31 0.00% nhãn 31 0.00% hh 31 0.00% màng

31 0.00% 9 31 0.00% cay 31 0.00% tít 31 0.00% quét 31 0.00% a5t 31 0.00% va5ch 31 0.00% xuôi 31 0.00% tâ2u 31 0.00% tuy2 31 0.00% cho'1 31 0.00% phu'o'1c 31 0.00% times 31 0.00% xíu 31 0.00% kiêm 31 0.00% so3i 31 0.00% va(2n 31 0.00% la5ng 31 0.00% bóc 31 0.00% phì 31 0.00% afp 31 0.00% de3 31 0.00% hò 31 0.00% táng 31 0.00% tlt 31 0.00% la(m 30 0.00% bâ4y 30 0.00% ddâ2m 30 0.00% ngâ5m 30 0.00% kê5 30 0.00% mài 30 0.00% ga5ch 30 0.00% thoáng 30 0.00% vu5t 30 0.00% aids 30 0.00% 800 30 0.00% châ5n 30 0.00% hon 30 0.00% 46 30 0.00% phu’o’ng 30 0.00% buô2ng 30 0.00% florida 30 0.00% (có 30 0.00% ngu'5 30 0.00% ha(m 30 0.00% páo 30 0.00% xù 30 0.00% líu 30 0.00% tru'o'5t 30 0.00% gdp 30 0.00% 75 30 0.00% tâ1p 30 0.00% dì

30 0.00% xa(1n 30 0.00% u'a 30 0.00% há 30 0.00% tia 30 0.00% chu'o'1c 30 0.00% ga5t 30 0.00% 41 30 0.00% xoa 30 0.00% 1996 30 0.00% thô1t 30 0.00% mãnh 30 0.00% bin 30 0.00% dè 30 0.00% mì 30 0.00% nga5ch 30 0.00% fallon 30 0.00% yê1n 30 0.00% nè 29 0.00% ra5p 29 0.00% dn 29 0.00% 57 29 0.00% bo' 29 0.00% lo3ng 29 0.00% ro'1t 29 0.00% bâ3n 29 0.00% chu'3i 29 0.00% david 29 0.00% hèn 29 0.00% ve5t 29 0.00% môncung 29 0.00% vác 29 0.00% ca(1m 29 0.00% com 29 0.00% bini 29 0.00% cõi 29 0.00% cô1c 29 0.00% 1945 29 0.00% ddúc 29 0.00% khe4 29 0.00% nê3 29 0.00% ddo'1n 29 0.00% (my4 29 0.00% thô1i 29 0.00% nhô1t 29 0.00% xài 29 0.00% khuya 29 0.00% lu'o'5m 29 0.00% tha(1m 29 0.00% giu5c 29 0.00% ngâ2n 29 0.00% arabia 29 0.00% julie

29 0.00% cho5i 29 0.00% khoai 29 0.00% liêu 29 0.00% hâm 29 0.00% su3a 28 0.00% ddùng 28 0.00% abu 28 0.00% thiêng 28 0.00% nhã 28 0.00% chích 28 0.00% xâ3y 28 0.00% ddâ1m 28 0.00% bill 28 0.00% khuâ1t 28 0.00% le3 28 0.00% còi 28 0.00% men 28 0.00% chém 28 0.00% na5i 28 0.00% nan 28 0.00% chinh 28 0.00% go'5i 28 0.00% cô1ng 28 0.00% thám 28 0.00% râu 28 0.00% 37 28 0.00% houston 28 0.00% hu'u 28 0.00% venezuela 28 0.00% ke3o 28 0.00% ti5 28 0.00% ghana 28 0.00% hê4 28 0.00% malaysia 28 0.00% chìa 28 0.00% toa3 28 0.00% nhu'5a 28 0.00% cu'3u 28 0.00% ta(m 28 0.00% nho5n 28 0.00% ddâ4m 28 0.00% 8 28 0.00% nô1t 28 0.00% 39 28 0.00% gs 28 0.00% ca(m 27 0.00% kennedy 27 0.00% du’o’4ng 27 0.00% xo' 27 0.00% ngo'4 27 0.00% pinocchio 27 0.00% luô2ng

27 0.00% ê 27 0.00% trê4 27 0.00% hiê1p 27 0.00% bùn 27 0.00% materazzi 27 0.00% na(1p 27 0.00% ru3i 27 0.00% s 27 0.00% ngo3 27 0.00% ke5o 27 0.00% fred 27 0.00% he5p 27 0.00% columbia 27 0.00% pháo 27 0.00% quâ5t 27 0.00% nhiê4u 27 0.00% khinh 27 0.00% thuy3 27 0.00% bo'1i 27 0.00% su’3 27 0.00% ca5n 27 0.00% go 27 0.00% vách 27 0.00% ham 27 0.00% hussein 27 0.00% uâ1t 27 0.00% afghanistan 27 0.00% po' 27 0.00% 250 27 0.00% tha3n 26 0.00% giã 26 0.00% tro5 26 0.00% 44 26 0.00% hlv 26 0.00% ddiê2m 26 0.00% cúc 26 0.00% tô1ng 26 0.00% sòng 26 0.00% xe3 26 0.00% côte 26 0.00% tri5nh 26 0.00% vu’5c 26 0.00% vnpt 26 0.00% nhào 26 0.00% evn 26 0.00% tiê5m 26 0.00% nha(1n 26 0.00% tu3i 26 0.00% finale 26 0.00% du'ng 26 0.00% châm 26 0.00% ddu

26 0.00% 47 26 0.00% quâ1y 26 0.00% 26 0.00% lênin 26 0.00% baghdad 26 0.00% khe 26 0.00% tui 26 0.00% bì 26 0.00% vu'1t 26 0.00% delay 26 0.00% búa 26 0.00% lon 26 0.00% tu'5u 26 0.00% mo3ng 26 0.00% ô2n 26 0.00% co5p 26 0.00% nhô3 26 0.00% trâ2u 26 0.00% na5t 26 0.00% lebanon 26 0.00% dâ4u 26 0.00% vu'o'1ng 26 0.00% hum 26 0.00% uyên 26 0.00% soi 26 0.00% cào 26 0.00% né 26 0.00% thu’1c 26 0.00% re5t 26 0.00% nho'3 26 0.00% roh 26 0.00% me3 26 0.00% lay 26 0.00% giáng 26 0.00% tha3i 26 0.00% duâ3n 25 0.00% 1997 25 0.00% my5 25 0.00% and 25 0.00% ngu’ng 25 0.00% dding 25 0.00% (trong 25 0.00% rai 25 0.00% cau 25 0.00% dâ5p 25 0.00% tehran 25 0.00% ghen 25 0.00% tu’1c 25 0.00% bô 25 0.00% xa3 25 0.00% tru'o'1ng 25 0.00% tò

25 0.00% gaza 25 0.00% búp 25 0.00% râ4y 25 0.00% ru'o'1c 25 0.00% ngô5t 25 0.00% chu'2a 25 0.00% ghét 25 0.00% vu4ng 25 0.00% quãng 25 0.00% tu'o'1i 25 0.00% vo'2 25 0.00% ra5ng 25 0.00% thùy 25 0.00% nhâ5u 25 0.00% rít 25 0.00% nâ1m 25 0.00% tu'o'1c 25 0.00% yê1t 25 0.00% los 25 0.00% nhi5n 25 0.00% dào 25 0.00% chô1n 25 0.00% xôi 25 0.00% angeles 25 0.00% ngu'2a 25 0.00% chóc 25 0.00% nhu'1c 25 0.00% ke5t 25 0.00% hao 24 0.00% lào 24 0.00% hoa5i 24 0.00% bàu 24 0.00% le5 24 0.00% rock 24 0.00% berlin 24 0.00% (khoa3ng 24 0.00% ga 24 0.00% nô2ng 24 0.00% toa5 24 0.00% ngà 24 0.00% petersburg 24 0.00% rooney 24 0.00% ttct 24 0.00% ri3 24 0.00% cui 24 0.00% phu5ng 24 0.00% 49 24 0.00% cho’i 24 0.00% lô2ng 24 0.00% phiê5t 24 0.00% trinidad 24 0.00% ra3i

24 0.00% a3o 24 0.00% nút 24 0.00% lu'1a 24 0.00% quâ1n 24 0.00% ve5n 24 0.00% cha(5ng 24 0.00% ô5p 24 0.00% 1991 24 0.00% clb 24 0.00% ranh 24 0.00% airlines 24 0.00% hoành 24 0.00% tru5c 24 0.00% túy 24 0.00% mercosur 24 0.00% o'1t 24 0.00% khao 24 0.00% tà 24 0.00% miên 24 0.00% 1973 24 0.00% dang 24 0.00% ronaldo 24 0.00% thu’o’5ng 24 0.00% dai 24 0.00% lo’5i 24 0.00% mênh 24 0.00% bô2n 24 0.00% bem 24 0.00% yê1m 23 0.00% nhâ5m 23 0.00% nê 23 0.00% rã 23 0.00% shin 23 0.00% gu5c 23 0.00% u3i 23 0.00% phôi 23 0.00% thu’ 23 0.00% vo'1 23 0.00% rô3 23 0.00% hs 23 0.00% vo' 23 0.00% da(5m 23 0.00% hí 23 0.00% dãy 23 0.00% grace 23 0.00% thính 23 0.00% 5dd 23 0.00% nhê5n 23 0.00% cày 23 0.00% tuý 23 0.00% hâu 23 0.00% phanh

23 0.00% nigeria 23 0.00% nhung 23 0.00% nu'1c 23 0.00% u'ng 23 0.00% 1974 23 0.00% tra(1c 23 0.00% cút 23 0.00% bích 23 0.00% 52 23 0.00% kiê1p 23 0.00% jordan 23 0.00% tro5n 23 0.00% condoleezza 23 0.00% do'2i 23 0.00% sex 23 0.00% tobago 23 0.00% thòi 23 0.00% (trung 23 0.00% philippines 23 0.00% elottery 23 0.00% nhu'o'5c 23 0.00% nhô 23 0.00% hoa5n 23 0.00% moscow 23 0.00% khô1 23 0.00% xót 23 0.00% chay 23 0.00% nhu'1t 23 0.00% gán 23 0.00% smddh 23 0.00% ngo5 23 0.00% gióng 23 0.00% nho 23 0.00% hãm 23 0.00% do'4 23 0.00% ddâ5y 23 0.00% ráp 22 0.00% nô5 22 0.00% lu'2ng 22 0.00% vung 22 0.00% bergkamp 22 0.00% gia(5t 22 0.00% arsenal 22 0.00% su’1c 22 0.00% nòng 22 0.00% nha5y 22 0.00% ricardo 22 0.00% h5n1 22 0.00% ddo5ng 22 0.00% cu’1 22 0.00% ba5t 22 0.00% xiê1c

22 0.00% liê4u 22 0.00% mòn 22 0.00% chuô4i 22 0.00% 700 22 0.00% qua(ng 22 0.00% 1982 22 0.00% st 22 0.00% xúm 22 0.00% xa5 22 0.00% ga(5m 22 0.00% telecom 22 0.00% â3m 22 0.00% be5p 22 0.00% thò 22 0.00% vaccine 22 0.00% rô5n 22 0.00% lu’o’ng 22 0.00% súc 22 0.00% ddôn 22 0.00% cu'o'1c 22 0.00% tu'ng 22 0.00% kahn 22 0.00% túm 22 0.00% huê5 22 0.00% tru'1o'c 22 0.00% ngoa5n 22 0.00% vô2 22 0.00% khiêng 22 0.00% nghênh 22 0.00% ddam 22 0.00% khái 22 0.00% go'1m 22 0.00% kìm 22 0.00% bernard 22 0.00% lu'o'1t 22 0.00% nhâ1c 22 0.00% chua 22 0.00% dda(1n 22 0.00% lô5ng 22 0.00% xáo 22 0.00% klose 22 0.00% ngâm 22 0.00% bâ3m 22 0.00% yasushi 22 0.00% jose 22 0.00% siê1t 22 0.00% sà 22 0.00% chu'ng 22 0.00% nón 22 0.00% ngó 21 0.00% vãn 21 0.00% ro

21 0.00% bã 21 0.00% 58 21 0.00% annan 21 0.00% toan 21 0.00% túp 21 0.00% ni 21 0.00% league 21 0.00% lilly 21 0.00% ddu'5c 21 0.00% 1979 21 0.00% hô1c 21 0.00% ky5 21 0.00% usa 21 0.00% tg 21 0.00% donald 21 0.00% ke5p 21 0.00% dép 21 0.00% si 21 0.00% ma(ng 21 0.00% 1976 21 0.00% neuville 21 0.00% mamút 21 0.00% tunisia 21 0.00% cong 21 0.00% lâ2y 21 0.00% henry 21 0.00% chuô5ng 21 0.00% tha5o 21 0.00% kehl 21 0.00% nhai 21 0.00% munich 21 0.00% cài 21 0.00% ngu'3a 21 0.00% xít 21 0.00% rê4 21 0.00% té 21 0.00% matt 21 0.00% du’1t 21 0.00% tom 21 0.00% timor 21 0.00% diego 21 0.00% hê5t 21 0.00% ho5ng 21 0.00% hân 21 0.00% ma(1ng 20 0.00% tro'5n 20 0.00% kê2 20 0.00% ddô1 20 0.00% mao 20 0.00% oán 20 0.00% (chi3 20 0.00% sly

20 0.00% nhan 20 0.00% hu3 20 0.00% gom 20 0.00% 59 20 0.00% lehmann 20 0.00% d'ivoire 20 0.00% ga3 20 0.00% bo'm 20 0.00% va(1t 20 0.00% chelsea 20 0.00% nàn 20 0.00% laden 20 0.00% haditha 20 0.00% kofi 20 0.00% xi 20 0.00% nén 20 0.00% 51 20 0.00% he3m 20 0.00% xi3 20 0.00% cô4 20 0.00% co5c 20 0.00% sàigòn 20 0.00% cu'ng 20 0.00% ni3 20 0.00% son 20 0.00% giu'o'ng 20 0.00% trade 20 0.00% thê2 20 0.00% miê5t 20 0.00% www 20 0.00% co5 20 0.00% râ2u 20 0.00% xích 20 0.00% ráo 20 0.00% hddnd 20 0.00% ahmadinejad 20 0.00% ruô2i 20 0.00% nhát 20 0.00% oai 20 0.00% nu’4 20 0.00% tha(5ng 19 0.00% isabel 19 0.00% khê1 19 0.00% mãng 19 0.00% 120 19 0.00% nít 19 0.00% vôi 19 0.00% vi3a 19 0.00% cu'5 19 0.00% tro 19 0.00% vddv 19 0.00% ách

19 0.00% iii 19 0.00% gazprom 19 0.00% karaoke 19 0.00% quâ2y 19 0.00% lùn 19 0.00% schweinsteiger 19 0.00% kissinger 19 0.00% le 19 0.00% 110 19 0.00% 53 19 0.00% (2 19 0.00% na3n 19 0.00% phâ1t 19 0.00% méo 19 0.00% nha5t 19 0.00% angola 19 0.00% phun 19 0.00% (không 19 0.00% richter 19 0.00% tran 19 0.00% vo'5t 19 0.00% jack 19 0.00% châ3n 19 0.00% ddò 19 0.00% add 19 0.00% thiêu 19 0.00% tô1p 19 0.00% lâ3n 19 0.00% dda3 19 0.00% vo 19 0.00% phách 19 0.00% de3o 19 0.00% ó 19 0.00% tqlc 19 0.00% hhhv 19 0.00% khu'o'ng 19 0.00% birbal 19 0.00% mép 19 0.00% vo'1t 19 0.00% buô2m 19 0.00% mail 19 0.00% pha(3ng 19 0.00% da3i 19 0.00% phui 19 0.00% rong 19 0.00% nâ1p 19 0.00% cu5m 19 0.00% ddcs 19 0.00% ghép 19 0.00% vladimir 19 0.00% khía 19 0.00% 83

19 0.00% mâ2u 19 0.00% so'2 19 0.00% pop 19 0.00% lê2 19 0.00% mâ3u 19 0.00% chôm 19 0.00% universe 19 0.00% f 19 0.00% 85 19 0.00% vút 18 0.00% chèn 18 0.00% no5c 18 0.00% hu’o’3ng 18 0.00% tha(1n 18 0.00% tra(n 18 0.00% for 18 0.00% ván 18 0.00% thê 18 0.00% phen 18 0.00% j 18 0.00% xa3o 18 0.00% merkel 18 0.00% qui4 18 0.00% album 18 0.00% am 18 0.00% nhét 18 0.00% ca5p 18 0.00% bê3 18 0.00% mê4 18 0.00% mi5t 18 0.00% robert 18 0.00% râ5m 18 0.00% top 18 0.00% rudy 18 0.00% nhót 18 0.00% canxi 18 0.00% â5p 18 0.00% cole 18 0.00% ca(3ng 18 0.00% ngu'3i 18 0.00% su'u 18 0.00% nu'o'1ng 18 0.00% scowcroft 18 0.00% trô3 18 0.00% to5a 18 0.00% ddê5m 18 0.00% ddính 18 0.00% to' 18 0.00% va5i 18 0.00% trô5i 18 0.00% nho' 18 0.00% kipper

18 0.00% (tp 18 0.00% su'a 18 0.00% barcelona 18 0.00% liège 18 0.00% elmer 18 0.00% voa 18 0.00% trán 18 0.00% mít 18 0.00% la(2n 18 0.00% nâ1c 18 0.00% hu’ 18 0.00% khép 18 0.00% mo'4 18 0.00% 1993 18 0.00% kpa(h 18 0.00% pghh 18 0.00% thu'2ng 18 0.00% tòan 18 0.00% lu'o'2i 18 0.00% (nê1u 18 0.00% nguyê2n 18 0.00% cáp 18 0.00% mac 18 0.00% post 18 0.00% 62 18 0.00% phê1 18 0.00% câ1n 18 0.00% côi 18 0.00% cuô1ng 18 0.00% muông 18 0.00% xiê1t 18 0.00% ru4 18 0.00% (sinh 18 0.00% cuô1c 18 0.00% 65 18 0.00% dda(1ng 18 0.00% 1970 18 0.00% bu'ng 18 0.00% paraguay 18 0.00% xoá 18 0.00% ma3ng 18 0.00% phùng 18 0.00% lo'2 18 0.00% scanlon 17 0.00% bâ1m 17 0.00% hú 17 0.00% fc 17 0.00% vét 17 0.00% 3 17 0.00% vhtt 17 0.00% muô1i 17 0.00% nha(n

17 0.00% ngàng 17 0.00% mô1t 17 0.00% bô5t 17 0.00% u’1ng 17 0.00% 2009 17 0.00% kho'1p 17 0.00% thu'5 17 0.00% (hay 17 0.00% láo 17 0.00% thình 17 0.00% quy2 17 0.00% sa(5c 17 0.00% bet 17 0.00% su'o'2n 17 0.00% lí 17 0.00% 170 17 0.00% cuba 17 0.00% ló 17 0.00% bn 17 0.00% nu'1t 17 0.00% tát 17 0.00% ngu3i 17 0.00% thâ1m 17 0.00% hersh 17 0.00% bi5t 17 0.00% mi5 17 0.00% (a3nh 17 0.00% qh 17 0.00% dda(1p 17 0.00% ngào 17 0.00% bú 17 0.00% lâ4y 17 0.00% chen 17 0.00% giãn 17 0.00% dvd 17 0.00% còng 17 0.00% bombay 17 0.00% iaea 17 0.00% tha5nh 17 0.00% háo 17 0.00% reed 17 0.00% tn 17 0.00% chô2m 17 0.00% nôn 17 0.00% 43 17 0.00% 61 17 0.00% mahmoud 17 0.00% choàng 17 0.00% no'1i 17 0.00% náo 17 0.00% video 17 0.00% (nhu'

17 0.00% java 17 0.00% thúy 17 0.00% núp 17 0.00% ddo'1i 17 0.00% chao 17 0.00% ngát 17 0.00% khuâ3n 17 0.00% ireland 17 0.00% ngu' 17 0.00% cua 17 0.00% nu’4a 16 0.00% miê5n 16 0.00% figo 16 0.00% nghiê1n 16 0.00% vành 16 0.00% sxh 16 0.00% nhí 16 0.00% hong 16 0.00% lim 16 0.00% rên 16 0.00% hiên 16 0.00% cody 16 0.00% gao 16 0.00% cu’5c 16 0.00% rán 16 0.00% 81 16 0.00% nhi4 16 0.00% 160 16 0.00% 67 16 0.00% website 16 0.00% bolivia 16 0.00% virginia 16 0.00% tro' 16 0.00% kì 16 0.00% triê2n 16 0.00% xua 16 0.00% 1986 16 0.00% beckenbauer 16 0.00% bhatia 16 0.00% robben 16 0.00% xôn 16 0.00% lâ3m 16 0.00% dortmund 16 0.00% frings 16 0.00% cha3i 16 0.00% bô1ng 16 0.00% dduô1i 16 0.00% beckham 16 0.00% tony 16 0.00% del 16 0.00% tour 16 0.00% thòng

16 0.00% bu 16 0.00% da5n 16 0.00% sen 16 0.00% chùm 16 0.00% ro'm 16 0.00% thuo'3 16 0.00% chu’4a 16 0.00% tu’o’2ng 16 0.00% 69 16 0.00% shiite 16 0.00% 1978 16 0.00% lách 16 0.00% ngây 16 0.00% (mô5t 16 0.00% quâ3n 16 0.00% btc 16 0.00% nato 16 0.00% sâ2u 16 0.00% ddày 16 0.00% khoát 16 0.00% xu'3a 16 0.00% vênh 16 0.00% bianca 16 0.00% bremen 16 0.00% la5y 16 0.00% ùn 16 0.00% ddsq 16 0.00% rê5t 16 0.00% tu’2ng 16 0.00% gas 16 0.00% mâm 16 0.00% ga(5t 16 0.00% hu'o'2ng 16 0.00% axít 16 0.00% hecta 16 0.00% kitty 16 0.00% gang 16 0.00% cha(1p 16 0.00% dâ1n 16 0.00% gôn 16 0.00% 63 16 0.00% ganh 16 0.00% nha3 16 0.00% 64 16 0.00% london 16 0.00% nhe5n 16 0.00% nho'n 16 0.00% australia 16 0.00% bong 16 0.00% nâ5p 16 0.00% dô1t 16 0.00% na5p

16 0.00% campuchia 16 0.00% no3 16 0.00% ma5t 16 0.00% be3 16 0.00% vàn 16 0.00% lima 16 0.00% su5c 16 0.00% tivi 16 0.00% tra5ch 15 0.00% 1949 15 0.00% hòng 15 0.00% cd 15 0.00% vcd 15 0.00% cô5i 15 0.00% vâ2n 15 0.00% waterford 15 0.00% plastech 15 0.00% dda5c 15 0.00% vu5ng 15 0.00% gia(2ng 15 0.00% nepal 15 0.00% phiêu 15 0.00% lee 15 0.00% khiê1m 15 0.00% tru'o'2n 15 0.00% kong 15 0.00% thu5c 15 0.00% news 15 0.00% gâ2y 15 0.00% khoa(n 15 0.00% nga5t 15 0.00% large 15 0.00% khoét 15 0.00% mâ4n 15 0.00% bu'o'1u 15 0.00% lút 15 0.00% ò 15 0.00% châ5u 15 0.00% dili 15 0.00% junta 15 0.00% ru'o'3i 15 0.00% cõng 15 0.00% lít 15 0.00% hâ1t 15 0.00% dhs 15 0.00% 54 15 0.00% vy 15 0.00% bâ3y 15 0.00% roi 15 0.00% tím 15 0.00% sv 15 0.00% (chu3

15 0.00% cheney 15 0.00% pap 15 0.00% vùi 15 0.00% ru5ng 15 0.00% cháo 15 0.00% ru'o'4i 15 0.00% 1989 15 0.00% chói 15 0.00% carlos 15 0.00% chdcnd 15 0.00% gàng 15 0.00% ddãng 15 0.00% len 15 0.00% xen 15 0.00% nghe5n 15 0.00% dec 15 0.00% lm 15 0.00% qua5t 15 0.00% stuttgart 15 0.00% cho' 15 0.00% rành 15 0.00% dubai 15 0.00% bi5p 15 0.00% miu 15 0.00% war 15 0.00% vóc 15 0.00% mô1c 15 0.00% thán 15 0.00% dc 15 0.00% richard 15 0.00% palu 15 0.00% chì 14 0.00% su’3a 14 0.00% ulianovxco 14 0.00% rô 14 0.00% va(5n 14 0.00% virus 14 0.00% nê1m 14 0.00% tâ5u 14 0.00% ngâ3n 14 0.00% loát 14 0.00% nghé 14 0.00% trút 14 0.00% su'3ng 14 0.00% cm 14 0.00% kremlin 14 0.00% (25 14 0.00% mìn 14 0.00% rái 14 0.00% óng 14 0.00% ddút 14 0.00% guam

14 0.00% chô5p 14 0.00% vnexpress 14 0.00% gài 14 0.00% xu'o'1ng 14 0.00% xòe 14 0.00% sheldon 14 0.00% phô2ng 14 0.00% nhích 14 0.00% paul 14 0.00% email 14 0.00% ngu5c 14 0.00% chu’1a 14 0.00% ngách 14 0.00% bít 14 0.00% tdtt 14 0.00% hbsag 14 0.00% lu4y 14 0.00% nghiã 14 0.00% kiê2m 14 0.00% quâ5y 14 0.00% r 14 0.00% re 14 0.00% chéo 14 0.00% 72 14 0.00% su5t 14 0.00% hùm 14 0.00% m2 14 0.00% loay 14 0.00% giâ5m 14 0.00% tro5c 14 0.00% tuô5c 14 0.00% va(5t 14 0.00% vèo 14 0.00% (iran 14 0.00% hoay 14 0.00% tráo 14 0.00% qua(3ng 14 0.00% bu'2a 14 0.00% ba(n 14 0.00% hô3ng 14 0.00% fan 14 0.00% nhu'o'4ng 14 0.00% loáng 14 0.00% 1968 14 0.00% international 14 0.00% 130 14 0.00% 350 14 0.00% khâm 14 0.00% thâ1u 14 0.00% 14 0.00% tâ1u 14 0.00% genève

14 0.00% 86 14 0.00% ddô5n 14 0.00% chê5ch 14 0.00% cairo 14 0.00% rocket 14 0.00% nhím 14 0.00% nài 14 0.00% nguyê5t 14 0.00% mc 14 0.00% hì 14 0.00% phalin 14 0.00% vâ1p 14 0.00% reagan 14 0.00% peru 14 0.00% loa 14 0.00% chài 14 0.00% rùng 14 0.00% thu5t 14 0.00% nga(1t 13 0.00% ngâ1m 13 0.00% ho 13 0.00% jenny 13 0.00% no' 13 0.00% laser 13 0.00% microsoft 13 0.00% pho'3 13 0.00% (xã 13 0.00% thuy5 13 0.00% basmati 13 0.00% harry 13 0.00% chíp 13 0.00% lê1t 13 0.00% ô1i 13 0.00% khanh 13 0.00% inc 13 0.00% uô1n 13 0.00% norquist 13 0.00% (ngày 13 0.00% taleban 13 0.00% cambodia 13 0.00% sét 13 0.00% câm 13 0.00% sên 13 0.00% keo 13 0.00% nga5n 13 0.00% bùm 13 0.00% ca(2n 13 0.00% ddát 13 0.00% teo 13 0.00% 1988 13 0.00% podolski 13 0.00% johnson

13 0.00% euro 13 0.00% ngu5y 13 0.00% the5n 13 0.00% 1967 13 0.00% schneider 13 0.00% lahm 13 0.00% (11 13 0.00% oliver 13 0.00% nã 13 0.00% zambrotta 13 0.00% bui 13 0.00% ngâ4m 13 0.00% nhao 13 0.00% rà 13 0.00% ng 13 0.00% nhô2i 13 0.00% tokyo 13 0.00% thuý 13 0.00% xô5n 13 0.00% nhiê1p 13 0.00% da(5t 13 0.00% lóng 13 0.00% phao 13 0.00% thâ3n 13 0.00% hindu 13 0.00% la(5p 13 0.00% virút 13 0.00% (qua3ng 13 0.00% ngùi 13 0.00% lu’u 13 0.00% ddu’1ng 13 0.00% american 13 0.00% 220 13 0.00% taliban 13 0.00% (bô5 13 0.00% luyê1n 13 0.00% tiê2u 13 0.00% bddhq 13 0.00% tho3i 13 0.00% hannah 13 0.00% cu’3 13 0.00% no'4 13 0.00% thô1n 13 0.00% hu’o’1ng 13 0.00% so’3 13 0.00% toa3n 13 0.00% ngóng 13 0.00% mô2ng 13 0.00% 77 13 0.00% ddbscl 13 0.00% o' 13 0.00% dô5t

13 0.00% cu'5a 13 0.00% tu’1 13 0.00% bo'5 13 0.00% mu'o'5t 13 0.00% giun 13 0.00% o3i 13 0.00% nv1 13 0.00% time 13 0.00% champions 13 0.00% úp 13 0.00% 13 0.00% ê1 13 0.00% châ2u 13 0.00% chu' 13 0.00% tuô5t 13 0.00% do'i 13 0.00% liêng 13 0.00% pmu 13 0.00% ” 13 0.00% lâ2u 13 0.00% ldd 13 0.00% preston 13 0.00% uma 12 0.00% 82 12 0.00% xát 12 0.00% (hoa 12 0.00% (4 12 0.00% chiên 12 0.00% arafat 12 0.00% khuê 12 0.00% (công 12 0.00% then 12 0.00% ten 12 0.00% ml 12 0.00% (tên 12 0.00% tro'1 12 0.00% rico 12 0.00% leach 12 0.00% visa 12 0.00% philips 12 0.00% và… 12 0.00% nê1n 12 0.00% lu'4 12 0.00% soon 12 0.00% liverpool 12 0.00% cóp 12 0.00% thuram 12 0.00% kông 12 0.00% university 12 0.00% ru3a 12 0.00% (gio'2 12 0.00% my

12 0.00% 1992 12 0.00% ru'o'2ng 12 0.00% teddy 12 0.00% ha(1c 12 0.00% carter 12 0.00% ohio 12 0.00% choai 12 0.00% kha 12 0.00% brown 12 0.00% si3 12 0.00% 84 12 0.00% vu5n 12 0.00% riê1t 12 0.00% tnhh 12 0.00% xô1p 12 0.00% cô2n 12 0.00% wendy 12 0.00% dick 12 0.00% u'2 12 0.00% ginting 12 0.00% la(3ng 12 0.00% da5c 12 0.00% penalty 12 0.00% oánh 12 0.00% hé 12 0.00% i3 12 0.00% vá 12 0.00% quâ1t 12 0.00% 78 12 0.00% tho'1t 12 0.00% bo’3i 12 0.00% hu'1c 12 0.00% phiá 12 0.00% tê2 12 0.00% ddùi 12 0.00% sâ2m 12 0.00% ru5t 12 0.00% nháy 12 0.00% ghe3 12 0.00% 66 12 0.00% khang 12 0.00% trót 12 0.00% mo'1 12 0.00% cu’5u 12 0.00% rê1t 12 0.00% râ5n 12 0.00% váy 12 0.00% bôi 12 0.00% râm 12 0.00% gyanendra 12 0.00% 900 12 0.00% nhàn

12 0.00% (tu'o'2ng 12 0.00% lo'3 12 0.00% na(n 12 0.00% a3i 12 0.00% rèn 12 0.00% dãi 12 0.00% ráng 12 0.00% phình 12 0.00% lo5ng 12 0.00% va5 12 0.00% quyê1n 12 0.00% gu4i 12 0.00% go5ng 12 0.00% colombia 12 0.00% 1954 12 0.00% hu'3ng 12 0.00% bìa 12 0.00% liê1m 12 0.00% kíp 12 0.00% nách 12 0.00% jerry 12 0.00% (sài 12 0.00% eo 12 0.00% olympic 12 0.00% hu5c 12 0.00% hu'5c 12 0.00% du'a 12 0.00% ddo5a 12 0.00% berne 12 0.00% nga5o 12 0.00% cha5p 12 0.00% trô2 12 0.00% gerrard 12 0.00% 12 0.00% miê1u 12 0.00% quì 12 0.00% zawahiri 12 0.00% guô2ng 12 0.00% o’n 12 0.00% ríu 11 0.00% 1000 11 0.00% nghía 11 0.00% con… 11 0.00% musharraf 11 0.00% dda5m 11 0.00% ngán 11 0.00% dance 11 0.00% ròng 11 0.00% prodi 11 0.00% xu’3 11 0.00% giòng 11 0.00% chóp

11 0.00% su'1t 11 0.00% ho'1n 11 0.00% nom 11 0.00% me 11 0.00% ddiê1m 11 0.00% kê2m 11 0.00% thiê2n 11 0.00% valencia 11 0.00% ngác 11 0.00% nguy5 11 0.00% muô4ng 11 0.00% phàn 11 0.00% (dda5i 11 0.00% (thuô5c 11 0.00% trô4i 11 0.00% xui 11 0.00% to'i 11 0.00% nghe5t 11 0.00% nháp 11 0.00% gddtla 11 0.00% dâ4m 11 0.00% lác 11 0.00% robot 11 0.00% kèo 11 0.00% dnnn 11 0.00% hu4 11 0.00% horno 11 0.00% cali 11 0.00% mu'1t 11 0.00% rêu 11 0.00% hô5t 11 0.00% chát 11 0.00% dduô1c 11 0.00% bu'o'u 11 0.00% giò 11 0.00% (sô1 11 0.00% frank 11 0.00% vuông 11 0.00% laura 11 0.00% vê1 11 0.00% li5m 11 0.00% ì 11 0.00% su'4ng 11 0.00% mi5n 11 0.00% bo'2i 11 0.00% 87 11 0.00% shell 11 0.00% shia 11 0.00% que 11 0.00% ghraib 11 0.00% kiley 11 0.00% tro5t

11 0.00% saint 11 0.00% 74 11 0.00% siêng 11 0.00% ttxvn 11 0.00% suu 11 0.00% ndd 11 0.00% cu5t 11 0.00% lâ4m 11 0.00% léo 11 0.00% (nguyên 11 0.00% kyi 11 0.00% nhâ1m 11 0.00% tro'n 11 0.00% bessie 11 0.00% sâ1m 11 0.00% su'ng 11 0.00% lô2i 11 0.00% ngòi 11 0.00% okondor 11 0.00% ddanh 11 0.00% buô1t 11 0.00% fatah 11 0.00% khoanh 11 0.00% suýt 11 0.00% biê1ng 11 0.00% lóc 11 0.00% puerto 11 0.00% kê2nh 11 0.00% metzelder 11 0.00% sar 11 0.00% mark 11 0.00% ngao 11 0.00% lèo 11 0.00% (21 11 0.00% natalie 11 0.00% tv 11 0.00% lô1c 11 0.00% khét 11 0.00% ddiê1u 11 0.00% ' 11 0.00% (do 11 0.00% sôcôla 11 0.00% a(1p 11 0.00% chè 11 0.00% jeremy 11 0.00% càn 11 0.00% gu’3i 11 0.00% pa 11 0.00% gia(ng 11 0.00% takagi 11 0.00% va3y 11 0.00% aodun

11 0.00% ha5c 11 0.00% rô1ng 11 0.00% you 11 0.00% la5p 11 0.00% belarus 11 0.00% thâu 11 0.00% niêm 11 0.00% ho'2n 11 0.00% i4 11 0.00% di4a 11 0.00% nhóc 11 0.00% súp 11 0.00% mào 11 0.00% atang 11 0.00% ti3a 11 0.00% suông 11 0.00% khoáng 11 0.00% chùi 11 0.00% girl 11 0.00% hít 10 0.00% nghi4nh 10 0.00% nhàm 10 0.00% 181 10 0.00% uý 10 0.00% to'3m 10 0.00% camera 10 0.00% ho'3i 10 0.00% rights 10 0.00% bâ2n 10 0.00% action 10 0.00% ru 10 0.00% mõm 10 0.00% likud 10 0.00% 1947 10 0.00% nv3 10 0.00% phô4ng 10 0.00% bu’o’1c 10 0.00% tns 10 0.00% la(1t 10 0.00% choáng 10 0.00% morgan 10 0.00% tru’2 10 0.00% frankfurt 10 0.00% câ2y 10 0.00% thcs 10 0.00% game 10 0.00% sudan 10 0.00% darfur 10 0.00% xoa(1n 10 0.00% lình 10 0.00% netviet 10 0.00% nhúng

10 0.00% dennis 10 0.00% nu'4a… 10 0.00% stanford 10 0.00% 76 10 0.00% sea 10 0.00% tri4u 10 0.00% tb 10 0.00% diê4m 10 0.00% epson 10 0.00% ra5ch 10 0.00% ke4 10 0.00% hagl 10 0.00% ho'4i 10 0.00% morales 10 0.00% (tô3ng 10 0.00% uruguay 10 0.00% geneva 10 0.00% sô1c 10 0.00% baucus 10 0.00% châ1u 10 0.00% vú 10 0.00% nu’3a 10 0.00% show 10 0.00% khan 10 0.00% cho’3 10 0.00% timothée 10 0.00% (ddu'o'5c 10 0.00% nhu’o’4ng 10 0.00% kèn 10 0.00% gio'3 10 0.00% (thu'o'2ng 10 0.00% bayern 10 0.00% pan 10 0.00% street 10 0.00% dunga 10 0.00% 95 10 0.00% soro 10 0.00% xoài 10 0.00% giùm 10 0.00% tanh 10 0.00% bai 10 0.00% chirac 10 0.00% rày 10 0.00% rivera 10 0.00% bilic 10 0.00% hoi 10 0.00% lxl 10 0.00% kho’3i 10 0.00% lanh 10 0.00% nhõm 10 0.00% nhô5n 10 0.00% tha(3m

10 0.00% juventus 10 0.00% sistani 10 0.00% landis 10 0.00% bô3n 10 0.00% olmert 10 0.00% ddao 10 0.00% (ngu'o'2i 10 0.00% rè 10 0.00% kha(ng 10 0.00% nãy 10 0.00% ngo'5m 10 0.00% nhô1i 10 0.00% ma(5n 10 0.00% tha(1p 10 0.00% quây 10 0.00% dõng 10 0.00% nguô5i 10 0.00% a(ng 10 0.00% manuel 10 0.00% diana 10 0.00% lù 10 0.00% hormon 10 0.00% arena 10 0.00% allianz 10 0.00% chu'3ng 10 0.00% abc 10 0.00% chê1ch 10 0.00% chavez 10 0.00% sarah 10 0.00% chu'5c 10 0.00% friedrich 10 0.00% nhâ3y 10 0.00% gu'2 10 0.00% hollywood 10 0.00% so'4 10 0.00% xuê3 10 0.00% quack 10 0.00% moo 10 0.00% trê5 10 0.00% bruce 10 0.00% ddo'1p 10 0.00% (6 10 0.00% 89 10 0.00% tào 10 0.00% gheppettô 10 0.00% gâ4y 10 0.00% to'5n 10 0.00% tphcm 10 0.00% nhang 10 0.00% yahoo 10 0.00% jacquelyn 10 0.00% ma5

10 0.00% tép 10 0.00% 3000 10 0.00% vo’4 10 0.00% rúc 10 0.00% ct 10 0.00% ddít 10 0.00% marahute 10 0.00% síu 10 0.00% dde4o 10 0.00% bond 10 0.00% owen 10 0.00% bu'1o'c 10 0.00% gân 10 0.00% ta(5c 10 0.00% giaó 10 0.00% cu’3a 10 0.00% china 10 0.00% vu'o'5ng 10 0.00% sành 10 0.00% bâ2m 10 0.00% haifa 10 0.00% tdv 10 0.00% kiêng 10 0.00% dde4 10 0.00% lo’1p 10 0.00% la(1k 10 0.00% cqn 10 0.00% 1987 10 0.00% su’ 10 0.00% bangkok 10 0.00% 1966 10 0.00% khay 10 0.00% (wto 10 0.00% pha3 10 0.00% hù 10 0.00% sanchez 9 0.00% beiruth 9 0.00% câ5t 9 0.00% nho'3n 9 0.00% go’3i 9 0.00% blatter 9 0.00% gãi 9 0.00% hill 9 0.00% wata 9 0.00% thui 9 0.00% nín 9 0.00% vnch 9 0.00% là… 9 0.00% marinko 9 0.00% phô 9 0.00% ren 9 0.00% tru5i

9 0.00% (reuters 9 0.00% wayne 9 0.00% gandhi 9 0.00% chúi 9 0.00% châ2m 9 0.00% percy 9 0.00% vna 9 0.00% lô1 9 0.00% lu'5u 9 0.00% ha5m 9 0.00% joe 9 0.00% (sau 9 0.00% unesco 9 0.00% rosie 9 0.00% (30 9 0.00% center 9 0.00% yeltsin 9 0.00% nhúc 9 0.00% coalition 9 0.00% bính 9 0.00% dengue 9 0.00% pekerman 9 0.00% tu’o’ng 9 0.00% chiê3u 9 0.00% cuô2n 9 0.00% bu'3u 9 0.00% beach 9 0.00% xâ1p 9 0.00% chòi 9 0.00% quách 9 0.00% 73 9 0.00% oà 9 0.00% rajana 9 0.00% 1948 9 0.00% chan 9 0.00% 88 9 0.00% pv 9 0.00% két 9 0.00% sáp 9 0.00% kai 9 0.00% tuô2ng 9 0.00% vl 9 0.00% grove 9 0.00% michel 9 0.00% jake 9 0.00% sco 9 0.00% ehud 9 0.00% croatia 9 0.00% múc 9 0.00% (3 9 0.00% (24 9 0.00% ddung

9 0.00% sunni 9 0.00% nghê1ch 9 0.00% mo3m 9 0.00% khuâ1y 9 0.00% tru’5c 9 0.00% xào 9 0.00% pmu18 9 0.00% matxco'va 9 0.00% lu3i 9 0.00% cô5c 9 0.00% francisco 9 0.00% khâ1n 9 0.00% nham 9 0.00% irish 9 0.00% da(2ng 9 0.00% giô4 9 0.00% lemerre 9 0.00% 1950 9 0.00% (ddu'1c 9 0.00% lì 9 0.00% tuô1t 9 0.00% pfister 9 0.00% rô4i 9 0.00% y3 9 0.00% virut 9 0.00% (pntr 9 0.00% phung 9 0.00% nha(2n 9 0.00% ru'o'5t 9 0.00% khóm 9 0.00% ngu'1a 9 0.00% mách 9 0.00% gò 9 0.00% ni5nh 9 0.00% hu'o'1c 9 0.00% (22 9 0.00% mamy 9 0.00% ustr 9 0.00% 1962 9 0.00% nhoi 9 0.00% gâ1m 9 0.00% taxi 9 0.00% thiêm 9 0.00% luô5c 9 0.00% ha(1t 9 0.00% poseidon 9 0.00% alabama 9 0.00% marshall 9 0.00% state 9 0.00% (17 9 0.00% (29 9 0.00% vía

9 0.00% koroman 9 0.00% hòang 9 0.00% khuân 9 0.00% hkd 9 0.00% pgs 9 0.00% latinh 9 0.00% saviola 9 0.00% (còn 9 0.00% kè 9 0.00% quàng 9 0.00% honda 9 0.00% hatter 9 0.00% áng 9 0.00% life 9 0.00% uzbekistan 9 0.00% lhasa 9 0.00% 1960 9 0.00% angela 9 0.00% petit 9 0.00% truô2ng 9 0.00% lún 9 0.00% cu’o’2ng 9 0.00% quýt 9 0.00% nu'o'2m 9 0.00% ki4 9 0.00% vu'o'5n 9 0.00% tho'2n 9 0.00% bo'n 9 0.00% nu'o'5p 9 0.00% jong 9 0.00% nóc 9 0.00% ddttsg 9 0.00% ro’2i 9 0.00% law 9 0.00% xe3ng 9 0.00% ù 9 0.00% lu' 9 0.00% 180 9 0.00% ddi4 9 0.00% hattie 9 0.00% sawaco 9 0.00% hoong 9 0.00% tara 9 0.00% rò 9 0.00% cáu 9 0.00% coóc 9 0.00% so5 9 0.00% le3n 9 0.00% cu'u 9 0.00% martin 9 0.00% dìu 9 0.00% 99

9 0.00% du'o'5t 9 0.00% (mà 9 0.00% nhâ1p 9 0.00% vù 9 0.00% nannup 9 0.00% group 9 0.00% vâ3n 9 0.00% ngô1n 9 0.00% chiêm 9 0.00% lenin 9 0.00% 118 9 0.00% câ1y 9 0.00% cstq 9 0.00% ôtô 9 0.00% eric 9 0.00% shamil 9 0.00% ri 9 0.00% (trên 9 0.00% sì 9 0.00% 235 9 0.00% kheng 9 0.00% kí 9 0.00% ém 9 0.00% (the 8 0.00% rén 8 0.00% today 8 0.00% chuô1c 8 0.00% châ5p 8 0.00% rón 8 0.00% hori 8 0.00% (ba 8 0.00% thomas 8 0.00% níu 8 0.00% ra(n 8 0.00% christian 8 0.00% lu 8 0.00% nhoáng 8 0.00% cha3o 8 0.00% cátxim 8 0.00% tu'o'5c 8 0.00% phô3i 8 0.00% (tiê2n 8 0.00% 8 0.00% hâ3m 8 0.00% 500dd 8 0.00% (26 8 0.00% hông 8 0.00% osama 8 0.00% (q 8 0.00% 000m3 8 0.00% ngáp 8 0.00% du5m

8 0.00% hòm 8 0.00% chô2i 8 0.00% bung 8 0.00% chác 8 0.00% ngu'o'1c 8 0.00% ho’i 8 0.00% (10 8 0.00% ukraina 8 0.00% kate 8 0.00% nancy 8 0.00% dda(5n 8 0.00% koizumi 8 0.00% win 8 0.00% quít 8 0.00% nga(5t 8 0.00% dí 8 0.00% ra(1m 8 0.00% ben 8 0.00% lia 8 0.00% photocopy 8 0.00% nha3n 8 0.00% ngo5ai 8 0.00% bo'3 8 0.00% ngu'o'5ng 8 0.00% (ba(1c 8 0.00% ngo'1t 8 0.00% viettel 8 0.00% ptqd 8 0.00% xay 8 0.00% (28 8 0.00% rót 8 0.00% day 8 0.00% xh 8 0.00% u'o'i 8 0.00% tu3ng 8 0.00% ngáy 8 0.00% garden 8 0.00% ddu'o'2i 8 0.00% thìa 8 0.00% zuleyka 8 0.00% sa5p 8 0.00% 2x 8 0.00% ngu'u 8 0.00% 140 8 0.00% singh 8 0.00% úng 8 0.00% u3 8 0.00% ti5nh 8 0.00% national 8 0.00% 1952 8 0.00% loãng 8 0.00% domenech

8 0.00% heroin 8 0.00% mcgirk 8 0.00% sbs 8 0.00% meo 8 0.00% ddoa3n 8 0.00% uyê3n 8 0.00% giãy 8 0.00% city 8 0.00% murray 8 0.00% teng 8 0.00% sacombank 8 0.00% vc 8 0.00% piê1p 8 0.00% scott 8 0.00% vùn 8 0.00% gong 8 0.00% jazz 8 0.00% ru'1t 8 0.00% 79 8 0.00% mickey 8 0.00% (ddh 8 0.00% xi3a 8 0.00% oang 8 0.00% qdd 8 0.00% ddiê1c 8 0.00% wales 8 0.00% gù 8 0.00% (8 8 0.00% thuyên 8 0.00% elizondo 8 0.00% stephen 8 0.00% ngu’o’5c 8 0.00% du’5a 8 0.00% isser 8 0.00% suy5t 8 0.00% bu'o'3i 8 0.00% (tháng 8 0.00% italia 8 0.00% quê5 8 0.00% burns 8 0.00% chi4a 8 0.00% yasser 8 0.00% tòi 8 0.00% co’n 8 0.00% nhe 8 0.00% dìm 8 0.00% 560 8 0.00% thô1 8 0.00% globe 8 0.00% â3u 8 0.00% 1984 8 0.00% smith

8 0.00% 125 8 0.00% sê1p 8 0.00% rát 8 0.00% hô3i 8 0.00% gô2ng 8 0.00% om 8 0.00% royce 8 0.00% nhè 8 0.00% khodorkovsky 8 0.00% hispanic 8 0.00% 113 8 0.00% ms 8 0.00% relations 8 0.00% loang 8 0.00% ri4 8 0.00% mike 8 0.00% katrina 8 0.00% liê5ng 8 0.00% net 8 0.00% mô1ng 8 0.00% sào 8 0.00% keng 8 0.00% cu4i 8 0.00% bob 8 0.00% thoa5t 8 0.00% du’o’ng 8 0.00% táp 8 0.00% ngâ3ng 8 0.00% su'5c 8 0.00% suôn 8 0.00% mo'1m 8 0.00% sùng 8 0.00% (gia3m 8 0.00% xúp 8 0.00% (vì 8 0.00% mennonite 8 0.00% ralph 8 0.00% (phút 8 0.00% manager 8 0.00% khoa3nh 8 0.00% nixon 8 0.00% phai3 8 0.00% cnn 8 0.00% nép 8 0.00% lìa 8 0.00% hu'2ng 8 0.00% ttg 8 0.00% â1u 8 0.00% alzheimer 8 0.00% náu 8 0.00% morrison 8 0.00% ra3

8 0.00% phác 8 0.00% phò 8 0.00% nha(2ng 8 0.00% kashmir 8 0.00% mo 8 0.00% (mez 8 0.00% wolbachia 8 0.00% toát 8 0.00% condolezza 8 0.00% (saprissa 8 0.00% luis 8 0.00% nâ2n 8 0.00% club 8 0.00% tr 8 0.00% asia 8 0.00% 108 8 0.00% ngông 8 0.00% bo’1t 8 0.00% chông 8 0.00% koehler 8 0.00% 787 8 0.00% perfume 8 0.00% héo 8 0.00% lin 8 0.00% chimp 8 0.00% mercedes 8 0.00% suv 8 0.00% quy5 8 0.00% inh 8 0.00% georgia 8 0.00% (các 8 0.00% abbas 8 0.00% mad 8 0.00% chày 8 0.00% lã 8 0.00% ngô3n 8 0.00% nao 8 0.00% va(3ng 8 0.00% ngai 8 0.00% qúa 8 0.00% ta5nh 8 0.00% togo 8 0.00% bo5t 8 0.00% rô2i… 8 0.00% cô5 8 0.00% (vo'1i 8 0.00% puppy 8 0.00% ddo’5t 8 0.00% crespo 8 0.00% 7 8 0.00% nghiê2n 8 0.00% http

8 0.00% hugo 8 0.00% mandela 8 0.00% pw 8 0.00% le3o 8 0.00% (con 8 0.00% 1985 8 0.00% freedom 8 0.00% du’o’1i 8 0.00% santos 8 0.00% jerusalem 7 0.00% da(m 7 0.00% xi3u 7 0.00% chiê2n 7 0.00% (eu 7 0.00% tuâ1t 7 0.00% fair 7 0.00% che5t 7 0.00% kén 7 0.00% ra(5ng 7 0.00% (gia 7 0.00% tho'1i 7 0.00% aung 7 0.00% cholesterol 7 0.00% thiu 7 0.00% thày 7 0.00% mci 7 0.00% rabin 7 0.00% kadima 7 0.00% 1981 7 0.00% tze 7 0.00% 102 7 0.00% òa 7 0.00% faz 7 0.00% manmohan 7 0.00% 105 7 0.00% duô4i 7 0.00% islamabad 7 0.00% xó 7 0.00% ro’i 7 0.00% opec 7 0.00% um 7 0.00% jimmy 7 0.00% helms 7 0.00% ethanol 7 0.00% bô5p 7 0.00% fax 7 0.00% (hoa(5c 7 0.00% na(1n 7 0.00% bch 7 0.00% luô2n 7 0.00% mba 7 0.00% kuwait

7 0.00% pin 7 0.00% mo5t 7 0.00% qùy 7 0.00% west 7 0.00% ch 7 0.00% ru'o'5i 7 0.00% ray 7 0.00% so’1m 7 0.00% toét 7 0.00% goodlathe 7 0.00% 1959 7 0.00% bilis 7 0.00% lè 7 0.00% lo'5 7 0.00% ddèo 7 0.00% kcx 7 0.00% reinado 7 0.00% (chính 7 0.00% (evn 7 0.00% oái 7 0.00% o'2 7 0.00% huênh 7 0.00% gô5i 7 0.00% mikel 7 0.00% qui3 7 0.00% te5o 7 0.00% lampard 7 0.00% 8255 7 0.00% ru'ng 7 0.00% dùm 7 0.00% united 7 0.00% (9 7 0.00% rú 7 0.00% (33 7 0.00% cho’2 7 0.00% bvd 7 0.00% thu’3 7 0.00% ks 7 0.00% ngoa(5t 7 0.00% lhtn 7 0.00% nôi 7 0.00% mcloughlin 7 0.00% hk 7 0.00% les 7 0.00% beveren 7 0.00% foundation 7 0.00% cddnvtd 7 0.00% ru'5a 7 0.00% eriksson 7 0.00% hô3n 7 0.00% fbi 7 0.00% chanh

7 0.00% so5c 7 0.00% kê1ch 7 0.00% ngóc 7 0.00% nhác 7 0.00% chxhcn 7 0.00% 777 7 0.00% mía 7 0.00% juan 7 0.00% liê1ng 7 0.00% nhuô5m 7 0.00% têm 7 0.00% bê1t 7 0.00% thím 7 0.00% cu'a 7 0.00% lu'4ng 7 0.00% chuôi 7 0.00% th 7 0.00% 1946 7 0.00% nha(4n 7 0.00% vê5n 7 0.00% (nhu'4ng 7 0.00% ls 7 0.00% hâ5m 7 0.00% ella 7 0.00% nsu't 7 0.00% la5ch 7 0.00% michigan 7 0.00% (18 7 0.00% boris 7 0.00% tha(2n 7 0.00% loretta 7 0.00% tu3y 7 0.00% nhút 7 0.00% qlvnch 7 0.00% ngoa(2n 7 0.00% cuô5i 7 0.00% kilomet 7 0.00% wilson 7 0.00% nu'5c 7 0.00% neo 7 0.00% bu'1t 7 0.00% xo'i 7 0.00% le4o 7 0.00% dda(1k 7 0.00% is 7 0.00% ruili 7 0.00% akram 7 0.00% cóng 7 0.00% vu’o’5t 7 0.00% drogba 7 0.00% tiger 7 0.00% xu’1

7 0.00% tru'o'5ng 7 0.00% so5t 7 0.00% nhu'… 7 0.00% du’5ng 7 0.00% nè… 7 0.00% dâ1y 7 0.00% ngoa3nh 7 0.00% taekwondo 7 0.00% nòi 7 0.00% ngâ4u 7 0.00% nu’1t 7 0.00% khít 7 0.00% hòan 7 0.00% ga(ng 7 0.00% ddo5 7 0.00% nhe5m 7 0.00% lu'o'n 7 0.00% vietnam4all 7 0.00% bussiness 7 0.00% nhàu 7 0.00% (o'3 7 0.00% bi5nh 7 0.00% què 7 0.00% ddo’2i 7 0.00% grosso 7 0.00% ddo’5i 7 0.00% cho'1i 7 0.00% tâ1c 7 0.00% 1958 7 0.00% ngói 7 0.00% ulkraine 7 0.00% cph 7 0.00% sá 7 0.00% vu'o'3ng 7 0.00% rodriguez 7 0.00% 1972 7 0.00% 1983 7 0.00% rang 7 0.00% râ3y 7 0.00% der 7 0.00% south 7 0.00% christopher 7 0.00% daily 7 0.00% miguel 7 0.00% 18h00 7 0.00% xô5c 7 0.00% du'3ng 7 0.00% ddóa 7 0.00% trau 7 0.00% ngoa(5c 7 0.00% na(5n 7 0.00% kwh

7 0.00% italy 7 0.00% xxi 7 0.00% bô2ng 7 0.00% lester 7 0.00% lâ1m 7 0.00% xi5t 7 0.00% walter 7 0.00% 230 7 0.00% ho'2 7 0.00% phê1t 7 0.00% werder 7 0.00% te3 7 0.00% gô5p 7 0.00% ta3o 7 0.00% phào 7 0.00% asha 7 0.00% lép 7 0.00% xo3 7 0.00% 1m70 7 0.00% il 7 0.00% ford 7 0.00% (23 7 0.00% thê2m 7 0.00% bu5c 7 0.00% greenberg 7 0.00% simao 7 0.00% (bbc 7 0.00% 7 0.00% seoul 7 0.00% nung 7 0.00% bô5n 7 0.00% dda(1m 7 0.00% nháo 7 0.00% (ông 7 0.00% vina 7 0.00% mi3a 7 0.00% scolari 7 0.00% shabak 7 0.00% tu3a 7 0.00% tua 7 0.00% (anh 7 0.00% colin 7 0.00% harel 7 0.00% nazan 7 0.00% so’ 7 0.00% d' 7 0.00% 68 7 0.00% gates 7 0.00% tu’o’3ng 7 0.00% vo'i 7 0.00% khmer 7 0.00% ra3nh

7 0.00% 2020 7 0.00% khàn 7 0.00% riquelme 7 0.00% giê1m 7 0.00% 21h00 7 0.00% cristiano 7 0.00% boston 7 0.00% (bán 7 0.00% sâ1y 7 0.00% mertesacker 7 0.00% tô5t 7 0.00% dim 7 0.00% basa 6 0.00% cpc 6 0.00% vè 6 0.00% ðo3 6 0.00% fsb 6 0.00% ran 6 0.00% ingushetia 6 0.00% kamaz 6 0.00% li5nh 6 0.00% daniel 6 0.00% mnc 6 0.00% gelsenkirchen 6 0.00% roberto 6 0.00% jens 6 0.00% hastert 6 0.00% quality 6 0.00% kaesong 6 0.00% whitney 6 0.00% pratt 6 0.00% pasteur 6 0.00% max 6 0.00% tu’o’5ng 6 0.00% out 6 0.00% bgk 6 0.00% 131 6 0.00% sa(1n 6 0.00% prambanan 6 0.00% entertainment 6 0.00% thót 6 0.00% go’4 6 0.00% ba5 6 0.00% bùa 6 0.00% cu’ 6 0.00% 550 6 0.00% phillip 6 0.00% ve3n 6 0.00% lichtenstein 6 0.00% carmona 6 0.00% vitamin 6 0.00% ghe

6 0.00% thu’o’3ng 6 0.00% 138 6 0.00% kuala 6 0.00% paulson 6 0.00% (thái 6 0.00% xâ2m 6 0.00% khê 6 0.00% ma(m 6 0.00% 360 6 0.00% (nay 6 0.00% ddo'2 6 0.00% 98 6 0.00% 96 6 0.00% (tu'o'ng 6 0.00% syrie 6 0.00% 260 6 0.00% chu'o'1ng 6 0.00% u’o’1c 6 0.00% nv 6 0.00% oscar 6 0.00% ba3nh 6 0.00% livni 6 0.00% hun 6 0.00% abkhazia 6 0.00% jakarta 6 0.00% (time 6 0.00% delhi 6 0.00% chypre 6 0.00% pacông 6 0.00% brussels 6 0.00% rala(ng 6 0.00% adoption 6 0.00% wong 6 0.00% lanka 6 0.00% heng 6 0.00% xén 6 0.00% mão 6 0.00% múi 6 0.00% su’1 6 0.00% powell 6 0.00% vãi 6 0.00% sri 6 0.00% ngo'5p 6 0.00% ton 6 0.00% lisa 6 0.00% 5000 6 0.00% online 6 0.00% (nhà 6 0.00% ronald 6 0.00% jacques 6 0.00% bar 6 0.00% gót

6 0.00% oslo 6 0.00% cu'3i 6 0.00% rinh 6 0.00% kgb 6 0.00% nobel 6 0.00% suez 6 0.00% sam 6 0.00% vó 6 0.00% that 6 0.00% giòn 6 0.00% lòi 6 0.00% giông 6 0.00% bái 6 0.00% ok 6 0.00% lét 6 0.00% ngoi 6 0.00% sinai 6 0.00% tel 6 0.00% aviv 6 0.00% ddiê1ng 6 0.00% cnxh 6 0.00% 91 6 0.00% thì… 6 0.00% da5t 6 0.00% basten 6 0.00% ngòai 6 0.00% 'có' 6 0.00% 'không' 6 0.00% cho'2n 6 0.00% antonio 6 0.00% slna 6 0.00% (20 6 0.00% ddai5 6 0.00% oi 6 0.00% diên 6 0.00% na(5c 6 0.00% 6 0.00% giâu 6 0.00% mafia 6 0.00% nhen 6 0.00% (tây 6 0.00% che5n 6 0.00% bi5a 6 0.00% larry 6 0.00% ngu'2o'i 6 0.00% trent 6 0.00% met 6 0.00% lùa 6 0.00% bèo 6 0.00% smartypants 6 0.00% contra 6 0.00% g7

6 0.00% mâ5n 6 0.00% (nhâ5t 6 0.00% ostrava 6 0.00% (cô5ng 6 0.00% totti 6 0.00% (04 6 0.00% sô5 6 0.00% tuông 6 0.00% toni 6 0.00% to5t 6 0.00% rô4ng 6 0.00% davis 6 0.00% river 6 0.00% xuâ3n 6 0.00% (xxi 6 0.00% róm 6 0.00% cheo 6 0.00% (tho'2i 6 0.00% chicago 6 0.00% nê1t 6 0.00% ra3o 6 0.00% vâ2ng 6 0.00% nâ3y 6 0.00% ismail 6 0.00% (ap 6 0.00% psa 6 0.00% rì 6 0.00% (nam 6 0.00% vnah 6 0.00% da(3ng 6 0.00% vu'5a 6 0.00% cô2ng 6 0.00% sê1n 6 0.00% diê1p 6 0.00% bìu 6 0.00% biê1c 6 0.00% võng 6 0.00% sê 6 0.00% us 6 0.00% tho5c 6 0.00% a(4m 6 0.00% ngáo 6 0.00% ngô3 6 0.00% quito 6 0.00% tròng 6 0.00% 25dd 6 0.00% rossi 6 0.00% ha(2n 6 0.00% trâ4m 6 0.00% hâ1n 6 0.00% bõ 6 0.00% normal

6 0.00% ktm 6 0.00% 3495 6 0.00% thoa(1t 6 0.00% liê1c 6 0.00% crouch 6 0.00% khu'3 6 0.00% ma3 6 0.00% audi 6 0.00% paulo 6 0.00% z 6 0.00% tru 6 0.00% u3n 6 0.00% (câ1p 6 0.00% williams 6 0.00% bayer 6 0.00% ba(m 6 0.00% kali 6 0.00% beer 6 0.00% kara 6 0.00% vgsv 6 0.00% thi3u 6 0.00% ga(1p 6 0.00% ngoãn 6 0.00% váng 6 0.00% nâ5u 6 0.00% thiê3n 6 0.00% mulally 6 0.00% petro 6 0.00% 737 6 0.00% iss 6 0.00% ti5t 6 0.00% nhu4 6 0.00% (13 6 0.00% ubtvqh 6 0.00% huýt 6 0.00% jr 6 0.00% campell 6 0.00% dduo'5c 6 0.00% ahmed 6 0.00% (16 6 0.00% lóe 6 0.00% toronto 6 0.00% (hai 6 0.00% ú 6 0.00% a(1t 6 0.00% csg 6 0.00% quota 6 0.00% se5o 6 0.00% bo'1 6 0.00% ky 6 0.00% ffa 6 0.00% buffon

6 0.00% tùm 6 0.00% (19 6 0.00% ngoèo 6 0.00% khiê1t 6 0.00% hê3n 6 0.00% vuong 6 0.00% db 6 0.00% sa3nh 6 0.00% anastasia 6 0.00% han 6 0.00% gus 6 0.00% qúôc 6 0.00% thít 6 0.00% uô3ng 6 0.00% leslie 6 0.00% ddán 6 0.00% vun 6 0.00% luô1ng 6 0.00% disney 6 0.00% gia(1t 6 0.00% tomas 6 0.00% gâ4m 6 0.00% church 6 0.00% sydney 6 0.00% dô1p 6 0.00% (thay 6 0.00% simi 6 0.00% cu5i 6 0.00% chang 6 0.00% ddia5 6 0.00% kình 6 0.00% gu'2ng 6 0.00% vát 6 0.00% ca(5m 6 0.00% xô1c 6 0.00% díu 6 0.00% 11m 6 0.00% mùng 6 0.00% du'2a 6 0.00% trìu 6 0.00% (qua 6 0.00% nuo'1c 6 0.00% (alajuela 6 0.00% mirzapour 6 0.00% kone 6 0.00% wanchope 6 0.00% a5ch 6 0.00% persie 6 0.00% hoe 6 0.00% xo'1i 6 0.00% tizie 6 0.00% vietnamnet

6 0.00% trâ3y 6 0.00% (sv 6 0.00% douglas 6 0.00% ddun 6 0.00% 92 6 0.00% thúng 6 0.00% miêu 6 0.00% chiê1t 6 0.00% thabet 6 0.00% xi5n 6 0.00% yorker 6 0.00% kosovo 6 0.00% xuô2ng 6 0.00% ba(5t 6 0.00% bo'4 6 0.00% huê 6 0.00% ga(m 6 0.00% bi5ch 6 0.00% (tru'o'2ng 6 0.00% (chiê1m 6 0.00% (cao 6 0.00% agency 6 0.00% cô4i 6 0.00% huy5ch 6 0.00% cho'n 6 0.00% radio 6 0.00% harper 6 0.00% ptt 6 0.00% hollings 6 0.00% energy 6 0.00% pring 6 0.00% tíu 6 0.00% louisiana 6 0.00% he 6 0.00% safavian 6 0.00% ye 6 0.00% kidan 6 0.00% chô5t 6 0.00% va5t 6 0.00% bâ5y 6 0.00% intranet 6 0.00% ddi… 6 0.00% ui 6 0.00% kezman 6 0.00% jevric 6 0.00% 6 0.00% (khu 6 0.00% ken 6 0.00% 71 6 0.00% bmw 6 0.00% phích 6 0.00% cót

6 0.00% mòng 6 0.00% sùi 6 0.00% pacific 6 0.00% ivoire 6 0.00% thaksin 5 0.00% (permanent 5 0.00% larson 5 0.00% sâ2n 5 0.00% vít 5 0.00% 97 5 0.00% von 5 0.00% phóc 5 0.00% 320 5 0.00% have 5 0.00% pac 5 0.00% 1481 5 0.00% nho’2 5 0.00% ngùng 5 0.00% tong 5 0.00% thurman 5 0.00% damadola 5 0.00% suncruz 5 0.00% bajur 5 0.00% wolfowitz 5 0.00% miami 5 0.00% slovenia 5 0.00% sebastian 5 0.00% sisulu 5 0.00% róc 5 0.00% galang 5 0.00% ô3i 5 0.00% baldemor 5 0.00% human 5 0.00% ra5c 5 0.00% unocal 5 0.00% kazakhstan 5 0.00% dô2ng 5 0.00% mccain 5 0.00% (5 5 0.00% exxonmobil 5 0.00% reform 5 0.00% dê3 5 0.00% 619 5 0.00% negroponte 5 0.00% nsw 5 0.00% howard 5 0.00% maradona 5 0.00% frist 5 0.00% ghiê1c 5 0.00% lynn 5 0.00% hargreaves 5 0.00% grover

5 0.00% 7000 5 0.00% tâ3u 5 0.00% charles 5 0.00% ed 5 0.00% toanh 5 0.00% tía 5 0.00% rove 5 0.00% loã 5 0.00% policy 5 0.00% daucher 5 0.00% games 5 0.00% máng 5 0.00% nhún 5 0.00% albright 5 0.00% nghe4n 5 0.00% ních 5 0.00% clawson 5 0.00% mâ3y 5 0.00% carvalho 5 0.00% jean 5 0.00% bón 5 0.00% mó 5 0.00% oxfam 5 0.00% thê5 5 0.00% natanz 5 0.00% maoiste 5 0.00% kfc 5 0.00% ga5 5 0.00% chantha 5 0.00% 1934 5 0.00% la3ng 5 0.00% ngo'2i 5 0.00% tru5y 5 0.00% bún 5 0.00% thaung 5 0.00% slobodan 5 0.00% trâm 5 0.00% htun 5 0.00% cô5m 5 0.00% origami 5 0.00% hq 5 0.00% li5a 5 0.00% cô5p 5 0.00% le5t 5 0.00% gomez 5 0.00% czech 5 0.00% ramos 5 0.00% váo 5 0.00% lu’o’5c 5 0.00% di4nh 5 0.00% rangoon 5 0.00% jones

5 0.00% trô1 5 0.00% asem 5 0.00% parreira 5 0.00% jensen 5 0.00% vò 5 0.00% hitzlsperger 5 0.00% phu5t 5 0.00% mu'o'1t 5 0.00% sagnol 5 0.00% ddô4i 5 0.00% oa(2n 5 0.00% (st 5 0.00% nanh 5 0.00% kumar 5 0.00% co3i 5 0.00% réo 5 0.00% xiao 5 0.00% 202 5 0.00% ve5o 5 0.00% mo5ng 5 0.00% vái 5 0.00% gô5c 5 0.00% he3o 5 0.00% stankovic 5 0.00% mikhail 5 0.00% rule 5 0.00% hànô5i 5 0.00% (khi 5 0.00% leverkusen 5 0.00% porras 5 0.00% ma(1m 5 0.00% ddong 5 0.00% huyên 5 0.00% (05 5 0.00% tã 5 0.00% nhuyê4n 5 0.00% kilo 5 0.00% tày 5 0.00% uri 5 0.00% karan 5 0.00% (na(m 5 0.00% pirlo 5 0.00% karl 5 0.00% franz 5 0.00% ddóan 5 0.00% ghpgvntn 5 0.00% da(2n 5 0.00% 1969 5 0.00% truâ1t 5 0.00% pelz 5 0.00% susan 5 0.00% mi5ch

5 0.00% lu5ng 5 0.00% heitinga 5 0.00% sa(1ng 5 0.00% mu’2ng 5 0.00% chi5t 5 0.00% ra5 5 0.00% gallas 5 0.00% zinedine 5 0.00% rê2n 5 0.00% cha(2ng 5 0.00% (chu'a 5 0.00% borussia 5 0.00% malouda 5 0.00% hitler 5 0.00% kiê5u 5 0.00% lê3 5 0.00% thát 5 0.00% terry 5 0.00% quai 5 0.00% rome 5 0.00% rúng 5 0.00% xi3n 5 0.00% ro'5 5 0.00% làu 5 0.00% raymond 5 0.00% tra(2n 5 0.00% núng 5 0.00% yukos 5 0.00% ne3o 5 0.00% khui 5 0.00% tu5y 5 0.00% cáy 5 0.00% tháu 5 0.00% ngo 5 0.00% katmandu 5 0.00% nghê5n 5 0.00% hòe 5 0.00% duê5 5 0.00% gilberto 5 0.00% lu’5a 5 0.00% tiê4u 5 0.00% da(5c 5 0.00% (national 5 0.00% ho'2i 5 0.00% phi3nh 5 0.00% collect 5 0.00% tenet 5 0.00% qúy 5 0.00% phu’o’5ng 5 0.00% thè 5 0.00% thu’2a 5 0.00% amnesty

5 0.00% teheran 5 0.00% damascus 5 0.00% vu’4ng 5 0.00% nu'4u 5 0.00% me5… 5 0.00% gông 5 0.00% airasia 5 0.00% lumpur 5 0.00% qua(1p 5 0.00% marathon 5 0.00% er 5 0.00% bâ4m 5 0.00% tu5m 5 0.00% lo5ai 5 0.00% ngùn 5 0.00% ngu5t 5 0.00% melbourne 5 0.00% ho'1t 5 0.00% hê 5 0.00% diê4u 5 0.00% jayapura 5 0.00% lu’3a 5 0.00% nha(m 5 0.00% economist 5 0.00% dokdo 5 0.00% 238 5 0.00% nghê4u 5 0.00% tho’5 5 0.00% thon 5 0.00% (dda3ng 5 0.00% pho'1t 5 0.00% trezeguet 5 0.00% nd 5 0.00% thêu 5 0.00% xuyê1n 5 0.00% delta 5 0.00% idf 5 0.00% yale 5 0.00% nilon 5 0.00% râ2n 5 0.00% la(5t 5 0.00% 1953 5 0.00% fleming 5 0.00% xiêm 5 0.00% tõm 5 0.00% somalia 5 0.00% vâ3y 5 0.00% anthony 5 0.00% (canada 5 0.00% cbcc 5 0.00% 420 5 0.00% dream

5 0.00% (giá 5 0.00% ruê5 5 0.00% festival 5 0.00% nhá 5 0.00% tw 5 0.00% 114 5 0.00% 119 5 0.00% ca5o 5 0.00% card 5 0.00% free 5 0.00% (12 5 0.00% jag 5 0.00% lucifer 5 0.00% nl 5 0.00% nn 5 0.00% gala 5 0.00% cambridge 5 0.00% akbar 5 0.00% pleiku 5 0.00% diê5p 5 0.00% tréo 5 0.00% (se4 5 0.00% drizella 5 0.00% estrogen 5 0.00% (thu'5c 5 0.00% (quâ5n 5 0.00% ri5a 5 0.00% sonia 5 0.00% nmnth 5 0.00% phâ5p 5 0.00% media 5 0.00% ta5t 5 0.00% fullerton 5 0.00% cty 5 0.00% everett 5 0.00% (cu4ng 5 0.00% nga5nh 5 0.00% trô2i 5 0.00% vê5t 5 0.00% shrine 5 0.00% zim 5 0.00% chô5i 5 0.00% silva 5 0.00% phèn 5 0.00% (thanh 5 0.00% mi3 5 0.00% ethiopia 5 0.00% hop 5 0.00% piano 5 0.00% hip 5 0.00% su3ng 5 0.00% 145

5 0.00% emirates 5 0.00% shona 5 0.00% a3m 5 0.00% tu'o'3i 5 0.00% (ý 5 0.00% nôm 5 0.00% golf 5 0.00% (tru'o'1c 5 0.00% câ3u 5 0.00% evo 5 0.00% universal 5 0.00% wright 5 0.00% myanmar 5 0.00% mecca 5 0.00% toáng 5 0.00% (new 5 0.00% (nga 5 0.00% stalin 5 0.00% lddbdd 5 0.00% eboue 5 0.00% little 5 0.00% (ba3n 5 0.00% 1961 5 0.00% tóe 5 0.00% sa5t 5 0.00% buffy 5 0.00% nguôi 5 0.00% (ca3 5 0.00% tem 5 0.00% (go5i 5 0.00% sâ5y 5 0.00% vu'2ng 5 0.00% (vào 5 0.00% kiatisak 5 0.00% nha5nh 5 0.00% ddu3ng 5 0.00% (brazil 5 0.00% pha(ng 5 0.00% gdgt 5 0.00% cannavaro 5 0.00% makelele 5 0.00% tuyê2n 5 0.00% acb 5 0.00% kv 5 0.00% khâ5p 5 0.00% (vê2 5 0.00% o'dell 5 0.00% ……………… 5 0.00% 75dd 5 0.00% slogan 5 0.00% hia 5 0.00% nation

5 0.00% edge 5 0.00% nin 5 0.00% mè 5 0.00% (ddê3 5 0.00% 750 5 0.00% (và 5 0.00% cha5nh 5 0.00% google 5 0.00% rôn 5 0.00% phàng 5 0.00% do3m 5 0.00% (cho 5 0.00% baa 5 0.00% (dd 5 0.00% calorie 5 0.00% (ch 5 0.00% cddsp 5 0.00% nhâm 5 0.00% cluck 5 0.00% phu4 5 0.00% corporation 5 0.00% so3 5 0.00% (ddô2ng 5 0.00% baldemo 5 0.00% ddòan 5 0.00% (hô5i 5 0.00% dnt 5 0.00% 199 5 0.00% america 5 0.00% jiminy 5 0.00% u'1 5 0.00% it 5 0.00% (liên 5 0.00% ntu 5 0.00% space 5 0.00% va(1c 5 0.00% thé 5 0.00% dylan 5 0.00% vtv 5 0.00% 121 5 0.00% 280 5 0.00% dhl 5 0.00% na5o 5 0.00% cain 5 0.00% kho'2 5 0.00% anna 5 0.00% buttercup 5 0.00% oa5p 5 0.00% org 5 0.00% mohamed 5 0.00% (tru'2 5 0.00% kevin

5 0.00% craig 5 0.00% mu3i 5 0.00% qua(5c 5 0.00% mo'n 5 0.00% malaysiakini 5 0.00% ngoe 5 0.00% lõi 5 0.00% lõng 5 0.00% rìa 5 0.00% ro' 5 0.00% misa 5 0.00% on 5 0.00% no'1t 5 0.00% bi5u 5 0.00% nha3u 5 0.00% mâ1u 5 0.00% gâu 5 0.00% 1965 5 0.00% steven 5 0.00% 000m 5 0.00% randy 5 0.00% hamid 5 0.00% xô2m 5 0.00% cho5t 5 0.00% doãn 5 0.00% phô2n 4 0.00% toure 4 0.00% nistelrooy 4 0.00% (california 4 0.00% worldcup 4 0.00% 133 4 0.00% fabio 4 0.00% science 4 0.00% (arf 4 0.00% gavrancic 4 0.00% lagerbaeck 4 0.00% gô1m 4 0.00% zokora 4 0.00% (ddang 4 0.00% (cu4 4 0.00% thìn 4 0.00% hu'4ng 4 0.00% (phát 4 0.00% vietcombank 4 0.00% na5ng 4 0.00% sorin 4 0.00% boka 4 0.00% villa 4 0.00% chxhcnvn 4 0.00% one 4 0.00% (u'o'1c 4 0.00% khxh

4 0.00% corp 4 0.00% 4 0.00% 950 4 0.00% (ddây 4 0.00% valente 4 0.00% (an 4 0.00% horacio 4 0.00% tiago 4 0.00% petrolimex 4 0.00% manchester 4 0.00% 104 4 0.00% va(1n 4 0.00% mathijsen 4 0.00% bronckhorst 4 0.00% neville 4 0.00% fans 4 0.00% latin 4 0.00% (vi4nh 4 0.00% mu3 4 0.00% 000ha 4 0.00% bâ1p 4 0.00% hóc 4 0.00% xinhua 4 0.00% (râ1t 4 0.00% nadj 4 0.00% djordjevic 4 0.00% wall 4 0.00% (dn 4 0.00% chris 4 0.00% total 4 0.00% (quô1c 4 0.00% prudential 4 0.00% quáng 4 0.00% somkid 4 0.00% bommel 4 0.00% vanderwall 4 0.00% griles 4 0.00% (no'i 4 0.00% family 4 0.00% caucasus 4 0.00% tvc 4 0.00% (ma(5c 4 0.00% nho5 4 0.00% tro’2i 4 0.00% sin 4 0.00% 154 4 0.00% (viê5n 4 0.00% vu'à 4 0.00% whip 4 0.00% icc 4 0.00% 115 4 0.00% cunningham

4 0.00% kimmitt 4 0.00% tax 4 0.00% (international 4 0.00% kuznets 4 0.00% blue 4 0.00% distefano 4 0.00% business 4 0.00% el 4 0.00% pangandaran 4 0.00% (public 4 0.00% chalabi 4 0.00% (nhu'ng 4 0.00% lashkar 4 0.00% go'5n 4 0.00% public 4 0.00% shop 4 0.00% gardiner 4 0.00% (gia3i 4 0.00% (thu5y 4 0.00% kyat 4 0.00% dermalogica 4 0.00% yushchenko 4 0.00% mumbai 4 0.00% seymour 4 0.00% (miê2n 4 0.00% junichiro 4 0.00% alexei 4 0.00% cho’5 4 0.00% sgd 4 0.00% ddu'á 4 0.00% giu'5t 4 0.00% trainer 4 0.00% nhúm 4 0.00% b61 4 0.00% graham 4 0.00% siniora 4 0.00% tiê1u 4 0.00% mnchen 4 0.00% milan 4 0.00% hyun 4 0.00% kv1 4 0.00% fiorentina 4 0.00% franco 4 0.00% andrea 4 0.00% luanda 4 0.00% kurd 4 0.00% (tôi 4 0.00% qu3a 4 0.00% 1977 4 0.00% kyrgyzstan 4 0.00% zinha 4 0.00% roger

4 0.00% neubrandenburg 4 0.00% (ddiê2u 4 0.00% ntk 4 0.00% leipzig 4 0.00% (mô4i 4 0.00% mueller 4 0.00% hamburg 4 0.00% gamarra 4 0.00% valdez 4 0.00% kiessling 4 0.00% fonseca 4 0.00% centeno 4 0.00% nho'1t 4 0.00% tê3 4 0.00% goss 4 0.00% bêtông 4 0.00% ptnt 4 0.00% press 4 0.00% 2m 4 0.00% bea 4 0.00% (1998 4 0.00% tampa 4 0.00% kiê2n 4 0.00% gallon 4 0.00% 1m 4 0.00% 3m 4 0.00% 107 4 0.00% takeshima 4 0.00% pétrus 4 0.00% (kiên 4 0.00% mosoco 4 0.00% downer 4 0.00% e5p 4 0.00% clateman 4 0.00% mubarak 4 0.00% (nhâ1t 4 0.00% o5p 4 0.00% lukashenka 4 0.00% arbatov 4 0.00% act 4 0.00% bidong 4 0.00% bhxh 4 0.00% mohammed 4 0.00% (thôn 4 0.00% ddu’1a 4 0.00% carbon 4 0.00% (world 4 0.00% (dc 4 0.00% htv 4 0.00% ngoai5 4 0.00% people 4 0.00% organization

4 0.00% kho3an 4 0.00% cb 4 0.00% andrew 4 0.00% (pháp 4 0.00% super 4 0.00% lincoln 4 0.00% plaza 4 0.00% munoz 4 0.00% 1955 4 0.00% (loa5i 4 0.00% kh 4 0.00% nobuko 4 0.00% (bình 4 0.00% xoành 4 0.00% ex 4 0.00% girlfriend 4 0.00% superman 4 0.00% tantillo 4 0.00% cbs 4 0.00% (2005 4 0.00% wolf 4 0.00% des 4 0.00% ian 4 0.00% schwarzenegger 4 0.00% firket 4 0.00% (beirut 4 0.00% griffin 4 0.00% lom 4 0.00% séc 4 0.00% srifa 4 0.00% liberia 4 0.00% queo 4 0.00% north 4 0.00% doan 4 0.00% sieng 4 0.00% fox 4 0.00% review 4 0.00% asian 4 0.00% grassley 4 0.00% (washington 4 0.00% alan 4 0.00% (sydney 4 0.00% lynnphuong 4 0.00% 8g 4 0.00% lu5p 4 0.00% catherine 4 0.00% nhuâ2n 4 0.00% manga 4 0.00% 1971 4 0.00% who 4 0.00% trèm 4 0.00% ru'o'2m

4 0.00% tròm 4 0.00% chip 4 0.00% photo 4 0.00% ontario 4 0.00% nhe5t 4 0.00% kho3a 4 0.00% logo 4 0.00% (15 4 0.00% (hiê5n 4 0.00% (ha3i 4 0.00% systems 4 0.00% svtn 4 0.00% lamy 4 0.00% pascal 4 0.00% (làm 4 0.00% gaø 4 0.00% javier 4 0.00% centre 4 0.00% association 4 0.00% tvdddd 4 0.00% gien 4 0.00% (cu3a 4 0.00% ria 4 0.00% hddba 4 0.00% ttl 4 0.00% tm 4 0.00% (ddu'o'2ng 4 0.00% 210 4 0.00% barthez 4 0.00% gilardino 4 0.00% islands 4 0.00% category 4 0.00% 1m73 4 0.00% philippine 4 0.00% globebeauties 4 0.00% 380 4 0.00% armi 4 0.00% wayans 4 0.00% porsche 4 0.00% ferrari 4 0.00% 1m75 4 0.00% benz 4 0.00% cosplay 4 0.00% baldomir 4 0.00% robbie 4 0.00% madonna 4 0.00% cincinnati 4 0.00% parade 4 0.00% love 4 0.00% mo’4 4 0.00% bgh 4 0.00% contest

4 0.00% kurara 4 0.00% chibana 4 0.00% carolina 4 0.00% ddâ1ng 4 0.00% color 4 0.00% imaging 4 0.00% tand 4 0.00% su5 4 0.00% loé 4 0.00% hi3 4 0.00% ayala 4 0.00% noi 4 0.00% sofia 4 0.00% abbondanzieri 4 0.00% loà 4 0.00% phômai 4 0.00% thuy2 4 0.00% teen 4 0.00% ryan 4 0.00% giâý 4 0.00% ia 4 0.00% stanol 4 0.00% (thu3 4 0.00% du’o’5c 4 0.00% 200m 4 0.00% seattle 4 0.00% ddê3u 4 0.00% ariel 4 0.00% glebova 4 0.00% víu 4 0.00% mundhra 4 0.00% diê1m 4 0.00% ddái 4 0.00% tê4 4 0.00% doha 4 0.00% 106 4 0.00% (bê5nh 4 0.00% (ddã 4 0.00% a4 4 0.00% huntsville 4 0.00% b14 4 0.00% mississippi 4 0.00% giâ4m 4 0.00% 2018 4 0.00% café 4 0.00% ddác 4 0.00% perrotta 4 0.00% camoranesi 4 0.00% thierry 4 0.00% ebay 4 0.00% ddu'o'c 4 0.00% missing

4 0.00% (finale 4 0.00% niê5u 4 0.00% bhyt 4 0.00% phèo 4 0.00% • 4 0.00% play 4 0.00% oxytocine 4 0.00% philippin 4 0.00% phiê5n 4 0.00% 1930 4 0.00% mcnaught 4 0.00% bavik 4 0.00% min 4 0.00% point 4 0.00% 244 4 0.00% grondona 4 0.00% intifada 4 0.00% elenildo 4 0.00% miê2u 4 0.00% (châu 4 0.00% kha(1t 4 0.00% real 4 0.00% ashley 4 0.00% adb 4 0.00% uni 4 0.00% iowa 4 0.00% jeffrey 4 0.00% kirchner 4 0.00% vii 4 0.00% vieira 4 0.00% patrick 4 0.00% general 4 0.00% kaemi 4 0.00% h5n2 4 0.00% khoáy 4 0.00% chat 4 0.00% 103 4 0.00% hassan 4 0.00% iom 4 0.00% silicon 4 0.00% sellers 4 0.00% sãi 4 0.00% marquez 4 0.00% 650 4 0.00% lowy 4 0.00% papua 4 0.00% ibrahimovic 4 0.00% predator 4 0.00% kippour 4 0.00% (ddà 4 0.00% 1956 4 0.00% plo

4 0.00% economic 4 0.00% system 4 0.00% star 4 0.00% (bê1n 4 0.00% leonardo 4 0.00% marine 4 0.00% institute 4 0.00% lou 4 0.00% giô1c 4 0.00% oil 4 0.00% dan 4 0.00% ptsg 4 0.00% cùm 4 0.00% hu'1o'ng 4 0.00% 525 4 0.00% luiz 4 0.00% phi3 4 0.00% huyê5t 4 0.00% biê1m 4 0.00% foreign 4 0.00% pervez 4 0.00% (tô3 4 0.00% pei 4 0.00% rafael 4 0.00% with 4 0.00% edward 4 0.00% dà 4 0.00% dê2 4 0.00% du’ 4 0.00% còm 4 0.00% tuê1 4 0.00% mu’u 4 0.00% dong 4 0.00% ra(2m 4 0.00% 290 4 0.00% kilogram 4 0.00% so’5 4 0.00% sa3o 4 0.00% bâ5p 4 0.00% úy 4 0.00% nhu4i 4 0.00% do' 4 0.00% tu4m 4 0.00% me5o 4 0.00% nga(1c 4 0.00% khu'o'1c 4 0.00% du’o’5t 4 0.00% hách 4 0.00% ngu5m 4 0.00% sa5o 4 0.00% mích 4 0.00% nhó

4 0.00% nhâ3m 4 0.00% thu3i 4 0.00% píp 4 0.00% dâ2m 4 0.00% sum 4 0.00% toác 4 0.00% du5i 4 0.00% khu'5ng 4 0.00% dreamliner 4 0.00% ngâ1u 4 0.00% nhô5ng 4 0.00% chít 4 0.00% muô4m 4 0.00% o… 4 0.00% ro5i 4 0.00% ô2m 4 0.00% watt 4 0.00% sa3ng 4 0.00% ca5ch 4 0.00% mo'3n 4 0.00% leng 4 0.00% hu5i 4 0.00% chu’4 4 0.00% u’o’ng 4 0.00% nha3y… 4 0.00% xa5c 4 0.00% muhammad 4 0.00% na5 4 0.00% soland 4 0.00% xa(m 4 0.00% quyê5t 4 0.00% 175 4 0.00% raj 4 0.00% mu 4 0.00% tru’o’ng 4 0.00% lóp 4 0.00% la(4ng 4 0.00% po'2 4 0.00% toe 4 0.00% cheshire 4 0.00% bu'o'm 4 0.00% field 4 0.00% ngu'4a 4 0.00% du’o’2ng 4 0.00% tím… 4 0.00% xu'o'1c 4 0.00% hâ2y 4 0.00% ddo’n 4 0.00% ru’2ng 4 0.00% 1964 4 0.00% ted 4 0.00% bê5ch

4 0.00% et 4 0.00% xuýt 4 0.00% kampuchia 4 0.00% héc 4 0.00% mèn 4 0.00% xa… 4 0.00% i3n 4 0.00% tho 4 0.00% hê1n 4 0.00% ddô3ng 4 0.00% ca(2m 4 0.00% ro'5p 4 0.00% ta(2m 4 0.00% quâ4y 4 0.00% khom 4 0.00% li3nh 4 0.00% lu5a 4 0.00% no'm 4 0.00% xiêu 4 0.00% lu5y 4 0.00% ngo'1 4 0.00% thê1… 4 0.00% sính 4 0.00% ùa 4 0.00% vê2… 4 0.00% hi5ch 4 0.00% xoe 4 0.00% phùn 4 0.00% pho' 4 0.00% lâ1t 4 0.00% lu'2 4 0.00% dó 4 0.00% miê1t 4 0.00% ri3a 4 0.00% trê 4 0.00% que3 4 0.00% ra5n 4 0.00% lu3ng 4 0.00% ru'o'1i 4 0.00% sa3 4 0.00% ngu5p 4 0.00% nhô3m 4 0.00% chòm 4 0.00% ghê2nh 4 0.00% chùn 4 0.00% nghi3m 4 0.00% mâ3n 4 0.00% giúi 4 0.00% nghêu 4 0.00% do3ng 4 0.00% la5i… 4 0.00% chãi

4 0.00% kháo 4 0.00% lo'5p 4 0.00% to3i 4 0.00% du'1a 4 0.00% hoa(1t 4 0.00% thuô2ng 4 0.00% tre3o 4 0.00% tê1… 4 0.00% veo 4 0.00% panhpronen 4 0.00% xoong 4 0.00% qua(2n 4 0.00% ga3y 4 0.00% nguâ3y 4 0.00% cu'2 4 0.00% dda(2m 4 0.00% ro'1m 4 0.00% ki3 4 0.00% xtrômbôli 4 0.00% dông 4 0.00% thênh 4 0.00% tành 4 0.00% mu'o'2ng 4 0.00% 'cánh 4 0.00% nô5m 4 0.00% mu5n 4 0.00% ga(1m 4 0.00% dâ1m 4 0.00% mui 4 0.00% xém 4 0.00% tha3y 4 0.00% máo 4 0.00% xoáy 4 0.00% mê1u 4 0.00% jumbo 4 0.00% bu3a 4 0.00% la3o 4 0.00% tri5ch 4 0.00% 'yêu 4 0.00% gu'o'5ng 4 0.00% sê1u 4 0.00% này… 4 0.00% xuô1ng… 4 0.00% thu'4ng 4 0.00% kên 4 0.00% xòm 4 0.00% 'con 4 0.00% bu'5 4 0.00% nhe5p 4 0.00% ngoáy 4 0.00% a(1ng 4 0.00% jim

4 0.00% bruno 4 0.00% tuôn 4 0.00% ddùn 4 0.00% bét 4 0.00% nuô1i 4 0.00% ðông 4 0.00% beckett 4 0.00% quác 4 0.00% lênh 4 0.00% toé 4 0.00% tzipi 4 0.00% ùm 4 0.00% dâ2y 4 0.00% hê1ch 4 0.00% ngoa5m 4 0.00% ru'2ng… 4 0.00% hê3 4 0.00% ddâ5t 4 0.00% lùm 4 0.00% càu 4 0.00% mô5t… 4 0.00% gia5t 4 0.00% chuô2i 4 0.00% cha(2m 3 0.00% we 3 0.00% king 3 0.00% náy 3 0.00% luy4 3 0.00% (na5n 3 0.00% bòn 3 0.00% xo'3 3 0.00% muô5i 3 0.00% (huyê5n 3 0.00% ca(5n 3 0.00% thong 3 0.00% 7g30 3 0.00% sê5 3 0.00% golan 3 0.00% giri 3 0.00% thuô5t 3 0.00% 5cm 3 0.00% bank 3 0.00% u'1a 3 0.00% (huê1 3 0.00% alkatiri 3 0.00% thâ4m 3 0.00% xe5t 3 0.00% (saint 3 0.00% pô 3 0.00% chlb 3 0.00% horta 3 0.00% u'5c

3 0.00% east 3 0.00% abraham 3 0.00% 192 3 0.00% a… 3 0.00% nguo'2i 3 0.00% 1m2 3 0.00% beta 3 0.00% a(n… 3 0.00% 340 3 0.00% 1010 3 0.00% nhe4 3 0.00% bear 3 0.00% 730 3 0.00% 7g 3 0.00% foote 3 0.00% du'o'5ng 3 0.00% kilômet 3 0.00% lmvntd 3 0.00% ddùm 3 0.00% quái' 3 0.00% xe5p 3 0.00% dow 3 0.00% du'a5 3 0.00% bondi 3 0.00% campbell 3 0.00% khâ1m 3 0.00% tay' 3 0.00% nhu4n 3 0.00% soái 3 0.00% ru3ng 3 0.00% ge 3 0.00% rr 3 0.00% thõng 3 0.00% ho5ach 3 0.00% (84 3 0.00% shimbun 3 0.00% yemen 3 0.00% 489 3 0.00% 438 3 0.00% vùa 3 0.00% (bang 3 0.00% brrr 3 0.00% be4 3 0.00% (trâ2n 3 0.00% mfn 3 0.00% saleh 3 0.00% (bi5 3 0.00% sandy 3 0.00% (83 3 0.00% (82 3 0.00% ghán 3 0.00% zealand

3 0.00% patrushev 3 0.00% brownback 3 0.00% qua5i 3 0.00% pravda 3 0.00% áy 3 0.00% gordon 3 0.00% va(2ng 3 0.00% thút 3 0.00% ro5m 3 0.00% nu'1a 3 0.00% xoàng 3 0.00% bali 3 0.00% francis 3 0.00% clark 3 0.00% lô1t 3 0.00% perth 3 0.00% híp 3 0.00% cbkt 3 0.00% sergei 3 0.00% dopamin 3 0.00% (viê5t 3 0.00% straits 3 0.00% xinhuanet 3 0.00% 362 3 0.00% 5602 3 0.00% (viê1t 3 0.00% (trà 3 0.00% u'o'1m 3 0.00% strasbourg 3 0.00% (thành 3 0.00% vo'3n 3 0.00% will 3 0.00% copywriter 3 0.00% office 3 0.00% dduo'2ng 3 0.00% bi3nh 3 0.00% lo'3n 3 0.00% …… 3 0.00% congress 3 0.00% your 3 0.00% (tu'5 3 0.00% anders 3 0.00% sb 3 0.00% aaja 3 0.00% 1500 3 0.00% tu'3… 3 0.00% luông 3 0.00% 303 3 0.00% lu4i 3 0.00% dewar 3 0.00% to'5p 3 0.00% béc

3 0.00% andy 3 0.00% (tiê3u 3 0.00% giê 3 0.00% ma5p 3 0.00% americans 3 0.00% maryland 3 0.00% amyloid 3 0.00% (vm 3 0.00% 2g 3 0.00% aragon 3 0.00% (chuyên 3 0.00% abidal 3 0.00% bo'5m 3 0.00% hiê2m 3 0.00% khoa(1ng 3 0.00% arnold 3 0.00% seafood 3 0.00% (ban 3 0.00% gim 3 0.00% xu5p 3 0.00% ngu'òi 3 0.00% qu4y 3 0.00% (san 3 0.00% dans 3 0.00% chêm 3 0.00% vi5n 3 0.00% u'2ng 3 0.00% be5 3 0.00% mit 3 0.00% tha3ng 3 0.00% (email 3 0.00% vàng… 3 0.00% ki3nh 3 0.00% intelligence 3 0.00% hão 3 0.00% cáng 3 0.00% ma3y 3 0.00% grand 3 0.00% thóp 3 0.00% idecaf 3 0.00% (kê1t 3 0.00% lu5i 3 0.00% vô2n 3 0.00% giâ1m 3 0.00% (nguyê4n 3 0.00% nê5n 3 0.00% muô2i 3 0.00% (tiê1ng 3 0.00% vót 3 0.00% petrochina 3 0.00% 270 3 0.00% qiantang

3 0.00% tu3m 3 0.00% sâm 3 0.00% mút 3 0.00% (cú 3 0.00% jazeera 3 0.00% nho'4 3 0.00% golmud 3 0.00% fredrick 3 0.00% trâ2y 3 0.00% ponce 3 0.00% láu 3 0.00% staline 3 0.00% nghi5t 3 0.00% railpartners 3 0.00% elena 3 0.00% maria 3 0.00% ti3m 3 0.00% vánh 3 0.00% soul 3 0.00% ames 3 0.00% ddo5an 3 0.00% hormone 3 0.00% cornwall 3 0.00% sucacnô 3 0.00% iv 3 0.00% búi 3 0.00% (ba(2ng 3 0.00% qua(5ng 3 0.00% go' 3 0.00% yu 3 0.00% lóa 3 0.00% aso 3 0.00% destination 3 0.00% marx 3 0.00% final 3 0.00% (0 3 0.00% loren 3 0.00% ho3n 3 0.00% 430 3 0.00% pho5t 3 0.00% (k 3 0.00% barbara 3 0.00% 534370 3 0.00% quâ4n 3 0.00% ddo'2m 3 0.00% dâ5n 3 0.00% (thê1 3 0.00% ná 3 0.00% dda(4n 3 0.00% 200g 3 0.00% derby 3 0.00% bu'1ng

3 0.00% thau 3 0.00% anan 3 0.00% rô2 3 0.00% gu 3 0.00% cho3 3 0.00% (csvn 3 0.00% xâu 3 0.00% ra… 3 0.00% shalit 3 0.00% tre4n 3 0.00% panahi 3 0.00% u'o'1p 3 0.00% búyt 3 0.00% o'n' 3 0.00% (48 3 0.00% (afp 3 0.00% la(5c 3 0.00% chái 3 0.00% amsterdam 3 0.00% phà 3 0.00% gilad 3 0.00% meo… 3 0.00% giãi 3 0.00% haniya 3 0.00% ko 3 0.00% nua 3 0.00% shafer 3 0.00% newsweek 3 0.00% dâ1p 3 0.00% thx 3 0.00% (h 3 0.00% mass 3 0.00% chu'4ng 3 0.00% diê5m 3 0.00% (dân 3 0.00% (u3y 3 0.00% ngay2 3 0.00% chum 3 0.00% lucia 3 0.00% zambia 3 0.00% adolf 3 0.00% tràm 3 0.00% khoa3i 3 0.00% pen 3 0.00% mercury 3 0.00% 255 3 0.00% (bv 3 0.00% 256 3 0.00% castro 3 0.00% air 3 0.00% tâng 3 0.00% 167

3 0.00% shangri 3 0.00% reza 3 0.00% dollar 3 0.00% giong 3 0.00% fidel 3 0.00% valentin 3 0.00% tóan 3 0.00% phuo'ng 3 0.00% line 3 0.00% và…nó 3 0.00% allen 3 0.00% pha3ng 3 0.00% gorbatchev 3 0.00% chey 3 0.00% gen 3 0.00% chetta 3 0.00% chiêng 3 0.00% chi5ch 3 0.00% perfumebay 3 0.00% korea 3 0.00% hô5c 3 0.00% kha3n 3 0.00% pho'1i 3 0.00% lòm 3 0.00% lénine 3 0.00% lagarde 3 0.00% un 3 0.00% asefi 3 0.00% 1925 3 0.00% (nhiê2u 3 0.00% lòa 3 0.00% ho5at 3 0.00% súy 3 0.00% aksu 3 0.00% canadian 3 0.00% anz 3 0.00% make 3 0.00% shield 3 0.00% louis 3 0.00% sa5 3 0.00% br 3 0.00% ottawa 3 0.00% gio'2… 3 0.00% big 3 0.00% tróc 3 0.00% dô 3 0.00% (ba5n 3 0.00% tsc 3 0.00% 465 3 0.00% call 3 0.00% alberta 3 0.00% tddc

3 0.00% sjc 3 0.00% hillary 3 0.00% like 3 0.00% ph 3 0.00% isbell 3 0.00% exmocare 3 0.00% km2 3 0.00% inca 3 0.00% mé 3 0.00% todd 3 0.00% xèo 3 0.00% ricefish 3 0.00% museum 3 0.00% mission 3 0.00% nicotine 3 0.00% taos 3 0.00% lo3i 3 0.00% pu 3 0.00% mram 3 0.00% aa 3 0.00% sts 3 0.00% motor 3 0.00% metallidurans 3 0.00% ralstonia 3 0.00% sa5c 3 0.00% guardian 3 0.00% hare 3 0.00% nhoe5t 3 0.00% ngái 3 0.00% huân 3 0.00% mình… 3 0.00% nhâ2y 3 0.00% night 3 0.00% abdullah 3 0.00% (câ2n 3 0.00% phu3i 3 0.00% kiê1ng 3 0.00% 116 3 0.00% kcn 3 0.00% 117 3 0.00% rô1c 3 0.00% nhòe 3 0.00% 330 3 0.00% lên… 3 0.00% vog 3 0.00% natri 3 0.00% paddington 3 0.00% (ddhqg 3 0.00% alô 3 0.00% boong 3 0.00% harvard 3 0.00% liga

3 0.00% estratest 3 0.00% bô1t 3 0.00% wb 3 0.00% 9999 3 0.00% carnegie 3 0.00% (dda(5ng 3 0.00% u3a 3 0.00% 93 3 0.00% brzezinski 3 0.00% c5 3 0.00% dna 3 0.00% cu'4 3 0.00% ttytdp 3 0.00% joseph 3 0.00% blackledge 3 0.00% tcty 3 0.00% (indonesia 3 0.00% chô3i 3 0.00% (dda 3 0.00% tro3 3 0.00% nha(5ng 3 0.00% xi5 3 0.00% khi5t 3 0.00% at 3 0.00% rupiah 3 0.00% metro 3 0.00% (chi 3 0.00% go'2 3 0.00% college 3 0.00% christ 3 0.00% khoa(1t 3 0.00% nokia 3 0.00% bp 3 0.00% hâ4ng 3 0.00% 559 3 0.00% (ddâ2u 3 0.00% (nguô2n 3 0.00% services 3 0.00% princeton 3 0.00% juliet 3 0.00% 400dd 3 0.00% county 3 0.00% orange 3 0.00% thompson 3 0.00% vnvnonn 3 0.00% valley 3 0.00% nghè 3 0.00% cô5c… 3 0.00% ceo 3 0.00% calmette 3 0.00% mon 3 0.00% trãi

3 0.00% (2006 3 0.00% (ddúng 3 0.00% eximbank 3 0.00% container 3 0.00% góa 3 0.00% c6 3 0.00% phâ3y 3 0.00% cech 3 0.00% 1940 3 0.00% 3 0.00% ho'5i 3 0.00% o'5 3 0.00% xx 3 0.00% 1910 3 0.00% gt 3 0.00% nhú 3 0.00% (27dd 3 0.00% huyênh 3 0.00% khiê4ng 3 0.00% nhu'5t 3 0.00% argiope 3 0.00% ddtddd 3 0.00% francois 3 0.00% nhu'3 3 0.00% bc 3 0.00% 128 3 0.00% myco 3 0.00% sui 3 0.00% ba(5m 3 0.00% bi5ch… 3 0.00% me5t 3 0.00% (vô1n 3 0.00% 6m 3 0.00% vo3n 3 0.00% canterbury 3 0.00% shinawatra 3 0.00% hua 3 0.00% per 3 0.00% aston 3 0.00% bentley 3 0.00% (e 3 0.00% rãnh 3 0.00% sandal 3 0.00% bi5ch…bình 3 0.00% co5ng 3 0.00% dòn 3 0.00% edaw 3 0.00% quo' 3 0.00% tmn 3 0.00% hillevi 3 0.00% 1m80 3 0.00% go'3

3 0.00% ghiê2n 3 0.00% amaobi 3 0.00% oxana 3 0.00% (viettel 3 0.00% tostao 3 0.00% (vnpt 3 0.00% tiê5t 3 0.00% bosworth 3 0.00% gov 3 0.00% returns 3 0.00% first 3 0.00% mr 3 0.00% ngoan… 3 0.00% (phâ2n 3 0.00% (giâ1y 3 0.00% sô2i 3 0.00% cu'1a 3 0.00% nbc 3 0.00% mourinho 3 0.00% jackson 3 0.00% pennsylvania 3 0.00% virgin 3 0.00% mèm 3 0.00% panikian 3 0.00% martinez 3 0.00% 143 3 0.00% 460 3 0.00% chây 3 0.00% châ2n 3 0.00% chu'2 3 0.00% huo' 3 0.00% cu'o'5c 3 0.00% johnny 3 0.00% nhô5t 3 0.00% week 3 0.00% kalou 3 0.00% toang 3 0.00% ddu'o'5c… 3 0.00% yang 3 0.00% wu 3 0.00% ltd 3 0.00% lula 3 0.00% nafta 3 0.00% jodric 3 0.00% brand 3 0.00% rmit 3 0.00% apollo 3 0.00% democracy 3 0.00% (na 3 0.00% mo5n 3 0.00% cisco 3 0.00% (du'o'1i

3 0.00% (perth 3 0.00% (internet 3 0.00% libya 3 0.00% (cùng 3 0.00% condoleeza 3 0.00% turkmenistan 3 0.00% xoan 3 0.00% swashbuckle 3 0.00% xoa5ch 3 0.00% ross 3 0.00% thinh 3 0.00% dixon 3 0.00% pa(2ng 3 0.00% (ai 3 0.00% scandal 3 0.00% suyê3n 3 0.00% liveshow 3 0.00% o'1 3 0.00% riê2ng 3 0.00% cruyff 3 0.00% nestor 3 0.00% muô1t 3 0.00% council 3 0.00% khê2u 3 0.00% coast 3 0.00% marco 3 0.00% cruz 3 0.00% (gô2m 3 0.00% razak 3 0.00% (1986 3 0.00% 1963 3 0.00% huw 3 0.00% ình 3 0.00% waynerooney 3 0.00% simon 3 0.00% 1951 3 0.00% da… 3 0.00% tra5c 3 0.00% samba 3 0.00% this 3 0.00% (27 3 0.00% nguâ5y 3 0.00% ddn 3 0.00% cncnnb 3 0.00% santa 3 0.00% jun 3 0.00% wipo 3 0.00% (ddê2u 3 0.00% levis 3 0.00% 190 3 0.00% nylon 3 0.00% bu'o'n

3 0.00% nhoài 3 0.00% ttn 3 0.00% (ddi 3 0.00% asp 3 0.00% xê 3 0.00% tncn 3 0.00% ngo5ng 3 0.00% ptth 3 0.00% lô3ng 3 0.00% khâ1t 3 0.00% nha3m 3 0.00% (bí 3 0.00% nhói 3 0.00% everton 3 0.00% òn 3 0.00% lêu 3 0.00% (ddsq 3 0.00% becamex 3 0.00% hám 3 0.00% muffet 3 0.00% 211 3 0.00% (nhân 3 0.00% lauriane 3 0.00% mâ2n 3 0.00% sòm 3 0.00% gâu… 3 0.00% khua 3 0.00% tarud 3 0.00% tha3nh 3 0.00% tót 3 0.00% quào 3 0.00% ro'5n 3 0.00% hu'á 3 0.00% inna 3 0.00% oa3i 3 0.00% conner 3 0.00% gillieron 3 0.00% sình 3 0.00% nghì 3 0.00% trác 3 0.00% pisico 3 0.00% (tha(1ng 3 0.00% (co' 3 0.00% bê2nh 3 0.00% dáo 3 0.00% (phu'o'2ng 3 0.00% (ta5i 3 0.00% gatti 3 0.00% lithuania 3 0.00% 10h 3 0.00% qatar 3 0.00% vck

3 0.00% 179 3 0.00% (úc 3 0.00% nhón 3 0.00% royal 3 0.00% nh 3 0.00% ra(1t 3 0.00% tun 3 0.00% alaska 3 0.00% (40 3 0.00% (penalty 3 0.00% 850 3 0.00% gonzález 3 0.00% (du'5 3 0.00% martínez 3 0.00% marín 3 0.00% (so 3 0.00% ubayda 3 0.00% sequeira 3 0.00% solís 3 0.00% kasuri 3 0.00% thôi… 3 0.00% mauricio 3 0.00% 3 0.00% nha5o 3 0.00% ubndtp 3 0.00% brandenburg 3 0.00% ammar 3 0.00% denis 3 0.00% haye 3 0.00% president 3 0.00% truyê1t 3 0.00% techno 3 0.00% dragutinovic 3 0.00% (tâ1t 3 0.00% kháu 3 0.00% khi3nh 3 0.00% zigic 3 0.00% merk 3 0.00% diê5c 3 0.00% (in 3 0.00% radar 3 0.00% ba3 3 0.00% armey 3 0.00% vinagame 3 0.00% su’o’2n 3 0.00% ôrian 3 0.00% phlora 3 0.00% traurig 3 0.00% faith 3 0.00% liê2m 3 0.00% (âu 3 0.00% 840

3 0.00% traditional 3 0.00% atr 3 0.00% express 3 0.00% values 3 0.00% 112 3 0.00% 148 3 0.00% pelé 3 0.00% que5t 3 0.00% césar 3 0.00% rónald 3 0.00% gómez 3 0.00% lampart 3 0.00% stillwell 3 0.00% ddu’o’ng 3 0.00% trát 3 0.00% mo'1i… 3 0.00% journal 3 0.00% xtêphan 3 0.00% (kiê3u 3 0.00% 122 3 0.00% xu’1ng 3 0.00% mánh 3 0.00% golmohammadi 3 0.00% khóe 3 0.00% cocu 3 0.00% phài 3 0.00% bakary 3 0.00% mahdavikia 3 0.00% ujfalusi 3 0.00% muntari 3 0.00% gyan 3 0.00% zandi 3 0.00% hâ1u 3 0.00% táy 3 0.00% karimi 3 0.00% nhành 3 0.00% (chu'4 3 0.00% krstajic 3 0.00% gonzales 3 0.00% anime 3 0.00% humvee 3 0.00% pool 3 0.00% dòm 3 0.00% (46 3 0.00% haniyah 3 0.00% so'2n 3 0.00% gutierrez 3 0.00% sean 3 0.00% thào 3 0.00% giu5a 3 0.00% hannover 3 0.00% odonkor

3 0.00% pereiro 3 0.00% salon 3 0.00% shinya 3 0.00% 187 3 0.00% (7 3 0.00% kílô 3 0.00% ac 3 0.00% mãi… 3 0.00% serie 3 0.00% number 3 0.00% pokal 3 0.00% cho'1m 3 0.00% goofy 3 0.00% dfb 3 0.00% tyre 3 0.00% thu'à 3 0.00% ddoa5 3 0.00% tòm 3 0.00% pope 3 0.00% mastroeni 3 0.00% mcbride 3 0.00% (cái 3 0.00% caribbean 3 0.00% floyd 3 0.00% prize 3 0.00% waschtschuk 3 0.00% schleck 3 0.00% schowkowski 3 0.00% qua(5t 3 0.00% stagflation 3 0.00% petrodollars 3 0.00% canaveral 3 0.00% hosni 3 0.00% bujr 3 0.00% sâ4m 3 0.00% ports 3 0.00% amr 3 0.00% osirak 3 0.00% bu’2a 3 0.00% allawi 3 0.00% tamil 3 0.00% 409 3 0.00% muslim 3 0.00% brotherhood 3 0.00% (pakistan 3 0.00% npt 3 0.00% augustine 3 0.00% aleksandr 3 0.00% harcharik 3 0.00% zimmer 3 0.00% cricket 3 0.00% dpa

3 0.00% rand 3 0.00% capital 3 0.00% vv 3 0.00% pa(1c 3 0.00% requelme 3 0.00% luy5 3 0.00% ruô2ng 3 0.00% power 3 0.00% nicaragua 3 0.00% casey 3 0.00% arizona 3 0.00% ttv 3 0.00% 20000 3 0.00% lai5 3 0.00% lu’o’1i 3 0.00% dulles 3 0.00% ddo’4 3 0.00% powers 3 0.00% eisenhower 3 0.00% porter 3 0.00% 357 3 0.00% niger 3 0.00% jeff 3 0.00% 'dân 3 0.00% (a 3 0.00% kilô 3 0.00% truman 3 0.00% fossum 3 0.00% piers 3 0.00% oxy 3 0.00% champion 3 0.00% rita 3 0.00% 602 3 0.00% hackett 3 0.00% gô2 3 0.00% hydrogen 3 0.00% 283 3 0.00% 725 3 0.00% shimane 3 0.00% kuranyi 3 0.00% hen 3 0.00% khênh 3 0.00% baker 3 0.00% worldwide 3 0.00% schalke 3 0.00% committee 3 0.00% crowe 3 0.00% hô1ng 3 0.00% winfield 3 0.00% chu'o'3ng 3 0.00% brian 3 0.00% thoang

3 0.00% so’n 3 0.00% boich 3 0.00% henri 3 0.00% citizens 3 0.00% lian 3 0.00% (majority 3 0.00% guimaraes 3 0.00% bloomberg 3 0.00% josé 3 0.00% niu 3 0.00% worldcom 3 0.00% idaho 3 0.00% bu’o’1u 3 0.00% tánh 3 0.00% o'1i 3 0.00% boggs 3 0.00% patton 3 0.00% tèo 3 0.00% khamenei 3 0.00% nho’1 3 0.00% solana 3 0.00% la3 3 0.00% project 3 0.00% (plea 3 0.00% knock 3 0.00% airbus 3 0.00% mcclaren 3 0.00% hu’o’ng 3 0.00% gâ3y 3 0.00% luâ3n 3 0.00% phé 3 0.00% ledge 3 0.00% index 3 0.00% bon 3 0.00% piero 3 0.00% ngu’4 3 0.00% khu’1 3 0.00% ossetia 3 0.00% tajikistan 3 0.00% rìu 3 0.00% nhíu 3 0.00% (golf 3 0.00% nhu3i 3 0.00% bargain 3 0.00% ayman 3 0.00% musab 3 0.00% ba(4ng 3 0.00% steinberg 3 0.00% ddênh 3 0.00% ljungberg 3 0.00% party 3 0.00% spa

3 0.00% xo 3 0.00% risal 3 0.00% ngâ3m 3 0.00% trô1i 3 0.00% pha5ch 3 0.00% hu’1a 3 0.00% bosnia 3 0.00% hanke 3 0.00% jiegao 3 0.00% basra 3 0.00% phành 3 0.00% fouad 3 0.00% 10 3 0.00% (central 3 0.00% christoph 3 0.00% tech 3 0.00% seidel 3 0.00% (hans 3 0.00% algeria 3 0.00% ddâ2n 3 0.00% bremer 3 0.00% ne3 3 0.00% worl 3 0.00% bernd 3 0.00% lê1ch 3 0.00% chiê1p 3 0.00% thein 3 0.00% nuô5t 3 0.00% key 3 0.00% driver 3 0.00% sihanouk 3 0.00% 8x 3 0.00% roed 3 0.00% ferdinand 3 0.00% 9x 3 0.00% xõa 3 0.00% (mnc 3 0.00% synovations 3 0.00% hai… 3 0.00% illarionov 3 0.00% kimbrell 3 0.00% dô4i 3 0.00% thâ1y… 3 0.00% ho' 3 0.00% live 3 0.00% (ireland 3 0.00% 164 3 0.00% joker 3 0.00% 5m 3 0.00% bo’2 3 0.00% na5m 3 0.00% (thua

3 0.00% 2500 3 0.00% 5x 3 0.00% nhê1ch 3 0.00% 6x 3 0.00% leu 3 0.00% yogyakarta 3 0.00% toi 3 0.00% vén 3 0.00% ghosh 3 0.00% (châ1t 3 0.00% command 3 0.00% gie3 3 0.00% cu’o’1p 3 0.00% otaku 3 0.00% dô5ng 3 0.00% gí 3 0.00% nuno 3 0.00% nu'o'1c… 3 0.00% villar 3 0.00% chu’2ng 3 0.00% nhô1n 3 0.00% philipp 3 0.00% chye 3 0.00% lô3 3 0.00% waziristan 3 0.00% ddu’5ng 3 0.00% yellowcake 3 0.00% (nus 3 0.00% heinze 3 0.00% kiang 3 0.00% mottaki 3 0.00% ngo’2 3 0.00% zahar 3 0.00% enjoy 3 0.00% meira 3 0.00% gerhard 3 0.00% nus 2 0.00% cha5y… 2 0.00% (23dd 2 0.00% yonhap 2 0.00% vaa 2 0.00% (sacombank 2 0.00% signatures 2 0.00% future 2 0.00% kê2u 2 0.00% úi 2 0.00% leaders 2 0.00% (scotland 2 0.00% hiss 2 0.00% b727 2 0.00% tito 2 0.00% yamanaka

2 0.00% noble 2 0.00% beethoven 2 0.00% (47 2 0.00% 609 2 0.00% shopping 2 0.00% go'2m 2 0.00% gambling 2 0.00% prohibition 2 0.00% berger 2 0.00% viktor 2 0.00% xóc 2 0.00% nowakowski 2 0.00% xâ3u 2 0.00% schmidt 2 0.00% azerbaijan 2 0.00% andrei 2 0.00% (lobbyist 2 0.00% congressional 2 0.00% (khoai 2 0.00% nrcc 2 0.00% hddqt 2 0.00% stb 2 0.00% èo 2 0.00% uô5t 2 0.00% hóng 2 0.00% schatz 2 0.00% lu'5o'c 2 0.00% lòng' 2 0.00% (gâ1p 2 0.00% ca5y 2 0.00% fries 2 0.00% enzym 2 0.00% procter 2 0.00% gamble 2 0.00% 1935 2 0.00% packard 2 0.00% 'o' 2 0.00% whirlpool 2 0.00% hewlett 2 0.00% 666 2 0.00% sidon 2 0.00% building 2 0.00% zayat 2 0.00% lindsey 2 0.00% wmd 2 0.00% walsh 2 0.00% 4m 2 0.00% herti 2 0.00% si5t 2 0.00% shizuoka 2 0.00% management 2 0.00% yong

2 0.00% allison 2 0.00% titanic 2 0.00% kiê3ng 2 0.00% adventures 2 0.00% feinberg 2 0.00% jihad 2 0.00% government 2 0.00% (hàn 2 0.00% (iss 2 0.00% 'cho 2 0.00% vo'2n 2 0.00% yudhoyono 2 0.00% glasgow 2 0.00% 15g 2 0.00% reith 2 0.00% 15dd 2 0.00% union 2 0.00% voz 2 0.00% thâ4n 2 0.00% cross 2 0.00% (iaea 2 0.00% (ljworld 2 0.00% dynamics 2 0.00% matech 2 0.00% (honorariums 2 0.00% bellaire 2 0.00% 439 2 0.00% (fbi 2 0.00% uttaradit 2 0.00% (chu'1 2 0.00% fund 2 0.00% tndk 2 0.00% bkaa12345 2 0.00% bambang 2 0.00% zinni 2 0.00% xuông 2 0.00% (must 2 0.00% jersey 2 0.00% technology 2 0.00% tennis 2 0.00% 's' 2 0.00% susilo 2 0.00% cyprus 2 0.00% dda(m 2 0.00% chevron 2 0.00% tweedledum 2 0.00% bernanke 2 0.00% (bureau 2 0.00% be5n 2 0.00% mayflower 2 0.00% analysis 2 0.00% blog

2 0.00% loanh 2 0.00% qtkd 2 0.00% dinah 2 0.00% rõi 2 0.00% (saigon 2 0.00% (ít 2 0.00% bu5 2 0.00% tweedledee 2 0.00% grai 2 0.00% (n 2 0.00% yunnan 2 0.00% (tính 2 0.00% march 2 0.00% sepa 2 0.00% 349 2 0.00% xuý 2 0.00% 34cm 2 0.00% 52cm 2 0.00% allah 2 0.00% (37 2 0.00% khùng 2 0.00% (phía 2 0.00% vu'o'2ng 2 0.00% nasrallah 2 0.00% su'o'5ng 2 0.00% (ho'n 2 0.00% westhead 2 0.00% (republic 2 0.00% dealing 2 0.00% brent 2 0.00% watergate 2 0.00% oakland 2 0.00% mccone 2 0.00% aslan 2 0.00% nhoong 2 0.00% (tt 2 0.00% prông 2 0.00% quran 2 0.00% ida 2 0.00% slaven 2 0.00% agni 2 0.00% buô5t 2 0.00% dwight 2 0.00% 2500km 2 0.00% (pearl 2 0.00% harbor 2 0.00% (di 2 0.00% típ 2 0.00% nymex 2 0.00% 'câ2n 2 0.00% (là 2 0.00% (mo'1i

2 0.00% (kcn 2 0.00% winn 2 0.00% type 2 0.00% ae 2 0.00% ftc 2 0.00% tiê1p… 2 0.00% (vòng 2 0.00% móm 2 0.00% webster 2 0.00% woolsey 2 0.00% ca5m 2 0.00% 14h 2 0.00% ddô1p 2 0.00% a(3ng 2 0.00% gore 2 0.00% loét 2 0.00% â5m 2 0.00% ahmanidejad 2 0.00% gheit 2 0.00% proliferation 2 0.00% (kích 2 0.00% treaty 2 0.00% so'3i 2 0.00% nld 2 0.00% clorua 2 0.00% electric 2 0.00% va3ng 2 0.00% (hbsag 2 0.00% fluor 2 0.00% associated 2 0.00% 10g 2 0.00% research 2 0.00% nhâ3u 2 0.00% (ngu5 2 0.00% sâ1p 2 0.00% khodorskovsky 2 0.00% hagel 2 0.00% chuck 2 0.00% (low 2 0.00% pho'2 2 0.00% intel 2 0.00% (al 2 0.00% ounce 2 0.00% kerry 2 0.00% (engagement 2 0.00% russell 2 0.00% buckley 2 0.00% munoa 2 0.00% 273 2 0.00% glenne 2 0.00% eirik 2 0.00% schuman

2 0.00% tuaàn 2 0.00% uae 2 0.00% nyunt 2 0.00% petronas 2 0.00% (china 2 0.00% peace 2 0.00% endowment 2 0.00% (cha 2 0.00% quá… 2 0.00% ba3i 2 0.00% (anti 2 0.00% bangladesh 2 0.00% penh 2 0.00% xâ1c 2 0.00% white 2 0.00% phnom 2 0.00% angkor 2 0.00% authority 2 0.00% xu'o'5c 2 0.00% wat 2 0.00% c4 2 0.00% cities 2 0.00% bourland 2 0.00% dãn 2 0.00% cbt 2 0.00% mg 2 0.00% (ty3 2 0.00% india 2 0.00% scid 2 0.00% mu'3a 2 0.00% shwe 2 0.00% soa5t 2 0.00% dpw 2 0.00% (phòng 2 0.00% ytdp 2 0.00% ô1p 2 0.00% dhabi 2 0.00% dexamethasone 2 0.00% không' 2 0.00% reyes 2 0.00% kyodo 2 0.00% flatley 2 0.00% â1y… 2 0.00% khi5 2 0.00% 50m2 2 0.00% 388 2 0.00% danaan 2 0.00% nè…và 2 0.00% ailen 2 0.00% bé… 2 0.00% sikh 2 0.00% missouri

2 0.00% cupid 2 0.00% nhoe3n 2 0.00% xê1ch 2 0.00% frazier 2 0.00% gianh 2 0.00% orleans 2 0.00% vietmatchbiz 2 0.00% 659 2 0.00% mukherjee 2 0.00% ptcs 2 0.00% (kê3 2 0.00% (va(n 2 0.00% híc 2 0.00% vô2ng 2 0.00% pranab 2 0.00% ghili 2 0.00% (philippines 2 0.00% re3ng 2 0.00% 264 2 0.00% ddiã 2 0.00% phím 2 0.00% independent 2 0.00% nu'5ng 2 0.00% xb 2 0.00% gút 2 0.00% 80 2 0.00% sô1i 2 0.00% biggins 2 0.00% hall 2 0.00% la5n 2 0.00% thiê1c 2 0.00% magazine 2 0.00% sanskrit 2 0.00% ilearn 2 0.00% ewell 2 0.00% 100m 2 0.00% motors 2 0.00% kiê1t 2 0.00% xía 2 0.00% master 2 0.00% xoe5 2 0.00% ngô5p 2 0.00% kempner 2 0.00% trimquest 2 0.00% 275 2 0.00% 24k 2 0.00% môt 2 0.00% nê1u… 2 0.00% boo 2 0.00% hindi 2 0.00% tuy5 2 0.00% security

2 0.00% network 2 0.00% link 2 0.00% bild 2 0.00% glass 2 0.00% nang 2 0.00% romano 2 0.00% angel 2 0.00% kashmiri 2 0.00% malayalam 2 0.00% auctionassist 2 0.00% bobbidi 2 0.00% assist 2 0.00% auction 2 0.00% mart 2 0.00% aurelio 2 0.00% so'5… 2 0.00% wal 2 0.00% morientes 2 0.00% marc 2 0.00% tbn 2 0.00% cun 2 0.00% emmanuel 2 0.00% puyol 2 0.00% xu5 2 0.00% xè 2 0.00% service 2 0.00% (special 2 0.00% lâ3y 2 0.00% emerson 2 0.00% ds 2 0.00% viera 2 0.00% nedved 2 0.00% oliveira 2 0.00% sucre 2 0.00% beo 2 0.00% ba… 2 0.00% (argentina 2 0.00% chiarelli 2 0.00% duarte 2 0.00% (uruguay 2 0.00% tabare 2 0.00% tmncsg 2 0.00% essien 2 0.00% lyn 2 0.00% asier 2 0.00% dâ5m 2 0.00% schumer 2 0.00% murtha 2 0.00% sarayoot 2 0.00% uê1 2 0.00% (1988 2 0.00% (tdv

2 0.00% 450 2 0.00% grant 2 0.00% 11h 2 0.00% 15h 2 0.00% riê4u 2 0.00% (ti3nh 2 0.00% guatemala 2 0.00% ha… 2 0.00% lu'á 2 0.00% (1968 2 0.00% ro'2n 2 0.00% passport 2 0.00% 214 2 0.00% (ks 2 0.00% (ddiê5n 2 0.00% (coalition 2 0.00% khê4nh 2 0.00% phone 2 0.00% cho'i… 2 0.00% ba5n… 2 0.00% armstrong 2 0.00% ajax 2 0.00% johan 2 0.00% lance 2 0.00% pandora 2 0.00% (bs 2 0.00% adelaide 2 0.00% eddi 2 0.00% safa 2 0.00% 62m 2 0.00% dìn 2 0.00% muà 2 0.00% shifter 2 0.00% 243 2 0.00% rê2 2 0.00% buô2n… 2 0.00% 21h 2 0.00% feeble 2 0.00% kha3m 2 0.00% (gio'1i 2 0.00% priest 2 0.00% mót 2 0.00% zace 2 0.00% rich 2 0.00% cirincione 2 0.00% republicans 2 0.00% nâ4u 2 0.00% do5 2 0.00% há…a(1t 2 0.00% marketing 2 0.00% aderholt 2 0.00% (so'3

2 0.00% tauzin 2 0.00% (u'1ng 2 0.00% nâ1n 2 0.00% la(2ng 2 0.00% (p 2 0.00% stanley 2 0.00% what 2 0.00% vasell 2 0.00% furama 2 0.00% lúi 2 0.00% nghét 2 0.00% (quê 2 0.00% elcb 2 0.00% mississipi 2 0.00% ngót 2 0.00% nablus 2 0.00% red 2 0.00% 2h 2 0.00% lehrman 2 0.00% sim 2 0.00% scorpion 2 0.00% (nô3i 2 0.00% quyê5n 2 0.00% mariana 2 0.00% jeep 2 0.00% cái… 2 0.00% cordesman 2 0.00% hyundai 2 0.00% hu'2 2 0.00% nesta 2 0.00% bu5p 2 0.00% lamborghini 2 0.00% rover 2 0.00% 911 2 0.00% ellis 2 0.00% ni4a 2 0.00% shekel 2 0.00% leader 2 0.00% grimaldi 2 0.00% 'xin 2 0.00% mossad 2 0.00% ói 2 0.00% smart 2 0.00% (mâ1t 2 0.00% minority 2 0.00% ncppr 2 0.00% rayban 2 0.00% alliance 2 0.00% robin 2 0.00% xô3 2 0.00% shandwick 2 0.00% (vu'o'5t

2 0.00% 4m2 2 0.00% weyrich 2 0.00% shai 2 0.00% hummer 2 0.00% vanquish 2 0.00% range 2 0.00% xô1i 2 0.00% daum 2 0.00% dobson 2 0.00% q7 2 0.00% majority 2 0.00% raul 2 0.00% xép 2 0.00% (plo 2 0.00% ruy 2 0.00% cuà 2 0.00% hariri 2 0.00% rùm 2 0.00% straw 2 0.00% luô1c 2 0.00% rebibbia 2 0.00% camp 2 0.00% standard 2 0.00% hsbc 2 0.00% chartered 2 0.00% bomb 2 0.00% atomic 2 0.00% tmcp 2 0.00% not 2 0.00% hormuz 2 0.00% woodward 2 0.00% island 2 0.00% ''iran 2 0.00% giu4 2 0.00% sê5t 2 0.00% massachusetts 2 0.00% fountain 2 0.00% nano 2 0.00% westminster 2 0.00% nghiê4m 2 0.00% tambuiforjudge 2 0.00% perle 2 0.00% beng 2 0.00% (thi 2 0.00% (ttxvn 2 0.00% (lhq 2 0.00% chê5 2 0.00% la5o 2 0.00% chê4m 2 0.00% 250m 2 0.00% dutch 2 0.00% boulis

2 0.00% sumatra 2 0.00% hunter 2 0.00% darwazeh 2 0.00% borneo 2 0.00% hadley 2 0.00% heu 2 0.00% simpson 2 0.00% georgetown 2 0.00% issah 2 0.00% ahmad 2 0.00% paloma 2 0.00% rccb 2 0.00% húi 2 0.00% (co 2 0.00% lo3n 2 0.00% armitage 2 0.00% monoxide 2 0.00% go5t 2 0.00% 197 2 0.00% biê1n… 2 0.00% sám 2 0.00% xiii 2 0.00% missiles 2 0.00% bi5… 2 0.00% libbi 2 0.00% hiu 2 0.00% 591 2 0.00% khabab 2 0.00% mujahedeen 2 0.00% 222 2 0.00% (ma5nh 2 0.00% 132 2 0.00% ix 2 0.00% mcclellan 2 0.00% yersin 2 0.00% biê2n 2 0.00% kunar 2 0.00% tru'2o'ng 2 0.00% arouna 2 0.00% ré 2 0.00% yaya 2 0.00% (ho5 2 0.00% bahrain 2 0.00% nekounam 2 0.00% (ihrc 2 0.00% offside 2 0.00% kaebi 2 0.00% nosrati 2 0.00% du'5c 2 0.00% elfenbeinkueste 2 0.00% nhùng 2 0.00% boulahrouz

2 0.00% (dù 2 0.00% kolo 2 0.00% qua(5n 2 0.00% meite 2 0.00% dâ1t 2 0.00% rôma 2 0.00% yes 2 0.00% kyl 2 0.00% jon 2 0.00% koh 2 0.00% kaiserslautern 2 0.00% keller 2 0.00% pablo 2 0.00% lddldd 2 0.00% xavi 2 0.00% naò 2 0.00% alonso 2 0.00% túa 2 0.00% by 2 0.00% house 2 0.00% otto 2 0.00% commission 2 0.00% madanchi 2 0.00% property 2 0.00% citigroup 2 0.00% caspers 2 0.00% seiple 2 0.00% (dda(5c 2 0.00% vncs 2 0.00% ngóai 2 0.00% ro'1 2 0.00% ra5o 2 0.00% report 2 0.00% toa5c 2 0.00% ho'1 2 0.00% maxi 2 0.00% drancy 2 0.00% nhoà 2 0.00% tevez 2 0.00% thóat 2 0.00% phãi 2 0.00% vang… 2 0.00% ngâ1p 2 0.00% (70 2 0.00% u'o'2n 2 0.00% affairs 2 0.00% burdisso 2 0.00% mascherano 2 0.00% (south 2 0.00% minxin 2 0.00% kho3ang 2 0.00% petkovic

2 0.00% rosetti 2 0.00% tu'1o'ng 2 0.00% phâ1p 2 0.00% hõan 2 0.00% (thâ5t 2 0.00% khu'1a 2 0.00% sneijder 2 0.00% moi 2 0.00% cambiasso 2 0.00% ooijer 2 0.00% ngu'o'3ng 2 0.00% ngô3m 2 0.00% sít 2 0.00% duljaj 2 0.00% bâ1c 2 0.00% lô3m 2 0.00% ljuboja 2 0.00% 207 2 0.00% 1628 2 0.00% 111 2 0.00% 1860 2 0.00% chornidow 2 0.00% impossible 2 0.00% 740 2 0.00% marufshonow 2 0.00% 540 2 0.00% yung 2 0.00% (xin 2 0.00% alfredo 2 0.00% fdi 2 0.00% aie 2 0.00% jockey 2 0.00% (mu'o'2i 2 0.00% 441 2 0.00% malta 2 0.00% gamborgno 2 0.00% trô3i 2 0.00% hariyanto 2 0.00% hu3i 2 0.00% bernhard 2 0.00% ngày… 2 0.00% komlan 2 0.00% nì 2 0.00% rdx 2 0.00% kathmandu 2 0.00% na(2n 2 0.00% vêu 2 0.00% xìu 2 0.00% lahore 2 0.00% zahlé 2 0.00% umana 2 0.00% thuo'ng

2 0.00% khac 2 0.00% bernardino 2 0.00% (ke3 2 0.00% (1945 2 0.00% hùa 2 0.00% palacio 2 0.00% 2550 2 0.00% vicente 2 0.00% nenad 2 0.00% (hê1t 2 0.00% xuê 2 0.00% luther 2 0.00% khuy3u 2 0.00% ulkrain 2 0.00% bo'2n 2 0.00% (su' 2 0.00% chình 2 0.00% huo'3n 2 0.00% namouchi 2 0.00% aziz 2 0.00% rennes 2 0.00% bô4n 2 0.00% l6 2 0.00% jaziri 2 0.00% 4084d 2 0.00% jeserski 2 0.00% torres 2 0.00% nong 2 0.00% garcia 2 0.00% lgw 2 0.00% (68 2 0.00% (hình 2 0.00% schewtschenko 2 0.00% 165 2 0.00% canadda 2 0.00% oda 2 0.00% sobolewski 2 0.00% sái 2 0.00% krzynowek 2 0.00% zurawski 2 0.00% nhu'o'ng 2 0.00% bulgarien 2 0.00% tho3m 2 0.00% 593 2 0.00% gáp 2 0.00% (94 2 0.00% bê4 2 0.00% giâ5p 2 0.00% bolton 2 0.00% kahtani 2 0.00% dollars 2 0.00% studdert

2 0.00% 390 2 0.00% aipo 2 0.00% 733 2 0.00% boruc 2 0.00% 627 2 0.00% lenovo 2 0.00% (ddu'1ng 2 0.00% ibm 2 0.00% na3 2 0.00% giô1i 2 0.00% (gnp 2 0.00% afganistan 2 0.00% dda3n 2 0.00% 820 2 0.00% qua(n 2 0.00% 1911 2 0.00% lukas 2 0.00% taro 2 0.00% ht 2 0.00% rôm 2 0.00% vhdd 2 0.00% 236 2 0.00% 300km 2 0.00% uighurs 2 0.00% miroslav 2 0.00% ngòm 2 0.00% (zang 2 0.00% cho'3m 2 0.00% pô1t 2 0.00% (chiê1n 2 0.00% (lao 2 0.00% lo'3m 2 0.00% è 2 0.00% bói 2 0.00% 1918 2 0.00% (bà 2 0.00% khiu 2 0.00% nghê 2 0.00% nhuô1m 2 0.00% (bô1n 2 0.00% ddui 2 0.00% biêp 2 0.00% ferreira 2 0.00% (1959 2 0.00% 536787 2 0.00% 6a 2 0.00% (bo3 2 0.00% (thích 2 0.00% fernando 2 0.00% istanbul 2 0.00% 1944 2 0.00% (1966

2 0.00% (09 2 0.00% eusebio 2 0.00% 24h 2 0.00% ôstrâylia 2 0.00% pretoria 2 0.00% hunggary 2 0.00% mobi 2 0.00% nathaniel 2 0.00% saha 2 0.00% lippi 2 0.00% (argentinien 2 0.00% wiltord 2 0.00% khu' 2 0.00% gattuso 2 0.00% schaeuble 2 0.00% (ddá 2 0.00% steinmeier 2 0.00% ribery 2 0.00% uê3 2 0.00% (nói 2 0.00% giõi 2 0.00% ê1m 2 0.00% jános 2 0.00% 94 2 0.00% torsten 2 0.00% bastian 2 0.00% vo'5i 2 0.00% ru'1c 2 0.00% mobai 2 0.00% computer 2 0.00% 295 2 0.00% tbt 2 0.00% marcell 2 0.00% ep 2 0.00% nowotny 2 0.00% ccb 2 0.00% (magyar 2 0.00% benyhe 2 0.00% co'i 2 0.00% jansen 2 0.00% dâ4y 2 0.00% toái 2 0.00% quo'3 2 0.00% lobby 2 0.00% rsf 2 0.00% toa5i 2 0.00% (group 2 0.00% giang' 2 0.00% (nghi4a 2 0.00% ca(1c 2 0.00% goá 2 0.00% khóang

2 0.00% phu'2ng 2 0.00% cantalejo 2 0.00% lucio 2 0.00% portugal 2 0.00% dida 2 0.00% (spanien 2 0.00% hy3 2 0.00% thu3a 2 0.00% dongfang 2 0.00% na3i 2 0.00% baó 2 0.00% yuan 2 0.00% vô1 2 0.00% ô1 2 0.00% ke 2 0.00% tri4 2 0.00% truo'2ng 2 0.00% eastern 2 0.00% people's 2 0.00% moreno 2 0.00% byron 2 0.00% dda3i 2 0.00% autonomous 2 0.00% mô5i 2 0.00% (ca3nh 2 0.00% du3 2 0.00% jokers 2 0.00% mongolia 2 0.00% hoen 2 0.00% ddu'2o'ng 2 0.00% french 2 0.00% xinjiang 2 0.00% gregory 2 0.00% xê5ch 2 0.00% ect 2 0.00% sém 2 0.00% kamikawa 2 0.00% (thiên 2 0.00% dda(4ng 2 0.00% gu3i 2 0.00% blhs 2 0.00% (tuy 2 0.00% huth 2 0.00% (ghi 2 0.00% cùi 2 0.00% 45b 2 0.00% rê2nh 2 0.00% 319 2 0.00% 299 2 0.00% va5c 2 0.00% giô5i 2 0.00% cha(1t

2 0.00% lich 2 0.00% tiê5u 2 0.00% gô3 2 0.00% tiê1m 2 0.00% chê2 2 0.00% mím 2 0.00% chô4m 2 0.00% ngoa3i 2 0.00% tiu 2 0.00% stasi 2 0.00% viana 2 0.00% trâng 2 0.00% nghi3u 2 0.00% thóa 2 0.00% ddu'o'5m 2 0.00% nho'5t 2 0.00% piro 2 0.00% florenz 2 0.00% arne 2 0.00% robinson 2 0.00% ba(4m 2 0.00% paparazzi 2 0.00% cùn 2 0.00% uâ3n 2 0.00% vksnd 2 0.00% (japan 2 0.00% jovp 2 0.00% over 2 0.00% (nghê5 2 0.00% view 2 0.00% (hindustan 2 0.00% 1m74 2 0.00% 1m77 2 0.00% chõng 2 0.00% lu3n 2 0.00% nicholas 2 0.00% yolande 2 0.00% ishiba 2 0.00% (1954 2 0.00% central 2 0.00% apasra 2 0.00% matthews 2 0.00% dúi 2 0.00% (igo 2 0.00% 8h 2 0.00% (newsweek 2 0.00% 22h30 2 0.00% glickman 2 0.00% titorenko 2 0.00% davila 2 0.00% vip 2 0.00% 149

2 0.00% (ngo5c 2 0.00% tsymbalyuk 2 0.00% saigòn 2 0.00% htv9 2 0.00% hu'5u 2 0.00% (nô5i 2 0.00% caicos 2 0.00% turks 2 0.00% lô5p 2 0.00% transdnistr 2 0.00% nagorno 2 0.00% turkey 2 0.00% shore 2 0.00% pauly 2 0.00% menatep 2 0.00% electra 2 0.00% karabakh 2 0.00% schroeder 2 0.00% a(m 2 0.00% andrews 2 0.00% arab 2 0.00% ke4m 2 0.00% hawke 2 0.00% knightley 2 0.00% keira 2 0.00% jennifer 2 0.00% ethan 2 0.00% garner 2 0.00% macaulay 2 0.00% (hãng 2 0.00% culkin 2 0.00% nude 2 0.00% sophia 2 0.00% lìm 2 0.00% presley 2 0.00% elvis 2 0.00% anc 2 0.00% dos 2 0.00% (ngân 2 0.00% oe 2 0.00% slovak 2 0.00% vincent 2 0.00% rose 2 0.00% alexander 2 0.00% lobato 2 0.00% helen 2 0.00% tanaka 2 0.00% desmond 2 0.00% panama 2 0.00% (lee 2 0.00% lane 2 0.00% lois

2 0.00% catwoman 2 0.00% jolie 2 0.00% angelina 2 0.00% de4o 2 0.00% wallace 2 0.00% xúi 2 0.00% ''mô5t 2 0.00% obaid 2 0.00% riyadh 2 0.00% zvonareva 2 0.00% khâ5t 2 0.00% slam 2 0.00% judah 2 0.00% srebotnik 2 0.00% serena 2 0.00% khu'o'4ng 2 0.00% (border 2 0.00% lddbddvn 2 0.00% nicolas 2 0.00% chuâ4n 2 0.00% (lâ2n 2 0.00% pjico 2 0.00% khatoco 2 0.00% ddáu 2 0.00% 2 0.00% xu'a… 2 0.00% 'ddô2ng 2 0.00% kho'3i' 2 0.00% bdd 2 0.00% pomina 2 0.00% mikado 2 0.00% bonner 2 0.00% gabriel 2 0.00% loew 2 0.00% (1993 2 0.00% verdery 2 0.00% ddo'5 2 0.00% (cia 2 0.00% vi3 2 0.00% hajduk 2 0.00% donadoni 2 0.00% split 2 0.00% libby 2 0.00% uk 2 0.00% coming 2 0.00% patrol 2 0.00% phu'1t 2 0.00% (guest 2 0.00% premiership 2 0.00% soler 2 0.00% (felony 2 0.00% worker

2 0.00% park 2 0.00% shimon 2 0.00% guô1c 2 0.00% robertson 2 0.00% arevalos 2 0.00% lourdes 2 0.00% peres 2 0.00% dominguez 2 0.00% valerie 2 0.00% netayahu 2 0.00% binyamin 2 0.00% mendoza 2 0.00% josephine 2 0.00% litvinova 2 0.00% alhanko 2 0.00% osathanond 2 0.00% charm 2 0.00% sudnicka 2 0.00% barak 2 0.00% pat 2 0.00% begin 2 0.00% francys 2 0.00% (khuynh 2 0.00% kenai 2 0.00% ddu'5o'c 2 0.00% rún 2 0.00% keith 2 0.00% cùa 2 0.00% hartert 2 0.00% ritchie 2 0.00% guy 2 0.00% tania 2 0.00% jinan 2 0.00% phôn 2 0.00% luz 2 0.00% iyad 2 0.00% marina 2 0.00% hoa(1c 2 0.00% zuluaga 2 0.00% daryl 2 0.00% zuydendorp 2 0.00% phè 2 0.00% hói 2 0.00% xo'n 2 0.00% lcd 2 0.00% internationale 2 0.00% rouge 2 0.00% klinsman 2 0.00% (argentinia 2 0.00% boat 2 0.00% ho'5t 2 0.00% delgado

2 0.00% palace 2 0.00% lawrence 2 0.00% posters 2 0.00% illinois 2 0.00% 168 2 0.00% dâ5t 2 0.00% commerzbank 2 0.00% paredes 2 0.00% yvonne 2 0.00% ting 2 0.00% armando 2 0.00% (ddoa5t 2 0.00% pays 2 0.00% nos 2 0.00% démocratie 2 0.00% nhóang 2 0.00% co'… 2 0.00% guillermo 2 0.00% pavel 2 0.00% pardo 2 0.00% dean 2 0.00% ruô3i 2 0.00% 279 2 0.00% quâ2ng 2 0.00% tdd 2 0.00% bravo 2 0.00% (thi5 2 0.00% vb 2 0.00% erikson 2 0.00% asley 2 0.00% ghìm 2 0.00% larsson 2 0.00% hatton 2 0.00% ferguson 2 0.00% borgetti 2 0.00% beenhakker 2 0.00% stamford 2 0.00% reutes 2 0.00% mari 2 0.00% afif 2 0.00% arabiya 2 0.00% nadji 2 0.00% slater 2 0.00% khê5nh 2 0.00% xvi 2 0.00% pho'4n 2 0.00% xanana 2 0.00% nhau… 2 0.00% (jerusalem 2 0.00% 345 2 0.00% assad 2 0.00% snow

2 0.00% 245 2 0.00% vào… 2 0.00% (hezbollah 2 0.00% choueifat 2 0.00% 271 2 0.00% roissy 2 0.00% 470 2 0.00% xàng 2 0.00% hawaii 2 0.00% lifetime 2 0.00% oa3n 2 0.00% award 2 0.00% ngu'2 2 0.00% núddez 2 0.00% lynch 2 0.00% (liberal 2 0.00% québecois 2 0.00% downing 2 0.00% (va(1ng 2 0.00% kha5ng 2 0.00% lucky 2 0.00% trâ5p 2 0.00% khiá 2 0.00% (hlv 2 0.00% knight 2 0.00% (2003 2 0.00% ridder 2 0.00% jackie 2 0.00% (u 2 0.00% (council 2 0.00% véo 2 0.00% (joint 2 0.00% (bch 2 0.00% ernst 2 0.00% european 2 0.00% maldives 2 0.00% gnp 2 0.00% khiên 2 0.00% global 2 0.00% í 2 0.00% (02 2 0.00% gerald 2 0.00% (01 2 0.00% m'gladbach 2 0.00% lõa 2 0.00% dzoanh 2 0.00% dvtncs 2 0.00% bi4u 2 0.00% (lo'2i 2 0.00% community 2 0.00% koondoola 2 0.00% uss

2 0.00% mustin 2 0.00% tu'3ng 2 0.00% ansar 2 0.00% tunis 2 0.00% luo'5ng 2 0.00% ru5i 2 0.00% watts 2 0.00% chim… 2 0.00% initiative 2 0.00% (2001 2 0.00% sharkawy 2 0.00% habib 2 0.00% nhùi 2 0.00% (phu3 2 0.00% karim 2 0.00% abbou 2 0.00% zili 2 0.00% mughal 2 0.00% let 2 0.00% rúi 2 0.00% shi 2 0.00% pelosi 2 0.00% horst 2 0.00% qúi 2 0.00% lâng 2 0.00% vatican 2 0.00% ghiggia 2 0.00% dirksen 2 0.00% capitol 2 0.00% senate 2 0.00% alexandre 2 0.00% mesén 2 0.00% dana 2 0.00% cháu… 2 0.00% tóp 2 0.00% ddi4a… 2 0.00% xéo 2 0.00% joao 2 0.00% noí 2 0.00% lo'i 2 0.00% chu'3a 2 0.00% schottland 2 0.00% stadt 2 0.00% pace 2 0.00% salt 2 0.00% tho5ai 2 0.00% (heredia 2 0.00% (or 2 0.00% europe 2 0.00% bor 2 0.00% xem… 2 0.00% nuông

2 0.00% assembly 2 0.00% vfb 2 0.00% vietnamese 2 0.00% workers 2 0.00% quê5t 2 0.00% (vb 2 0.00% va(5c 2 0.00% are 2 0.00% alvaro 2 0.00% chuô1t 2 0.00% relation 2 0.00% nhu'2 2 0.00% or 2 0.00% dhea 2 0.00% quýnh 2 0.00% n17a 2 0.00% 400m2 2 0.00% ddang… 2 0.00% nguây 2 0.00% 20m 2 0.00% pách 2 0.00% mitsuo 2 0.00% trê5t 2 0.00% cutler 2 0.00% landing 2 0.00% tru’2ng 2 0.00% (170 2 0.00% electronics 2 0.00% (ha5 2 0.00% trompét 2 0.00% ngoài… 2 0.00% nhô2m 2 0.00% ………………… 2 0.00% ……………………… 2 0.00% iso 2 0.00% qua(1t 2 0.00% 159 2 0.00% cn 2 0.00% gía 2 0.00% valse 2 0.00% katsui 2 0.00% hoon 2 0.00% inch 2 0.00% nhoàm 2 0.00% mê2n 2 0.00% ngâ2u 2 0.00% 9g 2 0.00% (máy 2 0.00% thu'o'5c 2 0.00% stc 2 0.00% co’3i 2 0.00% fellowship

2 0.00% mechanics 2 0.00% pa3 2 0.00% (fluid 2 0.00% 522 2 0.00% starkville 2 0.00% rúm 2 0.00% kilômét 2 0.00% 55m 2 0.00% taku 2 0.00% kurt 2 0.00% 533 2 0.00% 626 2 0.00% (cuô5c 2 0.00% nhê4 2 0.00% off 2 0.00% take 2 0.00% nha5i 2 0.00% sekong 2 0.00% …………… 2 0.00% masafumi 2 0.00% sanai 2 0.00% (giám 2 0.00% roàng 2 0.00% miê2ng 2 0.00% disco 2 0.00% (ts 2 0.00% hssv 2 0.00% 184 2 0.00% 'ô' 2 0.00% satoh 2 0.00% tango 2 0.00% (xuâ1t 2 0.00% 721 2 0.00% ddu'o'2ng… 2 0.00% sahil 2 0.00% 189 2 0.00% dominique 2 0.00% cuzco 2 0.00% nguyê3n 2 0.00% khò 2 0.00% 7cm 2 0.00% vijay 2 0.00% 000m2 2 0.00% lu'a 2 0.00% ruby 2 0.00% lo'5m 2 0.00% vat 2 0.00% (graphics 2 0.00% dda5 2 0.00% troy 2 0.00% sophie 2 0.00% (trú

2 0.00% 770 2 0.00% dj 2 0.00% tênh 2 0.00% (vai 2 0.00% ngoai 2 0.00% lú 2 0.00% syri 2 0.00% kenkyusei 2 0.00% 178 2 0.00% thuâ1n 2 0.00% manouchehr 2 0.00% zimbabwe 2 0.00% heineken 2 0.00% adam 2 0.00% (ký 2 0.00% (pha3i 2 0.00% sachs 2 0.00% 147 2 0.00% 337 2 0.00% pinôkio 2 0.00% kho’i 2 0.00% câ1c 2 0.00% frankenstein 2 0.00% 5km 2 0.00% konda 2 0.00% anakonda 2 0.00% (grand 2 0.00% windows 2 0.00% rockét 2 0.00% triê3n… 2 0.00% klissmann 2 0.00% ita 2 0.00% 640 2 0.00% karnataka 2 0.00% nambiar 2 0.00% tho'i 2 0.00% ivanov 2 0.00% axit 2 0.00% jurgen 2 0.00% bi4nh 2 0.00% (photography 2 0.00% pessin 2 0.00% hoác 2 0.00% steve 2 0.00% (100 2 0.00% damas 2 0.00% rê1 2 0.00% hoa' 2 0.00% co'lêô 2 0.00% (vietnam 2 0.00% phigarô 2 0.00% ngu’2a

2 0.00% messi 2 0.00% fu 2 0.00% hên 2 0.00% paw 2 0.00% silvio 2 0.00% berlusconi 2 0.00% rèm 2 0.00% fashion 2 0.00% y2 2 0.00% uy5ch 2 0.00% philippineses 2 0.00% 20km 2 0.00% kawai 2 0.00% barem 2 0.00% 'ngôi 2 0.00% thích… 2 0.00% ddê3… 2 0.00% liê5m 2 0.00% gàu 2 0.00% xu'1c 2 0.00% nê5m 2 0.00% rosalie 2 0.00% albania 2 0.00% ubqg 2 0.00% (clb 2 0.00% seiko 2 0.00% hanh 2 0.00% (sgk 2 0.00% 500m 2 0.00% 198 2 0.00% mu’o’5n 2 0.00% át 2 0.00% kasim 2 0.00% western 2 0.00% thòm 2 0.00% pacoret 2 0.00% va5nh 2 0.00% vilnius 2 0.00% (nxbgd 2 0.00% unicef 2 0.00% jogjakarta 2 0.00% la3nh 2 0.00% sona 2 0.00% nhài 2 0.00% (fsb 2 0.00% vòm 2 0.00% iev 2 0.00% hollins 2 0.00% cátxin 2 0.00% esso 2 0.00% lazio 2 0.00% engineering

2 0.00% vff 2 0.00% lada 2 0.00% ana 2 0.00% (smddh 2 0.00% su4ng 2 0.00% cornell 2 0.00% thàn 2 0.00% doi 2 0.00% d1 2 0.00% grozny 2 0.00% stockholm 2 0.00% so’5i 2 0.00% thó 2 0.00% 100dd 2 0.00% cha5c 2 0.00% be4n 2 0.00% 20kg 2 0.00% daido 2 0.00% bqldaddtltddqt 2 0.00% xima(ng 2 0.00% vtv4 2 0.00% shinta 2 0.00% va5m 2 0.00% school 2 0.00% utah 2 0.00% fujiwara 2 0.00% examiner 2 0.00% chùng 2 0.00% cents 2 0.00% oak 2 0.00% tho'1 2 0.00% rubaie 2 0.00% terje 2 0.00% o'2i 2 0.00% uc 2 0.00% lu’o’1t 2 0.00% brindani 2 0.00% le4n 2 0.00% moriyama 2 0.00% 18001567 2 0.00% 200dd 2 0.00% suffredini 2 0.00% nhòm 2 0.00% (mâ5n 2 0.00% jba 2 0.00% 1559 2 0.00% leadership 2 0.00% bê5t 2 0.00% vroom 2 0.00% u'3ng 2 0.00% bu'o'1ng 2 0.00% scholarships

2 0.00% nhão 2 0.00% criminal 2 0.00% ddòng 2 0.00% mói 2 0.00% (lu'u 2 0.00% technologies 2 0.00% endeavour 2 0.00% ku 2 0.00% vu5c 2 0.00% 622 2 0.00% popovych 2 0.00% software 2 0.00% yaroslav 2 0.00% (singapore 2 0.00% monica 2 0.00% bellucci 2 0.00% bloc 2 0.00% schwab 2 0.00% ngo5t… 2 0.00% (202 2 0.00% (gâ2n 2 0.00% tictack 2 0.00% garmser 2 0.00% 14g 2 0.00% nhem 2 0.00% 335 2 0.00% (australian 2 0.00% ru'1a 2 0.00% daimler 2 0.00% (cpc 2 0.00% stress 2 0.00% syed 2 0.00% dupree 2 0.00% mars 2 0.00% phoenix 2 0.00% môtô 2 0.00% 680 2 0.00% nabarro 2 0.00% carol 2 0.00% schlein 2 0.00% tru’a 2 0.00% (tri5 2 0.00% ringgit 2 0.00% mogan 2 0.00% ddóm 2 0.00% du4a 2 0.00% potassium 2 0.00% (hoàng 2 0.00% (gia3ng 2 0.00% (1901 2 0.00% xoa(n 2 0.00% 356

2 0.00% lài 2 0.00% phomát 2 0.00% lô1m 2 0.00% (bâ1t 2 0.00% tnt 2 0.00% maggie 2 0.00% elysees 2 0.00% ga5n 2 0.00% champs 2 0.00% ô2i 2 0.00% oxford 2 0.00% sàm 2 0.00% ellen 2 0.00% daklak 2 0.00% ngo’5i 2 0.00% rlc 2 0.00% chàm 2 0.00% 18g 2 0.00% margaret 2 0.00% rap 2 0.00% ngo’ 2 0.00% acid 2 0.00% 1kg 2 0.00% 600ha 2 0.00% 100ha 2 0.00% nho3m 2 0.00% granit 2 0.00% viêt 2 0.00% nâ1t 2 0.00% bk 2 0.00% berkeley 2 0.00% 110mmol 2 0.00% rem 2 0.00% beslan 2 0.00% (bhyt 2 0.00% steinglass 2 0.00% tòng 2 0.00% 191 2 0.00% ðô5 2 0.00% isreal 2 0.00% vidéo 2 0.00% nhi3nh 2 0.00% (bn 2 0.00% 13g30 2 0.00% arkansas 2 0.00% uông 2 0.00% protein 2 0.00% shawn 2 0.00% clayton 2 0.00% ngu'o' 2 0.00% 172 2 0.00% carla

2 0.00% charlene 2 0.00% (thô3 2 0.00% ngoe5o 2 0.00% que5o 2 0.00% nán 2 0.00% barshefsky 2 0.00% dzhokhar 2 0.00% 05' 2 0.00% access 2 0.00% niêu 2 0.00% cunego 2 0.00% mt 2 0.00% (biên 2 0.00% be3o 2 0.00% (tuô3i 2 0.00% nho'2n 2 0.00% yeutter 2 0.00% lh 2 0.00% váp 2 0.00% vega 2 0.00% amelia 2 0.00% diêm 2 0.00% queensland 2 0.00% fsh 2 0.00% telemundo 2 0.00% ddâ1t… 2 0.00% khaled 2 0.00% madelein 2 0.00% nhám 2 0.00% cho3ng 2 0.00% dudayev 2 0.00% khuy2nh 2 0.00% (huynh 2 0.00% wilbur 2 0.00% (tác 2 0.00% nành 2 0.00% ghe4 2 0.00% jatusripitak 2 0.00% cu5p 2 0.00% dác 2 0.00% (jetro 2 0.00% albar 2 0.00% hu’1ng 2 0.00% nikolai 2 0.00% bêu 2 0.00% khâ3y 2 0.00% su’4a 2 0.00% jamaica 2 0.00% 1a 2 0.00% xâ1y 2 0.00% vo’2i 2 0.00% hi4

2 0.00% chi3n 2 0.00% eureka 2 0.00% omar 2 0.00% garzelli 2 0.00% fillet 2 0.00% nhu’o’2ng 2 0.00% zuôi 2 0.00% afc 2 0.00% cent 2 0.00% gâ2u 2 0.00% babylift 2 0.00% cell 2 0.00% (phó 2 0.00% (petrolimex 2 0.00% ngoa(1c 2 0.00% mia 2 0.00% nu4ng 2 0.00% parkinson 2 0.00% lém 2 0.00% 258 2 0.00% co3… 2 0.00% 'bà 2 0.00% thành… 2 0.00% be5t 2 0.00% fahri 2 0.00% norodom 2 0.00% ro'3m 2 0.00% bo’m 2 0.00% yêng 2 0.00% pot 2 0.00% dessel 2 0.00% pol 2 0.00% ngan 2 0.00% ron 2 0.00% tè 2 0.00% calci 2 0.00% (phú 2 0.00% (l 2 0.00% dracula 2 0.00% (sóc 2 0.00% chi3a 2 0.00% disko 2 0.00% amazon 2 0.00% tskh 2 0.00% xoa5c 2 0.00% thoa 2 0.00% 11g30 2 0.00% tru’o’5t 2 0.00% kilner 2 0.00% ne5t 2 0.00% ddhsp 2 0.00% vê3nh

2 0.00% thu'o'1t 2 0.00% choa3ng 2 0.00% taddùm 2 0.00% perfect 2 0.00% montana 2 0.00% khâ1p 2 0.00% lu’2a 2 0.00% studios 2 0.00% ddã… 2 0.00% (vinaconex 2 0.00% nature 2 0.00% abdul 2 0.00% vô1c 2 0.00% toiba 2 0.00% 6kg 2 0.00% vo’5 2 0.00% thvn 2 0.00% (tq 2 0.00% aceh 2 0.00% blues 2 0.00% brosnan 2 0.00% che3 2 0.00% cyril 2 0.00% tu'2… 2 0.00% uruzgan 2 0.00% ngu’5c 2 0.00% pete 2 0.00% 8m 2 0.00% 885 2 0.00% 225 2 0.00% cm2 2 0.00% pictures 2 0.00% tâ3n 2 0.00% (1994 2 0.00% 000usd 2 0.00% pierce 2 0.00% ho'1p 2 0.00% jds 2 0.00% nhuê5 2 0.00% ngút 2 0.00% mo’2i 2 0.00% xoa3ng 2 0.00% gruzia 2 0.00% japan 2 0.00% huntington 2 0.00% fat 2 0.00% guevara 2 0.00% area 2 0.00% tuyê5t…nhu'ng 2 0.00% du’4 2 0.00% dagestan 2 0.00% casino

2 0.00% su'o'1t 2 0.00% petersen 2 0.00% fremont 2 0.00% giai3 2 0.00% ca5c 2 0.00% nguyen 2 0.00% (malaysia 2 0.00% (bên 1 0.00% (cctv 1 0.00% humberto 1 0.00% dô2i… 1 0.00% teneriffa 1 0.00% (nhiê5m 1 0.00% kuijt 1 0.00% dyk 1 0.00% skijder 1 0.00% genf 1 0.00% short 1 0.00% tiergarten 1 0.00% lanzaat 1 0.00% rám 1 0.00% servette 1 0.00% wollenberg 1 0.00% assogbavi 1 0.00% wangen 1 0.00% predrag 1 0.00% nhâ 1 0.00% stern 1 0.00% ddr 1 0.00% nicole 1 0.00% ngong 1 0.00% (love 1 0.00% luegde 1 0.00% bo'3i… 1 0.00% ddo'4… 1 0.00% mile 1 0.00% milesovic 1 0.00% rotern 1 0.00% (43 1 0.00% phomai 1 0.00% phòng…vv 1 0.00% tiesto 1 0.00% musadshon 1 0.00% moenchengladbach 1 0.00% míê1ng 1 0.00% age 1 0.00% (32 1 0.00% hamed 1 0.00% hammouda 1 0.00% câ4ng 1 0.00% gaojun 1 0.00% snijder

1 0.00% locarno 1 0.00% belgrad 1 0.00% osasuna 1 0.00% westbam 1 0.00% monetegro 1 0.00% francileudo 1 0.00% rangers 1 0.00% ice 1 0.00% tashkent 1 0.00% pamplona 1 0.00% carthage 1 0.00% chu'n 1 0.00% ulkrine 1 0.00% luo'1i 1 0.00% bu3n 1 0.00% (380 1 0.00% casillas 1 0.00% arabien 1 0.00% tunesia 1 0.00% 46000 1 0.00% dempsey 1 0.00% convey 1 0.00% reyna 1 0.00% dà… 1 0.00% jorge 1 0.00% larrionda 1 0.00% donovan 1 0.00% dê2n 1 0.00% ru'o'ng 1 0.00% russol 1 0.00% walt 1 0.00% depp 1 0.00% gussin 1 0.00% worobej 1 0.00% reng 1 0.00% nesmatschni 1 0.00% (55 1 0.00% pernía 1 0.00% (232 1 0.00% sergio 1 0.00% nemo 1 0.00% (77 1 0.00% fabregas 1 0.00% senna 1 0.00% ky4… 1 0.00% bocanegra 1 0.00% jankulovski 1 0.00% rozehnal 1 0.00% tschechien 1 0.00% grygera 1 0.00% poborsky 1 0.00% rosicky

1 0.00% ivory 1 0.00% galasek 1 0.00% koeln 1 0.00% poulat 1 0.00% (frankreich 1 0.00% hmm… 1 0.00% hashemian 1 0.00% marlon 1 0.00% superstar 1 0.00% ivankovic 1 0.00% ferydoon 1 0.00% thuo'3ng 1 0.00% ui… 1 0.00% 3… 1 0.00% amoah 1 0.00% bing 1 0.00% onyewu 1 0.00% zaccardo 1 0.00% cherundolo 1 0.00% appiah 1 0.00% kingston 1 0.00% paintsil 1 0.00% plasil 1 0.00% lokvenc 1 0.00% keenen 1 0.00% addo 1 0.00% mensah 1 0.00% illiasu 1 0.00% jelen 1 0.00% thich 1 0.00% radomski 1 0.00% szymkowiak 1 0.00% (90 1 0.00% hu'2m…me5 1 0.00% smolarek 1 0.00% goal 1 0.00% bosacki 1 0.00% wanderers 1 0.00% (bù 1 0.00% sami 1 0.00% jaber 1 0.00% bak 1 0.00% lewandowski 1 0.00% arabi 1 0.00% zewlakow 1 0.00% harwick 1 0.00% dale 1 0.00% ahle 1 0.00% buender 1 0.00% hansa 1 0.00% rostock 1 0.00% lu'õng

1 0.00% (bé 1 0.00% holsen 1 0.00% vù… 1 0.00% minnie 1 0.00% (75 1 0.00% kohl 1 0.00% cô… 1 0.00% buende 1 0.00% haas 1 0.00% kiê1n…se4 1 0.00% yaser 1 0.00% (47' 1 0.00% tunesien 1 0.00% rusol 1 0.00% reng… 1 0.00% jaidi 1 0.00% haggui 1 0.00% boumnijel 1 0.00% trabelsi 1 0.00% (schweiz 1 0.00% schelajew 1 0.00% rotan 1 0.00% timoschuk 1 0.00% gussew 1 0.00% woronin 1 0.00% busacca 1 0.00% (64 1 0.00% rebrow 1 0.00% khariri 1 0.00% noor 1 0.00% sulimani 1 0.00% ghamdi 1 0.00% aribia 1 0.00% zied 1 0.00% temyat 1 0.00% (australien 1 0.00% montashari 1 0.00% bouazizi 1 0.00% chedli 1 0.00% jemmali 1 0.00% mnari 1 0.00% dokhi 1 0.00% fallatah 1 0.00% chikhaoui 1 0.00% zaid 1 0.00% movemement 1 0.00% varanasi 1 0.00% islamic 1 0.00% (let 1 0.00% (student 1 0.00% bangalore 1 0.00% 'chúng

1 0.00% ngu'o'i' 1 0.00% sahni 1 0.00% hyderabad 1 0.00% ajay 1 0.00% srinagar 1 0.00% saniora 1 0.00% tf1 1 0.00% fuad 1 0.00% baseyev 1 0.00% (phong 1 0.00% konstantin 1 0.00% alex 1 0.00% perry 1 0.00% phu’1 1 0.00% chaaa… 1 0.00% ddô1ì 1 0.00% bibhu 1 0.00% gujarat 1 0.00% umar 1 0.00% (bjp 1 0.00% bharatiya 1 0.00% janata 1 0.00% pinôkiôôô… 1 0.00% fitzpatrick 1 0.00% taepodong 1 0.00% hawai 1 0.00% churchgate 1 0.00% teapodong 1 0.00% mahal 1 0.00% ak 1 0.00% aurangabad 1 0.00% maharashtra 1 0.00% prasad 1 0.00% routray 1 0.00% malegaon 1 0.00% (taliban 1 0.00% taj 1 0.00% ningxia 1 0.00% maharashtran 1 0.00% zhongwei 1 0.00% chenchnya 1 0.00% bsc 1 0.00% pha(1t 1 0.00% hertha 1 0.00% nhu'ng…oaaa5ch…câ5u 1 0.00% medic 1 0.00% ðiê2u 1 0.00% xo'1t 1 0.00% (03 1 0.00% asamoah 1 0.00% vfl 1 0.00% wolfsburg

1 0.00% hahahaha…ông 1 0.00% hernández 1 0.00% víctor 1 0.00% boladdos 1 0.00% (comunicaciones 1 0.00% cristian 1 0.00% saborío 1 0.00% hildebrand 1 0.00% timo 1 0.00% ca3… 1 0.00% ddu'o'5cvào 1 0.00% novotny 1 0.00% latvia 1 0.00% u5c…u5c…u5c…lu'2a 1 0.00% heather 1 0.00% (feer 1 0.00% balbina 1 0.00% hwang 1 0.00% u5u5u5u5ccc 1 0.00% u'u'1u 1 0.00% huo'1ng 1 0.00% 15000 1 0.00% mcalary 1 0.00% ngo'2…pinôkiô 1 0.00% da5…da5…thu'a 1 0.00% fsv 1 0.00% mainz 1 0.00% enke 1 0.00% ku'ln 1 0.00% jrgen 1 0.00% owomoyela 1 0.00% suyê4n 1 0.00% triê4n 1 0.00% freier 1 0.00% fabian 1 0.00% cu’o’4ng 1 0.00% gìu'4 1 0.00% pound 1 0.00% diyala 1 0.00% tho’ 1 0.00% assessment 1 0.00% su'o'1ng… 1 0.00% qu3y 1 0.00% zarqawi'' 1 0.00% zarqo 1 0.00% ''ddu'2ng 1 0.00% rohan 1 0.00% gunaratna 1 0.00% nhu'ng…bóng 1 0.00% xu’o’ng 1 0.00% shiites 1 0.00% arav

1 0.00% sunna 1 0.00% ddiê3u 1 0.00% bo5ng 1 0.00% fudan 1 0.00% nawaf 1 0.00% alani 1 0.00% oàm 1 0.00% mustafa 1 0.00% amman 1 0.00% sayel 1 0.00% khalayleh 1 0.00% maliki 1 0.00% khalilzad 1 0.00% nuri 1 0.00% qudama 1 0.00% báy 1 0.00% mali 1 0.00% ayatollah 1 0.00% sirry 1 0.00% (sa3n 1 0.00% zalmay 1 0.00% “dreamliner” 1 0.00% lâ2m'' 1 0.00% nhu5y 1 0.00% kalaylah 1 0.00% zarqa 1 0.00% ''cuô5c 1 0.00% cao'' 1 0.00% laden'' 1 0.00% ''mô1i 1 0.00% du'1t'' 1 0.00% (ohio 1 0.00% jakarka 1 0.00% eltsin 1 0.00% (sco 1 0.00% omega 1 0.00% (bo5n 1 0.00% lukin 1 0.00% kazhakstan 1 0.00% karzai 1 0.00% tufts 1 0.00% khem 1 0.00% con…ra 1 0.00% biê3n…thì 1 0.00% thu’o’1c 1 0.00% yokosuka 1 0.00% mujahideen 1 0.00% cha…cha 1 0.00% trans 1 0.00% lu’o’2ng 1 0.00% shura 1 0.00% transfat

1 0.00% calori 1 0.00% lavrov 1 0.00% caucase 1 0.00% serguei 1 0.00% venezeula 1 0.00% “nguyên 1 0.00% ô5p…xin 1 0.00% kreee5tt…kreee5tt 1 0.00% tâ5n…có 1 0.00% whitlock 1 0.00% wilkinson 1 0.00% tagesspiegel 1 0.00% guido 1 0.00% uhrlau 1 0.00% do’i 1 0.00% scotland 1 0.00% congo 1 0.00% euardo 1 0.00% pearson 1 0.00% 760 1 0.00% uo'1c 1 0.00% ngo'2…cha 1 0.00% dias 1 0.00% mbeki 1 0.00% sadec 1 0.00% thabo 1 0.00% sinopec 1 0.00% ta…la5i 1 0.00% aimar 1 0.00% keita 1 0.00% 101 1 0.00% lúc15h00 1 0.00% (34t 1 0.00% akale 1 0.00% pacorét 1 0.00% nâ1ng 1 0.00% (belgien 1 0.00% dindane 1 0.00% bleeckere 1 0.00% diê3n 1 0.00% swen 1 0.00% goeren 1 0.00% gerard 1 0.00% aldo 1 0.00% bobadila 1 0.00% bekham 1 0.00% dotrmund 1 0.00% mât 1 0.00% bavìere 1 0.00% (co'4 1 0.00% grad 1 0.00% chuyê5n…vo'1i

1 0.00% hagreves 1 0.00% …ban 1 0.00% và…ngón 1 0.00% vali 1 0.00% vu4…nhu'ng 1 0.00% avery 1 0.00% mành 1 0.00% glenn 1 0.00% mâ1t… 1 0.00% shamsul 1 0.00% maidin 1 0.00% agustin 1 0.00% katei 1 0.00% ca1c 1 0.00% tu'5c 1 0.00% su5ng 1 0.00% ivoira 1 0.00% chuy5ê5n 1 0.00% berenguel 1 0.00% tenorio 1 0.00% pressing 1 0.00% worls 1 0.00% fring 1 0.00% 28t 1 0.00% dunbo 1 0.00% …và 1 0.00% to… 1 0.00% hê5n 1 0.00% nghành 1 0.00% ro'1i 1 0.00% hu'4a 1 0.00% balan 1 0.00% 40m2 1 0.00% schweigsteiger 1 0.00% ma5cworld 1 0.00% ddô5ng( 1 0.00% (mô 1 0.00% này…nhu'ng 1 0.00% trinida 1 0.00% 15min 1 0.00% heitiga 1 0.00% boa(n 1 0.00% dumbo…ông 1 0.00% chuô2ng…xóm 1 0.00% tai…nhu'ng 1 0.00% (cu'3a 1 0.00% vinaxad 1 0.00% xiê1c…và 1 0.00% (lý 1 0.00% franfurt 1 0.00% (mexiko 1 0.00% riveros

1 0.00% cha(mpa 1 0.00% 48000 1 0.00% paramay 1 0.00% justo 1 0.00% koch 1 0.00% hessen 1 0.00% roland 1 0.00% acudda 1 0.00% caniza 1 0.00% cáceres 1 0.00% bobadilla 1 0.00% (56 1 0.00% (ra 1 0.00% hê2…thê1 1 0.00% cuevas 1 0.00% t'ru'ng 1 0.00% bonet 1 0.00% toledo 1 0.00% …mo5c 1 0.00% hislop 1 0.00% coata 1 0.00% (deportivo 1 0.00% riaca 1 0.00% und 1 0.00% guaatemala 1 0.00% saprissa 1 0.00% calcio 1 0.00% paolo 1 0.00% brescia 1 0.00% aek 1 0.00% athen 1 0.00% mexiko 1 0.00% eckel 1 0.00% ottmar 1 0.00% bern 1 0.00% urus 1 0.00% haessler 1 0.00% showmaster 1 0.00% sa5ng 1 0.00% umberto 1 0.00% stoiber 1 0.00% gottschalk 1 0.00% edmund 1 0.00% 29t 1 0.00% umadda 1 0.00% badilla 1 0.00% rodríguez 1 0.00% pont 1 0.00% harold 1 0.00% (real 1 0.00% randall 1 0.00% azofeifa

1 0.00% (brujas 1 0.00% nô1c 1 0.00% lake 1 0.00% drummond 1 0.00% fayed 1 0.00% dodi 1 0.00% lê1u 1 0.00% malaga 1 0.00% to5c 1 0.00% wardy 1 0.00% italien 1 0.00% jervis 1 0.00% (brescia 1 0.00% alfaro 1 0.00% d'alma 1 0.00% lâ2n… 1 0.00% torrado 1 0.00% volpe 1 0.00% gerardo 1 0.00% này…con 1 0.00% perez 1 0.00% yahya 1 0.00% mateus 1 0.00% sun 1 0.00% nuremberg 1 0.00% mario 1 0.00% mendez 1 0.00% daei 1 0.00% (1930 1 0.00% hernan 1 0.00% (1978 1 0.00% tru'1 1 0.00% allback 1 0.00% 30m 1 0.00% panhproren 1 0.00% rô2i…tách 1 0.00% jared 1 0.00% mohammad 1 0.00% aliabadi 1 0.00% su'o'5t 1 0.00% lá… 1 0.00% (cdu 1 0.00% caradec'h 1 0.00% lady 1 0.00% mez 1 0.00% (blatter 1 0.00% pele 1 0.00% alcides 1 0.00% schiffer 1 0.00% main 1 0.00% claudia 1 0.00% slowakei

1 0.00% investigation 1 0.00% bsg 1 0.00% chemnitz 1 0.00% goerlitz 1 0.00% (ddr 1 0.00% motro 1 0.00% bert 1 0.00% marwijk 1 0.00% bortmund 1 0.00% fulda 1 0.00% freiburg 1 0.00% khatibi 1 0.00% thiê3m 1 0.00% (amua 1 0.00% so3i…châ1t 1 0.00% qomolangma 1 0.00% 848 1 0.00% 464 1 0.00% (551 1 0.00% 479 1 0.00% (468 1 0.00% 425 1 0.00% (1012 1 0.00% 1068 1 0.00% karakorum 1 0.00% choán 1 0.00% kirghizstan 1 0.00% tadjikistan 1 0.00% hu'o'mg 1 0.00% (giam 1 0.00% (ddàn 1 0.00% 541 1 0.00% 806 1 0.00% altai 1 0.00% kenvin 1 0.00% sikkim 1 0.00% boutan 1 0.00% 1799 1 0.00% ddây…là…nhà…mo'1i…cu3a…tôi 1 0.00% 1325 1 0.00% trach 1 0.00% (ba(1t 1 0.00% (1711 1 0.00% ta(2n 1 0.00% 28 1 0.00% go'2n 1 0.00% 1650 1 0.00% 1100 1 0.00% sè 1 0.00% nghiêu 1 0.00% (369 1 0.00% 286

1 0.00% (298 1 0.00% 376 1 0.00% (372 1 0.00% 289 1 0.00% coaster 1 0.00% (1911 1 0.00% roller 1 0.00% (280 1 0.00% 233 1 0.00% (1644 1 0.00% ad 1 0.00% liê1p 1 0.00% plat 1 0.00% xa( 1 0.00% ve3o 1 0.00% (ma(5t 1 0.00% cu'a3 1 0.00% a(2ng 1 0.00% a(5c 1 0.00% flat 1 0.00% tâ1y 1 0.00% buô5cpha3i 1 0.00% (u'u 1 0.00% (1973 1 0.00% merriman 1 0.00% (1951 1 0.00% bay( 1 0.00% hiê1m( 1 0.00% ngâ5y 1 0.00% (ddi5a 1 0.00% vu'2ng…cuô1i 1 0.00% 135 1 0.00% so'n( 1 0.00% va(ntthu'o'ng 1 0.00% (bút 1 0.00% tu'1cthích 1 0.00% mary 1 0.00% (xì 1 0.00% elizabeth 1 0.00% (tu'2ng 1 0.00% khoé 1 0.00% (nha(2m 1 0.00% (im 1 0.00% ddo5at 1 0.00% _lô1i 1 0.00% _bu'2a 1 0.00% gích 1 0.00% vãy 1 0.00% winsread 1 0.00% bec 1 0.00% tiên(19 1 0.00% diê1n

1 0.00% 72003 1 0.00% ddê1ch 1 0.00% viêc 1 0.00% (ddánh 1 0.00% (dày 1 0.00% ternet 1 0.00% háp 1 0.00% (xa3y 1 0.00% chiê1p…ddàn 1 0.00% ghpgvnt 1 0.00% ttpgqt 1 0.00% thu'o5ng 1 0.00% to3n 1 0.00% o…o 1 0.00% chu5i 1 0.00% (vu'2a 1 0.00% thu'3a 1 0.00% louise 1 0.00% arbour 1 0.00% (heo 1 0.00% tóc…êm 1 0.00% xói 1 0.00% vê3 1 0.00% philipine 1 0.00% khóat 1 0.00% pakisstan 1 0.00% lu'5oc 1 0.00% dduô4n 1 0.00% (sa(1c 1 0.00% irland 1 0.00% cashmir 1 0.00% ma(t 1 0.00% dô5c 1 0.00% gém 1 0.00% loè 1 0.00% ca5p… 1 0.00% tép… 1 0.00% thi3ng 1 0.00% 100usd 1 0.00% nauy 1 0.00% bu'2a… 1 0.00% goa5i 1 0.00% gmail 1 0.00% hotmail 1 0.00% hushmail 1 0.00% vôstô1c 1 0.00% (bê2n 1 0.00% trành 1 0.00% gô 1 0.00% ngoâ3y 1 0.00% 4gio'2 1 0.00% (cu'1

1 0.00% rio'2ng 1 0.00% ê2nh 1 0.00% vladdi 1 0.00% (thu'2a 1 0.00% (495 1 0.00% (ca5nh 1 0.00% 700m3 1 0.00% sóat 1 0.00% (mâ4u 1 0.00% (huangling 1 0.00% 208 1 0.00% ha5p 1 0.00% 2030 1 0.00% (2050 1 0.00% (lenovo 1 0.00% 463 1 0.00% nhôm 1 0.00% (67 1 0.00% (shibaozhai 1 0.00% rasheed 1 0.00% (viê5c 1 0.00% (sars 1 0.00% thúe 1 0.00% kidwai 1 0.00% cnocc 1 0.00% (danjiangkou 1 0.00% 000km2 1 0.00% sanxia 1 0.00% loa3ng 1 0.00% 782km 1 0.00% rajiv 1 0.00% 443 1 0.00% do'3m 1 0.00% subhash 1 0.00% (600 1 0.00% lo'4n 1 0.00% xuezhong 1 0.00% monroe 1 0.00% chóe 1 0.00% okinawa 1 0.00% kapila 1 0.00% napoléon 1 0.00% mearsheimer 1 0.00% (350 1 0.00% gini 1 0.00% elit 1 0.00% manohar 1 0.00% (fdi 1 0.00% kwong 1 0.00% jagmohan 1 0.00% di3nh 1 0.00% dúa

1 0.00% (2467 1 0.00% rui 1 0.00% murli 1 0.00% sunanda 1 0.00% huyndai 1 0.00% ra…sau 1 0.00% tráp 1 0.00% anbani 1 0.00% kho3ng 1 0.00% (xâ1p 1 0.00% (1990 1 0.00% 67b 1 0.00% nguòi 1 0.00% thie3u 1 0.00% indonêxia 1 0.00% da3ng 1 0.00% tu'o3ng 1 0.00% liê3ng 1 0.00% (nòng 1 0.00% phâ3n 1 0.00% (cô3 1 0.00% ma(2n 1 0.00% papatacci 1 0.00% razzi 1 0.00% (mãi 1 0.00% nho'4n 1 0.00% xiê3ng 1 0.00% trie3n 1 0.00% núi…vo'5 1 0.00% thiê1t…tha(2ng 1 0.00% nhà…vo'5 1 0.00% engiô2 1 0.00% bô2m 1 0.00% bé…nó 1 0.00% 1001 1 0.00% (phóng 1 0.00% nhà…chu'a 1 0.00% vu'2ng…ddê3 1 0.00% voi…ddê1n 1 0.00% xê2nh 1 0.00% ma5ng… 1 0.00% (rô5ng 1 0.00% hungari 1 0.00% vê2…ông 1 0.00% vttm 1 0.00% dô4ng 1 0.00% dê1…o'3 1 0.00% ddie3m 1 0.00% chê1t…vu'2a 1 0.00% suý 1 0.00% lói 1 0.00% petofi

1 0.00% vua…nhà 1 0.00% (vo'3 1 0.00% cuô5c…nhu'ng 1 0.00% _ha5n 1 0.00% erin 1 0.00% cu…cù 1 0.00% t4 1 0.00% ''xét 1 0.00% dda3ng'' 1 0.00% priscila 1 0.00% vê4nh 1 0.00% lo3a 1 0.00% 180ô3 1 0.00% u5t 1 0.00% perales 1 0.00% odessa 1 0.00% chu'i3 1 0.00% ''tiê1p 1 0.00% dân'' 1 0.00% kléber 1 0.00% ngô4 1 0.00% ha(nh 1 0.00% tô5 1 0.00% 75m 1 0.00% i5t 1 0.00% ''quô1c 1 0.00% na5n'' 1 0.00% qúai 1 0.00% bureaux 1 0.00% mosad 1 0.00% bye 1 0.00% na(m…mô5t 1 0.00% hocova 1 0.00% deuxième 1 0.00% ngô4ng… 1 0.00% ubmttq 1 0.00% ddá…cha(3ng 1 0.00% qua(5p 1 0.00% dcch 1 0.00% audio 1 0.00% dê1…lúc 1 0.00% viê3n 1 0.00% vông 1 0.00% nguyê4nva(n 1 0.00% i5t… 1 0.00% hotel 1 0.00% wilshire 1 0.00% (cha(3ng 1 0.00% châ1y 1 0.00% cung…ngài 1 0.00% báothanh 1 0.00% 6h30'

1 0.00% (ddô1t 1 0.00% tbd 1 0.00% (hâ5u 1 0.00% gô5t 1 0.00% chõ 1 0.00% pgqt 1 0.00% (lau 1 0.00% alessandrie 1 0.00% ché 1 0.00% la… 1 0.00% riff 1 0.00% chiêng… 1 0.00% inddonêxia 1 0.00% (cho'i 1 0.00% wonder 1 0.00% stevie 1 0.00% redding 1 0.00% hddndtp 1 0.00% (thu' 1 0.00% cat 1 0.00% racism 1 0.00% (di4 1 0.00% demo 1 0.00% otis 1 0.00% 2700002 1 0.00% green 1 0.00% undiscovered 1 0.00% 822 1 0.00% (tám 1 0.00% nalthan 1 0.00% (mùa 1 0.00% luy 1 0.00% (thâ1p 1 0.00% (kìm 1 0.00% give 1 0.00% something 1 0.00% (vu5 1 0.00% (tra3i 1 0.00% tê3nh 1 0.00% htx 1 0.00% warwickshire 1 0.00% rugby 1 0.00% suraibaia 1 0.00% inddônêxia 1 0.00% guitar 1 0.00% itunes 1 0.00% hot 1 0.00% tây… 1 0.00% teenpop 1 0.00% (oda 1 0.00% xa(2ng 1 0.00% martini

1 0.00% (ddâ2y 1 0.00% si5n 1 0.00% niê1t 1 0.00% die 1 0.00% cu'1t 1 0.00% rô1t…thu'1 1 0.00% traviêncu3a 1 0.00% (la5ng 1 0.00% connery 1 0.00% quý… 1 0.00% dr 1 0.00% cm3 1 0.00% nhom 1 0.00% cung( 1 0.00% giu 1 0.00% luô1t 1 0.00% another 1 0.00% rê 1 0.00% bâ1n 1 0.00% (6m2 1 0.00% 1200cm3 1 0.00% mõ 1 0.00% cá…ddê2u 1 0.00% (cu' 1 0.00% kinks 1 0.00% chu'ong 1 0.00% stevens 1 0.00% la…lá 1 0.00% 123 1 0.00% 298 1 0.00% tphn 1 0.00% làthu'o'ng 1 0.00% (qh 1 0.00% (1767 1 0.00% 323 1 0.00% ddbqh 1 0.00% do'5 1 0.00% hddxx 1 0.00% lõm 1 0.00% sony 1 0.00% núm 1 0.00% dô3i 1 0.00% ty2 1 0.00% thàng 1 0.00% se3… 1 0.00% rù 1 0.00% du'o'1í 1 0.00% broccoli 1 0.00% vnd 1 0.00% xô3m 1 0.00% nính 1 0.00%

1 0.00% flash 1 0.00% garragher 1 0.00% felipe 1 0.00% juninho 1 0.00% kaka 1 0.00% và…cô 1 0.00% éo 1 0.00% chuôm 1 0.00% ze 1 0.00% postiga 1 0.00% bàn…uy2nh 1 0.00% gallery 1 0.00% (nghi3 1 0.00% bàn…cô 1 0.00% và…thâ1y 1 0.00% bo'5t 1 0.00% cha(4n 1 0.00% (62 1 0.00% wolrd 1 0.00% cái…thâ5t 1 0.00% và…toét 1 0.00% trin 1 0.00% publications 1 0.00% duesseldorf 1 0.00% zlatan 1 0.00% kallstroem 1 0.00% do… 1 0.00% lars 1 0.00% daylights 1 0.00% senegal 1 0.00% isaksson 1 0.00% turin 1 0.00% (arsenal 1 0.00% (fc 1 0.00% (larson 1 0.00% co'3 1 0.00% lúc17h00 1 0.00% thê3…ba(2ng 1 0.00% medina 1 0.00% tuyê5n 1 0.00% cafú 1 0.00% giùi 1 0.00% ga(5y 1 0.00% henrik 1 0.00% quí… 1 0.00% kia…và 1 0.00% 17h00 1 0.00% mönchengladbach 1 0.00% xoàm 1 0.00% ddo'5i…cho'2 1 0.00% o'p 1 0.00% gio

1 0.00% (bor 1 0.00% ho'm 1 0.00% úa 1 0.00% costinho 1 0.00% mu'1c… 1 0.00% ang 1 0.00% moócgian 1 0.00% alibaba 1 0.00% ru'o'1n 1 0.00% xô1ng 1 0.00% diva 1 0.00% (bô2 1 0.00% schumacher 1 0.00% klinmann 1 0.00% ò…ó… 1 0.00% scheinsteiger 1 0.00% con…nó 1 0.00% choè 1 0.00% amato' 1 0.00% oóc 1 0.00% (20t 1 0.00% (22t 1 0.00% 17m 1 0.00% 882 1 0.00% phô3ng 1 0.00% (21t 1 0.00% và…ky2 1 0.00% bay…alice 1 0.00% iaquinta 1 0.00% 25t 1 0.00% hit 1 0.00% 41m 1 0.00% díp 1 0.00% xú 1 0.00% thê1ch 1 0.00% lever 1 0.00% sabrosa 1 0.00% ro5t 1 0.00% phê2u 1 0.00% vô3 1 0.00% nomez 1 0.00% 974 1 0.00% dove 1 0.00% rô5p 1 0.00% zdf 1 0.00% này…mày 1 0.00% tho'1t… 1 0.00% biana 1 0.00% ã 1 0.00% (huy 1 0.00% sa(2ng 1 0.00% vhnt

1 0.00% lê4… 1 0.00% composer 1 0.00% na3o 1 0.00% rockband 1 0.00% berklee 1 0.00% nghê3n 1 0.00% ddo3…con 1 0.00% bu'o'1c…và 1 0.00% (italien 1 0.00% (50 1 0.00% nsnd 1 0.00% vukic 1 0.00% (41 1 0.00% (78 1 0.00% (88 1 0.00% argentia 1 0.00% rocker 1 0.00% (31 1 0.00% 183 1 0.00% ddi…thê1 1 0.00% prada 1 0.00% cô5te 1 0.00% ruud 1 0.00% (ba3ng 1 0.00% devil 1 0.00% taymoorian 1 0.00% giáo… 1 0.00% bakhtiarizadeh 1 0.00% rezaei 1 0.00% wears 1 0.00% (kolumbien 1 0.00% xe3o 1 0.00% friends 1 0.00% (chung 1 0.00% (nsu't 1 0.00% ra(1c…ra(1c 1 0.00% nhãi 1 0.00% tu'o'1ng…lão 1 0.00% ba(1n…không 1 0.00% ruiz 1 0.00% romaric 1 0.00% nu'4a…tâ1t 1 0.00% cody… 1 0.00% 1908 1 0.00% barkey 1 0.00% và…không 1 0.00% santis 1 0.00% andreas 1 0.00% herren 1 0.00% hôm… 1 0.00% sicily 1 0.00% llona

1 0.00% a5… 1 0.00% giuô5c 1 0.00% ddu'u'5c 1 0.00% massimo 1 0.00% living 1 0.00% (vua 1 0.00% octopussy 1 0.00% olaf 1 0.00% mellberg 1 0.00% (aston 1 0.00% warning 1 0.00% nu'o'2i 1 0.00% sepp 1 0.00% royale 1 0.00% (sweden 1 0.00% early 1 0.00% dudic 1 0.00% tàm 1 0.00% (hô2 1 0.00% cói 1 0.00% messy 1 0.00% moóc 1 0.00% sâ3m 1 0.00% guô5c 1 0.00% ergic 1 0.00% thuyê5t 1 0.00% moócgan 1 0.00% ddên 1 0.00% thín 1 0.00% trâ5u 1 0.00% (origami 1 0.00% poster 1 0.00% cosply 1 0.00% múp 1 0.00% trê3 1 0.00% vâ2y 1 0.00% tâ2n… 1 0.00% cambassio 1 0.00% ddâ1y… 1 0.00% 22g30 1 0.00% nha5n 1 0.00% liê1u 1 0.00% lau'1 1 0.00% (khánh 1 0.00% 'tru'o'1c 1 0.00% litte 1 0.00% 'vâng 1 0.00% lg 1 0.00% 11x 1 0.00% xu' 1 0.00% 318 1 0.00% nho'u'1

1 0.00% tînh 1 0.00% quo3 1 0.00% rfa 1 0.00% (ma(ng 1 0.00% da5' 1 0.00% (xe 1 0.00% dda(5tc 1 0.00% ry 1 0.00% li3u 1 0.00% muô1n…ddu'o'5c 1 0.00% 50c 1 0.00% (em 1 0.00% (lúc 1 0.00% singapo 1 0.00% ne 1 0.00% kovaliev 1 0.00% lutmila 1 0.00% alekseeva 1 0.00% pais 1 0.00% moskva 1 0.00% dupuis 1 0.00% santoro 1 0.00% chez 1 0.00% chiristophe 1 0.00% ddô2ng… 1 0.00% snelling 1 0.00% sato 1 0.00% to'n 1 0.00% 297 1 0.00% thcand 1 0.00% hrw 1 0.00% 293 1 0.00% (cu5c 1 0.00% 296 1 0.00% (houston 1 0.00% nia 1 0.00% oa 1 0.00% cjp 1 0.00% inter 1 0.00% grôve 1 0.00% truy5 1 0.00% ba5ch…nhu' 1 0.00% tíê1c 1 0.00% pum18 1 0.00% airline 1 0.00% ngoa 1 0.00% 13h30 1 0.00% tiê3ng 1 0.00% tím…xen 1 0.00% ho5e 1 0.00% ga(p 1 0.00% 19h30

1 0.00% vietsopetro 1 0.00% hacker 1 0.00% danchimviet 1 0.00% nguoivietonline 1 0.00% kung 1 0.00% (vi5t 1 0.00% (bruce 1 0.00% ykien 1 0.00% riê1u 1 0.00% seaprodex 1 0.00% vietnamexodus 1 0.00% vietland 1 0.00% doithoai 1 0.00% (trích 1 0.00% kontum 1 0.00% 'con' 1 0.00% pctt 1 0.00% (giáo 1 0.00% (thông 1 0.00% 'cái' 1 0.00% sa(m 1 0.00% 16h30 1 0.00% praha 1 0.00% ly2 1 0.00% phôtocopy 1 0.00% tgcp 1 0.00% vu'3a 1 0.00% tre5 1 0.00% toài 1 0.00% nha(4ng 1 0.00% 22h15 1 0.00% dôi 1 0.00% nhô1 1 0.00% 7h30 1 0.00% 310 1 0.00% ddu4ng 1 0.00% (cây 1 0.00% u'o'm 1 0.00% (quan 1 0.00% 1931 1 0.00% home 1 0.00% môtip 1 0.00% li5ch' 1 0.00% 'tiê2n 1 0.00% (thô 1 0.00% zinedane 1 0.00% rivery 1 0.00% meterazzi 1 0.00% trê2 1 0.00% piereo 1 0.00% giu5i 1 0.00% ghè

1 0.00% tro'2i… 1 0.00% là…tru'o'1c 1 0.00% (right 1 0.00% bonneur 1 0.00% (ngoa5i 1 0.00% tho3… 1 0.00% ro'm… 1 0.00% to5p 1 0.00% olympiastadion 1 0.00% 11mét 1 0.00% không… 1 0.00% nghê5ch 1 0.00% 1938 1 0.00% (08 1 0.00% kha´ 1 0.00% shiva 1 0.00% duô3i 1 0.00% mayer 1 0.00% vorfelder 1 0.00% platini 1 0.00% waluyo 1 0.00% (dfb 1 0.00% agus 1 0.00% ti´ch 1 0.00% ti´nh 1 0.00% giê4u 1 0.00% ly´ 1 0.00% (23t 1 0.00% kha´c 1 0.00% ca´ 1 0.00% pha´p 1 0.00% tha´i 1 0.00% co´ 1 0.00% ddâ´u 1 0.00% no´i 1 0.00% kê´t 1 0.00% marciel 1 0.00% (truyê2n 1 0.00% myu'2 1 0.00% sara 1 0.00% whyatt 1 0.00% scot 1 0.00% 904 1 0.00% 665 1 0.00% comité 1 0.00% cou'5 1 0.00% gopydtbcct2006 1 0.00% ddê1n… 1 0.00% magyar 1 0.00% vâ1u 1 0.00% matsco'va 1 0.00% amor

1 0.00% scher 1 0.00% elijah 1 0.00% cummigs 1 0.00% equya 1 0.00% ddor 1 0.00% quan… 1 0.00% voice 1 0.00% vietbao 1 0.00% 144 1 0.00% la(2n… 1 0.00% nxy 1 0.00% cncs 1 0.00% tra3y 1 0.00% thoán 1 0.00% bu5i… 1 0.00% giào 1 0.00% (1948 1 0.00% nhép 1 0.00% (nghi5 1 0.00% (ngo 1 0.00% nhóp 1 0.00% tru'o'1c(5 1 0.00% 91 1 0.00% 7336 1 0.00% vu'o'2n… 1 0.00% droits 1 0.00% l'homme 1 0.00% 48 1 0.00% si4nh 1 0.00% xiên 1 0.00% dângtàu 1 0.00% nay(30 1 0.00% sê2nh 1 0.00% sê5ch 1 0.00% quô1c' 1 0.00% vo'5' 1 0.00% mia3 1 0.00% choa 1 0.00% xo5ach 1 0.00% 'cuô5c 1 0.00% soa5ng 1 0.00% gia5y 1 0.00% 'giá 1 0.00% nghiá 1 0.00% róckét 1 0.00% ddâù 1 0.00% carbonic 1 0.00% co5a3ng 1 0.00% tri5â3 1 0.00% nck 1 0.00% muá 1 0.00% so'4…tu5i

1 0.00% 1886 1 0.00% lu'5i4c 1 0.00% te3o 1 0.00% (ptth 1 0.00% 6000 1 0.00% xuât 1 0.00% vietbooks 1 0.00% huâ5n 1 0.00% lu'a3 1 0.00% afula 1 0.00% nu'o'1cviê5t 1 0.00% nazareth 1 0.00% ddâ5n 1 0.00% hóp 1 0.00% lúy 1 0.00% ké 1 0.00% tripoli 1 0.00% hoa(2n 1 0.00% job 1 0.00% saìgon 1 0.00% vdc 1 0.00% 'qua3n 1 0.00% qualitex 1 0.00% sida 1 0.00% lu'a5 1 0.00% kha5o 1 0.00% bu'o'i 1 0.00% huyn 1 0.00% nhu'n 1 0.00% parks 1 0.00% 'ông 1 0.00% hoàng' 1 0.00% e3o 1 0.00% luâ5n' 1 0.00% co'5t 1 0.00% “biê1u 1 0.00% chu'á 1 0.00% hâ5p 1 0.00% 'bình 1 0.00% xén” 1 0.00% sia3 1 0.00% 'nãy 1 0.00% heo' 1 0.00% 'tâ1n 1 0.00% công' 1 0.00% luxembourg 1 0.00% no'1p 1 0.00% lòe 1 0.00% chu'5ng 1 0.00% xia3 1 0.00% “có 1 0.00% (1964

1 0.00% 'cà 1 0.00% sex' 1 0.00% 7h 1 0.00% ddiê3m” 1 0.00% nha(ng 1 0.00% thoóc 1 0.00% hanoinet 1 0.00% le5o 1 0.00% kê1p 1 0.00% sangin 1 0.00% nhách 1 0.00% bus 1 0.00% chô3 1 0.00% nylong 1 0.00% buà 1 0.00% 1400 1 0.00% lu'à 1 0.00% xôm 1 0.00% hànôinet 1 0.00% 312 1 0.00% helmang 1 0.00% 'ngày 1 0.00% hðba 1 0.00% mo5p 1 0.00% u5 1 0.00% siva 1 0.00% yesterday 1 0.00% marelli 1 0.00% children's 1 0.00% (adelaide 1 0.00% sàigon 1 0.00% terrazas 1 0.00% wuterich 1 0.00% waleed 1 0.00% anbar 1 0.00% baluchistan 1 0.00% sinna 1 0.00% taher 1 0.00% quetta 1 0.00% hammurabi 1 0.00% younis 1 0.00% ayed 1 0.00% briones 1 0.00% euphrates 1 0.00% mccormack 1 0.00% dover 1 0.00% nightline 1 0.00% jassim 1 0.00% faliha 1 0.00% ishaqi 1 0.00% (incident 1 0.00% abe

1 0.00% sirie 1 0.00% koppel 1 0.00% shinzo 1 0.00% zarquawi 1 0.00% (draftees 1 0.00% pike 1 0.00% globalsecurity 1 0.00% sally 1 0.00% donnelly 1 0.00% calley 1 0.00% giòi 1 0.00% (hope 1 0.00% vo5i 1 0.00% engelhardt 1 0.00% tomdispatch 1 0.00% zeus 1 0.00% damage 1 0.00% aqi 1 0.00% barry 1 0.00% mullah 1 0.00% stearns 1 0.00% pyjama 1 0.00% ramadi 1 0.00% aparisim 1 0.00% hamdullah 1 0.00% (collateral 1 0.00% eldon 1 0.00% bargewell 1 0.00% pendleton 1 0.00% husayif 1 0.00% nhèm 1 0.00% 320usd 1 0.00% cu3aâ3 1 0.00% pharma 1 0.00% quyê2nâ3 1 0.00% lèm 1 0.00% trozim 1 0.00% (700 1 0.00% berut 1 0.00% awkar 1 0.00% tarcefokyn 1 0.00% (ct 1 0.00% zuellig 1 0.00% 25000 1 0.00% 327 1 0.00% massage 1 0.00% tu'3u 1 0.00% tro'5t 1 0.00% phai 1 0.00% ddô2m 1 0.00% 450dd 1 0.00% ddôa3c

1 0.00% 'da5 1 0.00% berirut 1 0.00% nnptnt 1 0.00% xie 1 0.00% deshui 1 0.00% deutsche 1 0.00% mu’a 1 0.00% trust 1 0.00% yogurt 1 0.00% swarzkopt 1 0.00% samarra 1 0.00% nahiba 1 0.00% mahmud 1 0.00% heo…cùng 1 0.00% norman 1 0.00% tuz 1 0.00% (bvddk 1 0.00% adel 1 0.00% truo'1c 1 0.00% tra( 1 0.00% (ds 1 0.00% kazzaz 1 0.00% beauty 1 0.00% beast 1 0.00% 805 1 0.00% lêniniste 1 0.00% philipin 1 0.00% kurmatu 1 0.00% nghiã' 1 0.00% goldwyn 1 0.00% lott 1 0.00% perot 1 0.00% bartis 1 0.00% à…ddúng 1 0.00% iwhrekan 1 0.00% qùa 1 0.00% nhâ3n 1 0.00% ngu'á 1 0.00% depot 1 0.00% aga 1 0.00% cmc 1 0.00% sheibani 1 0.00% (kháng 1 0.00% hamza 1 0.00% rabia 1 0.00% bakhtur 1 0.00% quôc 1 0.00% satan 1 0.00% ruholah 1 0.00% khomeini 1 0.00% ebrahim 1 0.00% hoffman

1 0.00% strait 1 0.00% lu’o’5t 1 0.00% hco 1 0.00% investment 1 0.00% transnational 1 0.00% (buô5c 1 0.00% to’2 1 0.00% 322 1 0.00% (innovation 1 0.00% pacifichem 1 0.00% honolulu 1 0.00% internationalization 1 0.00% nha(2mmu5c 1 0.00% ddánhgiá 1 0.00% trei3n 1 0.00% qh10 1 0.00% câ2p 1 0.00% ha3ng 1 0.00% menchov 1 0.00% cuaro’ 1 0.00% mu’o’1n 1 0.00% (khtn 1 0.00% (khxh 1 0.00% cu4a 1 0.00% tru'5 1 0.00% po'1p 1 0.00% ngô4ng' 1 0.00% bakhthur 1 0.00% alvarez 1 0.00% gambino 1 0.00% 8500 1 0.00% tôi… 1 0.00% konstantinos 1 0.00% cesar 1 0.00% hujra 1 0.00% pashtun 1 0.00% sâ3y 1 0.00% abscam 1 0.00% ðu’1c 1 0.00% burkanday 1 0.00% chu’1c’ 1 0.00% choctaws 1 0.00% qua3hay 1 0.00% dorgan 1 0.00% choctaw 1 0.00% “thành 1 0.00% ‘thôi 1 0.00% alexandria 1 0.00% coushatta 1 0.00% ring 1 0.00% rehoboth 1 0.00% town

1 0.00% talk 1 0.00% jalalabad 1 0.00% montélimar 1 0.00% misri 1 0.00% juergen 1 0.00% masri 1 0.00% darrunta 1 0.00% maghrebi 1 0.00% farraj 1 0.00% béziers 1 0.00% shakai 1 0.00% abd 1 0.00% rahman 1 0.00% jehl 1 0.00% ti… 1 0.00% (afghanistan 1 0.00% khurseed 1 0.00% crocker 1 0.00% sheikh 1 0.00% rashid 1 0.00% gì” 1 0.00% carlotta 1 0.00% gall 1 0.00% 22000 1 0.00% “tô3ng 1 0.00% msnbc 1 0.00% 'ddô2i 1 0.00% mai' 1 0.00% 'rô2ng 1 0.00% 'bá 1 0.00% chu3' 1 0.00% 'ddui' 1 0.00% lâ3u 1 0.00% su'a3 1 0.00% tich 1 0.00% ddâ1t' 1 0.00% u5n 1 0.00% “ban 1 0.00% ga(5ng 1 0.00% 670 1 0.00% xáng 1 0.00% 370 1 0.00% gâ5p 1 0.00% (42 1 0.00% (khóm 1 0.00% ngiê5m 1 0.00% 353 1 0.00% mobicard 1 0.00% têt 1 0.00% quy5t 1 0.00% lèn 1 0.00% pearls

1 0.00% án” 1 0.00% 988 1 0.00% orchad 1 0.00% katong 1 0.00% “cha5y 1 0.00% damian 1 0.00% orchard 1 0.00% 'nàng 1 0.00% 'tour 1 0.00% 888 1 0.00% limousine 1 0.00% stefano 1 0.00% mu3n 1 0.00% nhu4ng” 1 0.00% nho3… 1 0.00% kloden 1 0.00% buá 1 0.00% vinh' 1 0.00% (1549 1 0.00% khcn 1 0.00% ngò 1 0.00% thô5t 1 0.00% di5t 1 0.00% dâ5u 1 0.00% nhu’5a 1 0.00% toáy 1 0.00% kha(m 1 0.00% trâ1u 1 0.00% ngô5n 1 0.00% eng 1 0.00% éc 1 0.00% chóa 1 0.00% carraro 1 0.00% tíc 1 0.00% ru’o’4i 1 0.00% cha(1m 1 0.00% dda(p 1 0.00% ngô2n 1 0.00% iut 1 0.00% bachelor 1 0.00% licence 1 0.00% gio3i' 1 0.00% dea 1 0.00% as 1 0.00% re3o 1 0.00% lo'1 1 0.00% ô1t 1 0.00% maitrise 1 0.00% doctorat 1 0.00% ddãm 1 0.00% (tru'o'3ng 1 0.00% clamoxyl

1 0.00% adrenalin 1 0.00% khô3n 1 0.00% qu5y 1 0.00% luciano 1 0.00% hoa3( 1 0.00% tu’o’1c 1 0.00% thân( 1 0.00% 7610 1 0.00% (tham 1 0.00% 382a 1 0.00% kìê5n 1 0.00% vâ5p 1 0.00% tktt 1 0.00% choe5t 1 0.00% táu 1 0.00% fibre2fashion 1 0.00% pha3y 1 0.00% (thâ1y 1 0.00% moggi 1 0.00% ghiê5m 1 0.00% vts 1 0.00% lt 1 0.00% si3nh 1 0.00% (lame 1 0.00% ammonium 1 0.00% (phiên 1 0.00% pyinmana 1 0.00% templer 1 0.00% nitrate 1 0.00% khin 1 0.00% (total 1 0.00% daewoo 1 0.00% ongc 1 0.00% chiê1p…me5 1 0.00% burma 1 0.00% dâ4nddâ2u 1 0.00% 685 1 0.00% tutu 1 0.00% vaclav 1 0.00% havel 1 0.00% dô5 1 0.00% vù…vù… 1 0.00% cô2m 1 0.00% 200000 1 0.00% tdx 1 0.00% …me5 1 0.00% sarandon 1 0.00% lô2m 1 0.00% nengzheng 1 0.00% zaw 1 0.00% morris 1 0.00% kandawgyi

1 0.00% steeple 1 0.00% muse 1 0.00% kyaw 1 0.00% katie 1 0.00% redford 1 0.00% earthrights 1 0.00% ddoái 1 0.00% koji 1 0.00% triumph 1 0.00% methamphetamine 1 0.00% cnooc 1 0.00% symon 1 0.00% hideaki 1 0.00% videsh 1 0.00% 182 1 0.00% offshore 1 0.00% (asean 1 0.00% gulkin 1 0.00% siam 1 0.00% mizukoshi 1 0.00% sihasak 1 0.00% phuangketkaew 1 0.00% rosnef 1 0.00% nho’3n 1 0.00% rakowski 1 0.00% honecker 1 0.00% pha5n 1 0.00% serbian 1 0.00% balkans 1 0.00% tuo'3ng 1 0.00% tht 1 0.00% vãng 1 0.00% (ddông 1 0.00% zhikov 1 0.00% (bulgary 1 0.00% (1992 1 0.00% nothing 1 0.00% (thu'o'5ng 1 0.00% gaza's 1 0.00% difference 1 0.00% (na(ng 1 0.00% paletine 1 0.00% gia3o 1 0.00% nho’ 1 0.00% (1991 1 0.00% amiri 1 0.00% zdebko 1 0.00% tomanovic 1 0.00% moldova 1 0.00% xiét 1 0.00% baltic 1 0.00% ekho

1 0.00% moskvy 1 0.00% (kremlin 1 0.00% 5700 1 0.00% 150000 1 0.00% gasprom 1 0.00% lavia 1 0.00% jerzy 1 0.00% marek 1 0.00% yogyokarta 1 0.00% vu'5o't 1 0.00% khuâng 1 0.00% naftogaz 1 0.00% 603 1 0.00% essar 1 0.00% trie5âu 1 0.00% (nato 1 0.00% armenia 1 0.00% cu’u5 1 0.00% trám 1 0.00% khemr 1 0.00% bâng 1 0.00% (pyongyang 1 0.00% tru'o'1ùc 1 0.00% goldberg 1 0.00% gideon 1 0.00% (axis 1 0.00% evil 1 0.00% mohandas 1 0.00% jawaharlal 1 0.00% nehru 1 0.00% (born 1 0.00% again 1 0.00% dda5o5 1 0.00% xuâ1i 1 0.00% cu5c…cu5c…cu5c 1 0.00% deng 1 0.00% xiaoping 1 0.00% natan 1 0.00% sharansky 1 0.00% gorbachev 1 0.00% 1648 1 0.00% eagleburger 1 0.00% (utopia 1 0.00% bi4 1 0.00% cu5ng 1 0.00% wesphalia 1 0.00% ttiê5u 1 0.00% lng 1 0.00% (liquefied 1 0.00% comman 1 0.00% (members 1 0.00% bruxelles

1 0.00% safta 1 0.00% brum 1 0.00% 700km 1 0.00% natural 1 0.00% mani 1 0.00% shankar 1 0.00% sindhi 1 0.00% gengali 1 0.00% telugu 1 0.00% marathi 1 0.00% trillion 1 0.00% (purchasing 1 0.00% parity 1 0.00% oriya 1 0.00% punjabi 1 0.00% assamese 1 0.00% urdu 1 0.00% gujarati 1 0.00% kannada 1 0.00% soljenitsyn 1 0.00% ibrahim 1 0.00% jaafari 1 0.00% hakim 1 0.00% ngu'o'2i' 1 0.00% ddóng' 1 0.00% khalizad 1 0.00% cpa 1 0.00% mowaffaq 1 0.00% shlomo 1 0.00% chà…chà…ta 1 0.00% hirsh 1 0.00% provisional 1 0.00% co’4 1 0.00% ròi 1 0.00% (cu'3 1 0.00% (state 1 0.00% cáøc 1 0.00% diane 1 0.00% ablonczy 1 0.00% nashat 1 0.00% aqtash 1 0.00% birzeit 1 0.00% (câu 1 0.00% ddoá 1 0.00% (muslim 1 0.00% woodrow 1 0.00% 1917 1 0.00% (america 1 0.00% quincy 1 0.00% adams 1 0.00% 1821 1 0.00% empire

1 0.00% (détente 1 0.00% sakharo 1 0.00% su'3u3 1 0.00% gio'1ùi 1 0.00% (evil 1 0.00% wilkerson 1 0.00% rami 1 0.00% khouri 1 0.00% ziad 1 0.00% al–bashir 1 0.00% bulliet 1 0.00% tì 1 0.00% 2200 1 0.00% karen 1 0.00% hughes 1 0.00% feith 1 0.00% (review 1 0.00% if 1 0.00% haq 1 0.00% gwozdecky 1 0.00% “nhu’4ng 1 0.00% thê3” 1 0.00% ''có 1 0.00% giá'' 1 0.00% nhân'' 1 0.00% jbeil 1 0.00% nahriya 1 0.00% sadr 1 0.00% breaking 1 0.00% bint 1 0.00% (inconceivable 1 0.00% ác'' 1 0.00% sa(ùt 1 0.00% quô1âc 1 0.00% (oecd 1 0.00% 2040 1 0.00% 2050 1 0.00% mo'i 1 0.00% jordanie 1 0.00% khí'' 1 0.00% ''tru5c 1 0.00% khô5ng 1 0.00% (ba3o 1 0.00% baradei 1 0.00% near 1 0.00% dirita 1 0.00% whitman 1 0.00% lu’5u 1 0.00% (messianic 1 0.00% khô… 1 0.00% orders 1 0.00% (presidential

1 0.00% finding 1 0.00% blitzer 1 0.00% 'ném 1 0.00% (executing 1 0.00% sàng… 1 0.00% bernstein 1 0.00% (plan 1 0.00% attack 1 0.00% 2400 1 0.00% moqtada 1 0.00% carl 1 0.00% 1 0.00% judith 1 0.00% miller 1 0.00% pulitzer 1 0.00% rather 1 0.00% (paper 1 0.00% vu'o'1t 1 0.00% aznar 1 0.00% (bring 1 0.00% them 1 0.00% helmand 1 0.00% willing 1 0.00% maría 1 0.00% 'pha3n 1 0.00% loa5n' 1 0.00% baath 1 0.00% (dead 1 0.00% alive 1 0.00% thía 1 0.00% 1411 1 0.00% bo’1i 1 0.00% nghi5u 1 0.00% colgate 1 0.00% ducks 1 0.00% axis 1 0.00% ro’1t 1 0.00% ballistic 1 0.00% missile 1 0.00% abm 1 0.00% rwanda 1 0.00% ddâ2u…vua 1 0.00% (vi5 1 0.00% 2025 1 0.00% yê3n 1 0.00% ddâ2âu 1 0.00% bengali 1 0.00% thây 1 0.00% hôi5 1 0.00% traí 1 0.00% derming 1 0.00% golman

1 0.00% yarnell 1 0.00% chemical 1 0.00% (r 1 0.00% mouwafak 1 0.00% guantanamo 1 0.00% barakzayi 1 0.00% ðánh 1 0.00% naway 1 0.00% madeline 1 0.00% qaida 1 0.00% ttk 1 0.00% kufa 1 0.00% tho’2 1 0.00% na(m1979 1 0.00% tai5 1 0.00% davos 1 0.00% tê4nh 1 0.00% (pre 1 0.00% emptive 1 0.00% saeedi 1 0.00% khalid 1 0.00% rodhan 1 0.00% speculation 1 0.00% johns 1 0.00% hopkins 1 0.00% strike 1 0.00% (bully 1 0.00% (wild 1 0.00% ve5m 1 0.00% 12000 1 0.00% pangandara 1 0.00% (isotope 1 0.00% patriot 1 0.00% ddi4nh 1 0.00% scud 1 0.00% torng 1 0.00% (high 1 0.00% phét 1 0.00% hexafluoride 1 0.00% (cascade 1 0.00% enriched 1 0.00% montesquieu 1 0.00% (separation 1 0.00% (check 1 0.00% biden 1 0.00% safed 1 0.00% baron 1 0.00% 1863 1 0.00% isael 1 0.00% (difference 1 0.00% balance 1 0.00% (law

1 0.00% order 1 0.00% (vu4 1 0.00% niall 1 0.00% pity 1 0.00% 219 1 0.00% zbigniew 1 0.00% shemona 1 0.00% khruschev 1 0.00% (weapons 1 0.00% destruction 1 0.00% kiryat 1 0.00% gm 1 0.00% (neo 1 0.00% conservative 1 0.00% shabbaz 1 0.00% high 1 0.00% chad 1 0.00% (moscow 1 0.00% chiefs 1 0.00% staff 1 0.00% low 1 0.00% board 1 0.00% bouchard 1 0.00% cambone 1 0.00% 'du'1t 1 0.00% ddiê3m' 1 0.00% (defense 1 0.00% (cruise 1 0.00% (black 1 0.00% reconnaissance 1 0.00% (arabian 1 0.00% (luâ5t 1 0.00% (covert 1 0.00% operations 1 0.00% (berlin 1 0.00% (ballistic 1 0.00% udi 1 0.00% (tactical 1 0.00% mahmoudiya 1 0.00% strategic 1 0.00% (xuyên 1 0.00% 23000 1 0.00% forein 1 0.00% kalandi 1 0.00% minendra 1 0.00% villepin 1 0.00% sharra 1 0.00% farouk 1 0.00% toàu5 1 0.00% fool 1 0.00% twice 1 0.00% (la5i

1 0.00% kantipur 1 0.00% hadi 1 0.00% meir 1 0.00% dagan 1 0.00% azeri 1 0.00% baluchi 1 0.00% galluci 1 0.00% thinking 1 0.00% kuswoyo 1 0.00% moriarty 1 0.00% knesset 1 0.00% nahavandian 1 0.00% (wishful 1 0.00% sa(1c” 1 0.00% hitz 1 0.00% (it's 1 0.00% (intelligence 1 0.00% deutch 1 0.00% chê3nh 1 0.00% hãn' 1 0.00% mèn… 1 0.00% estimate 1 0.00% nie 1 0.00% 'thành 1 0.00% terrorism 1 0.00% prevention 1 0.00% wto–liên 1 0.00% achin 1 0.00% lyndon 1 0.00% raborn 1 0.00% (norad 1 0.00% (bay 1 0.00% pigs 1 0.00% stansfield 1 0.00% turner 1 0.00% (gengis 1 0.00% scandasia 1 0.00% schlesinger 1 0.00% colby 1 0.00% mobil 1 0.00% federal 1 0.00% (ftc 1 0.00% exchange 1 0.00% nhirn 1 0.00% jingpping 1 0.00% tru’4 1 0.00% âm' 1 0.00% (crude 1 0.00% zheng 1 0.00% lockyer 1 0.00% 'ddàn 1 0.00% mercantile

1 0.00% gouging 1 0.00% (strategic 1 0.00% petroleum 1 0.00% tuâ1 1 0.00% nha' 1 0.00% (price 1 0.00% (nhiên 1 0.00% kilduff 1 0.00% fimat 1 0.00% reserve 1 0.00% allan 1 0.00% hubbard 1 0.00% hillenkoetter 1 0.00% immigration 1 0.00% studies 1 0.00% (bush 1 0.00% 633 1 0.00% 8000 1 0.00% krikorian 1 0.00% ho5” 1 0.00% em… 1 0.00% “phía 1 0.00% “cha(1c 1 0.00% sessions 1 0.00% ra(2n 1 0.00% stewart 1 0.00% duck 1 0.00% 1150 1 0.00% nhéo 1 0.00% 652 1 0.00% (vama 1 0.00% lame 1 0.00% guest 1 0.00% janice 1 0.00% kephart 1 0.00% khu’5ng 1 0.00% goldman 1 0.00% roberts 1 0.00% (director 1 0.00% nsa 1 0.00% “danh 1 0.00% oss 1 0.00% ddi5nh” 1 0.00% 1941 1 0.00% hoyt 1 0.00% vanderberd 1 0.00% roscoe 1 0.00% xi5a 1 0.00% sidney 1 0.00% souers 1 0.00% nam… 1 0.00% (daily

1 0.00% presidential 1 0.00% briefing 1 0.00% geneve 1 0.00% (agency 1 0.00% dpb 1 0.00% “thông 1 0.00% kept 1 0.00% secrets 1 0.00% grisham 1 0.00% clancy 1 0.00% (cu5 1 0.00% tahrir 1 0.00% caliphate 1 0.00% corera 1 0.00% jihads 1 0.00% hizb 1 0.00% ut 1 0.00% (su'5 1 0.00% lak 1 0.00% (centcom 1 0.00% terror 1 0.00% threat 1 0.00% remains 1 0.00% qeada 1 0.00% mclellan 1 0.00% tommy 1 0.00% franks 1 0.00% taiwan 1 0.00% micrsoft 1 0.00% webofficenow 1 0.00% morning 1 0.00% dak 1 0.00% secretary 1 0.00% mcnamara 1 0.00% delong 1 0.00% cnn's 1 0.00% sensenbrenner 1 0.00% (pha5t 1 0.00% (civil 1 0.00% (ghettos 1 0.00% 1920 1 0.00% 1921 1 0.00% (progressive 1 0.00% (dám 1 0.00% (amnesty 1 0.00% offense 1 0.00% mahony 1 0.00% 'cho'i' 1 0.00% (property 1 0.00% generational 1 0.00% warfare 1 0.00% 'kim

1 0.00% macdill 1 0.00% chung' 1 0.00% fourth 1 0.00% nu’o’ng 1 0.00% centcom 1 0.00% truân 1 0.00% (extremist 1 0.00% ideology 1 0.00% soft 1 0.00% crepso 1 0.00% (gdp 1 0.00% frnakfurt 1 0.00% “lu’5a 1 0.00% cars 1 0.00% pickup 1 0.00% truck 1 0.00% running 1 0.00% anything 1 0.00% done 1 0.00% (bell 1 0.00% shah 1 0.00% chuông' 1 0.00% (hybrid 1 0.00% mtbe 1 0.00% xu’o’3ng 1 0.00% tiê2n… 1 0.00% (refining 1 0.00% 'tu'3 1 0.00% (dâ2u 1 0.00% schearf 1 0.00% insect 1 0.00% halliburton 1 0.00% shinto 1 0.00% (tuy2 1 0.00% masahiro 1 0.00% seatle 1 0.00% you're 1 0.00% elane 1 0.00% vàng' 1 0.00% 'chuông 1 0.00% sa(3n 1 0.00% wenyi 1 0.00% (people's 1 0.00% republic 1 0.00% (triê5u 1 0.00% “ddâ1u 1 0.00% (wang 1 0.00% 'sâu 1 0.00% willfully 1 0.00% intimidating 1 0.00% coercing 1 0.00% come

1 0.00% back 1 0.00% (misdemeanor 1 0.00% dda(1n” 1 0.00% span 1 0.00% ddôi” 1 0.00% threatening 1 0.00% harassing 1 0.00% official 1 0.00% guard 1 0.00% masuda 1 0.00% shigeru 1 0.00% matxco’va 1 0.00% hideshi 1 0.00% takesada 1 0.00% khu’3 1 0.00% 'tam 1 0.00% cho’1 1 0.00% ten' 1 0.00% 1700 1 0.00% 228 1 0.00% hoàn' 1 0.00% beck 1 0.00% 4000 1 0.00% jaebum 1 0.00% yonsei 1 0.00% falklands 1 0.00% spratlys 1 0.00% 1900 1 0.00% akihiko 1 0.00% hu’3u 1 0.00% hye 1 0.00% asahi 1 0.00% nishi 1 0.00% nihon 1 0.00% no’3 1 0.00% yitzhak 1 0.00% si3u 1 0.00% burghardt 1 0.00% (stroke 1 0.00% thô5n 1 0.00% phalangist 1 0.00% sabra 1 0.00% chatila 1 0.00% bashar 1 0.00% menachem 1 0.00% do’2i 1 0.00% du’o’4ng” 1 0.00% ðô4 1 0.00% thu3… 1 0.00% monrovia 1 0.00% 'a(ng 1 0.00% tuô2n

1 0.00% 4595 1 0.00% dorothy 1 0.00% dworskin 1 0.00% “bô2i 1 0.00% connecticut 1 0.00% crane 1 0.00% dda( 1 0.00% váng…nhanh 1 0.00% sukehiro 1 0.00% hasegawa 1 0.00% 1300 1 0.00% tasi 1 0.00% tolu 1 0.00% gusmão 1 0.00% dare 1 0.00% rogério 1 0.00% 767 1 0.00% vision 1 0.00% micky 1 0.00% maubisse 1 0.00% fatuahi 1 0.00% rory 1 0.00% callinan 1 0.00% abile 1 0.00% (ám 1 0.00% chili 1 0.00% kazkhstan 1 0.00% glock 1 0.00% stephanie 1 0.00% becora 1 0.00% ru’o’3i 1 0.00% magnum 1 0.00% steyr 1 0.00% sinapan 1 0.00% samydorai 1 0.00% think 1 0.00% báng 1 0.00% chee 1 0.00% a321 1 0.00% hectare 1 0.00% cô3… 1 0.00% aâu 1 0.00% chiam 1 0.00% see 1 0.00% frederick 1 0.00% chok 1 0.00% con…hu 1 0.00% a320 1 0.00% sheila 1 0.00% (pho3ng 1 0.00% hu… 1 0.00% (pap

1 0.00% loong 1 0.00% aljunied 1 0.00% goh 1 0.00% kuan 1 0.00% yew 1 0.00% hsien 1 0.00% râ1p 1 0.00% nhít 1 0.00% tro'i 1 0.00% phaolô 1 0.00% kilowatt 1 0.00% phân… 1 0.00% co'4i 1 0.00% ma5nh' 1 0.00% 'xiê1t 1 0.00% (tô1n 1 0.00% camry 1 0.00% nghê4 1 0.00% gioan 1 0.00% lý' 1 0.00% ddai' 1 0.00% xu'ô1ng 1 0.00% 'biê3u 1 0.00% tình' 1 0.00% reiter 1 0.00% site 1 0.00% (appeals 1 0.00% ddiê4n 1 0.00% chu5m 1 0.00% lô2n 1 0.00% benenson 1 0.00% 'khuynh 1 0.00% loát' 1 0.00% câ2u' 1 0.00% toyota 1 0.00% 'mo'3 1 0.00% tri5' 1 0.00% 50000 1 0.00% (forticule 1 0.00% cho3m 1 0.00% nghén 1 0.00% 'vàng 1 0.00% lô5n' 1 0.00% lo'5i' 1 0.00% 496 1 0.00% 25b 1 0.00% xê3nh 1 0.00% tê1' 1 0.00% 61 1 0.00% 473 1 0.00% 'kính 1 0.00% my4'

1 0.00% 'ddôi 1 0.00% tró 1 0.00% hòi 1 0.00% tô4ng 1 0.00% ddu'5oc 1 0.00% ácch 1 0.00% cghu’4a 1 0.00% ddá…ha 1 0.00% discovry 1 0.00% ti…pha3i 1 0.00% ha…vô 1 0.00% diaa 1 0.00% rashwan 1 0.00% ahram 1 0.00% (cung 1 0.00% fossumvà 1 0.00% lu’3ng 1 0.00% gan” 1 0.00% ha(ng… 1 0.00% judea 1 0.00% samaria 1 0.00% temple 1 0.00% mount 1 0.00% qasa 1 0.00% amir 1 0.00% peretz 1 0.00% “to 1 0.00% nho’3 1 0.00% hydro 1 0.00% evangelicals 1 0.00% tu'21 1 0.00% ngu’ 1 0.00% fadli 1 0.00% ru'3i 1 0.00% ngu5a 1 0.00% dòi 1 0.00% phâ3u 1 0.00% 39000 1 0.00% hydrazine 1 0.00% kiê4m 1 0.00% (cá 1 0.00% (petro 1 0.00% rê5u 1 0.00% eilliam 1 0.00% 'lên 1 0.00% gân' 1 0.00% lo’ 1 0.00% giôn 1 0.00% lùc 1 0.00% 'ddô1i 1 0.00% lu'o'5c' 1 0.00% thuo'2ng

1 0.00% wìliam 1 0.00% 'tha3o 1 0.00% cao' 1 0.00% ba(1tddâ2u 1 0.00% rental 1 0.00% alamo 1 0.00% fila 1 0.00% iap 1 0.00% lnr 1 0.00% anchor 1 0.00% (vulture 1 0.00% (bankruptcy 1 0.00% protection 1 0.00% hedge 1 0.00% hades 1 0.00% 'tô3 1 0.00% netco 1 0.00% redlands 1 0.00% kelley 1 0.00% (appropriations 1 0.00% chôn… 1 0.00% thâ3u 1 0.00% (góp 1 0.00% (political 1 0.00% (mutual 1 0.00% funds 1 0.00% “bu’o’1c 1 0.00% (significant 1 0.00% role 1 0.00% bastain 1 0.00% against 1 0.00% waste 1 0.00% politics 1 0.00% 407 1 0.00% republican 1 0.00% pathez 1 0.00% 141 1 0.00% 328 1 0.00% tro’n 1 0.00% no’1i 1 0.00% ddi…không 1 0.00% responsive 1 0.00% gomes 1 0.00% garn 1 0.00% 410 1 0.00% hu…hích… 1 0.00% ddâu…hu 1 0.00% nãi 1 0.00% roth 1 0.00% zabel 1 0.00% gazette 1 0.00% numo

1 0.00% democrat 1 0.00% schulte 1 0.00% (indictment 1 0.00% (435 1 0.00% (incumbents 1 0.00% (challengers 1 0.00% stade 1 0.00% no’5 1 0.00% (lobbyists 1 0.00% levey 1 0.00% roche 1 0.00% interests 1 0.00% mathes 1 0.00% alessandro 1 0.00% noam 1 0.00% mariannas 1 0.00% ngúm 1 0.00% “chô1ng 1 0.00% cu'o'1c' 1 0.00% merrill 1 0.00% (budget 1 0.00% deficit 1 0.00% gôloa” 1 0.00% “chú 1 0.00% skyboxes 1 0.00% suót 1 0.00% rutgers 1 0.00% sampa 1 0.00% nigel 1 0.00% gottlieb 1 0.00% quô1ctê1 1 0.00% sún 1 0.00% fn 1 0.00% aviation 1 0.00% ðào 1 0.00% lawler 1 0.00% ha5i” 1 0.00% ambassador's 1 0.00% ritz 1 0.00% “tê5 1 0.00% dave 1 0.00% (blatant 1 0.00% 'lý 1 0.00% dormund 1 0.00% groups 1 0.00% strussion 1 0.00% buchanan 1 0.00% (staffer 1 0.00% v…ddua 1 0.00% potomac 1 0.00% shiled 1 0.00% neil

1 0.00% soviet 1 0.00% pioneers 1 0.00% cu’1ng 1 0.00% corleone 1 0.00% won 1 0.00% (felonies 1 0.00% lu’o’4ng 1 0.00% (college 1 0.00% “thâ1t 1 0.00% lafayette 1 0.00% (skyboxes 1 0.00% trâ5n” 1 0.00% brandeis 1 0.00% guys 1 0.00% lousis 1 0.00% (bulk 1 0.00% rate 1 0.00% kathryn 1 0.00% lehman 1 0.00% thu3” 1 0.00% fantasy 1 0.00% sports 1 0.00% good 1 0.00% oklahoma 1 0.00% rogan 1 0.00% hayes 1 0.00% (football 1 0.00% redskins 1 0.00% (contract 1 0.00% tít… 1 0.00% maximus 1 0.00% marianas 1 0.00% conrad 1 0.00% doolittle 1 0.00% reid 1 0.00% employee 1 0.00% circus 1 0.00% copley 1 0.00% territories 1 0.00% namibia 1 0.00% newt 1 0.00% gingrich 1 0.00% bryant 1 0.00% phu’o’2ng 1 0.00% lu’o’4i 1 0.00% saipan 1 0.00% marianna 1 0.00% (us 1 0.00% diners 1 0.00% northern 1 0.00% co’2 1 0.00% “kiê1m

1 0.00% moral 1 0.00% ‘game’ 1 0.00% falwell 1 0.00% toward 1 0.00% tradition 1 0.00% “game” 1 0.00% louie 1 0.00% co’m 1 0.00% bryan 1 0.00% focus 1 0.00% 'pích' 1 0.00% ta… 1 0.00% goodlatte 1 0.00% (speaker 1 0.00% “nín 1 0.00% 927 1 0.00% sau… 1 0.00% tho’3” 1 0.00% thâ5t… 1 0.00% mô1c”cu3a 1 0.00% (libertarian 1 0.00% ground 1 0.00% hu'2… 1 0.00% vo’2 1 0.00% “dâ1u 1 0.00% ghét… 1 0.00% (amendments 1 0.00% (suspension 1 0.00% jeb 1 0.00% matthew 1 0.00% “zidane 1 0.00% tortilla 1 0.00% châ3y 1 0.00% (simple 1 0.00% calendar 1 0.00% ru'òm 1 0.00% vìu5 1 0.00% quarterly 1 0.00% ralston 1 0.00% elot 1 0.00% groveri 1 0.00% tro’1 1 0.00% century 1 0.00% strategies 1 0.00% “swordsman” 1 0.00% “chính 1 0.00% denny 1 0.00% “thi 1 0.00% tu'1c… 1 0.00% zindane 1 0.00% spending 1 0.00% talal

1 0.00% fairmont 1 0.00% hotels 1 0.00% daimlerchrysler 1 0.00% tate 1 0.00% alwaleed 1 0.00% wind 1 0.00% brad 1 0.00% financial 1 0.00% resorts 1 0.00% orascom 1 0.00% holding 1 0.00% tussauds 1 0.00% feet 1 0.00% csx 1 0.00% jumeirah 1 0.00% oán” 1 0.00% burj 1 0.00% “ân 1 0.00% drake 1 0.00% deborah 1 0.00% madame 1 0.00% essex 1 0.00% hemsley 1 0.00% sir 1 0.00% xa3n 1 0.00% “la(1m 1 0.00% bechtel 1 0.00% khuê1ch 1 0.00% abullah 1 0.00% jeddah 1 0.00% petrodollar 1 0.00% ipsos 1 0.00% pew 1 0.00% “câ5u 1 0.00% a380 1 0.00% airways 1 0.00% emaar 1 0.00% (dda(5t 1 0.00% cayman 1 0.00% xe3n 1 0.00% da3 1 0.00% pfc 1 0.00% 'miu 1 0.00% broadband 1 0.00% tâ5t” 1 0.00% khuyê1ch 1 0.00% rabigh 1 0.00% aramco 1 0.00% bourband 1 0.00% oriental 1 0.00% nazi 1 0.00% rô1n

1 0.00% einstein 1 0.00% 1852 1 0.00% 1660 1 0.00% charle 1 0.00% (it 1 0.00% rare 1 0.00% example 1 0.00% denmark 1 0.00% doanh” 1 0.00% “ba3n 1 0.00% 1788 1 0.00% châ2y 1 0.00% dornan 1 0.00% villaraigosa 1 0.00% “nêm” 1 0.00% …ngoài 1 0.00% (ddiê3n 1 0.00% cancun 1 0.00% (welsh 1 0.00% botany 1 0.00% (musician 1 0.00% san' 1 0.00% vincente 1 0.00% venezelua 1 0.00% xoi 1 0.00% ddê1” 1 0.00% vanuatu 1 0.00% immigrant 1 0.00% zimbabue 1 0.00% phâ4m 1 0.00% (dpw 1 0.00% peninsular 1 0.00% (inflation 1 0.00% (stagnation 1 0.00% “hoàng 1 0.00% mel 1 0.00% 41 1 0.00% tancredo 1 0.00% welfare 1 0.00% but 1 0.00% exist 1 0.00% 209 1 0.00% (arizona 1 0.00% arlen 1 0.00% specter 1 0.00% border 1 0.00% angles 1 0.00% denver 1 0.00% taylor 1 0.00% diet 1 0.00% calcium 1 0.00% infant

1 0.00% death 1 0.00% syndrome 1 0.00% product 1 0.00% (index 1 0.00% (consumer 1 0.00% mandel 1 0.00% (gross 1 0.00% domestic 1 0.00% (sudden 1 0.00% (abscam 1 0.00% 'phai 1 0.00% flake 1 0.00% (metastasized 1 0.00% (fighting 1 0.00% tinh' 1 0.00% o5c 1 0.00% “khá 1 0.00% sids 1 0.00% medicare 1 0.00% u’u 1 0.00% (knowledge 1 0.00% chu’1 1 0.00% (jobless 1 0.00% recovery 1 0.00% landefeld 1 0.00% (software 1 0.00% businessweek 1 0.00% okun 1 0.00% (opec 1 0.00% hùng… 1 0.00% (misery 1 0.00% (gô5p 1 0.00% arthur 1 0.00% transistor 1 0.00% “trong 1 0.00% roosevelt 1 0.00% flamingo 1 0.00% confidence 1 0.00% nasdaq 1 0.00% 1929 1 0.00% siegel 1 0.00% bell 1 0.00% labs 1 0.00% las 1 0.00% vegas 1 0.00% bugsy 1 0.00% guy4 1 0.00% (hoà 1 0.00% (mexico 1 0.00% haiti 1 0.00% (chile 1 0.00% michelle

1 0.00% bachelet 1 0.00% aires 1 0.00% buenos 1 0.00% “ngay 1 0.00% gringo 1 0.00% (non 1 0.00% lionel 1 0.00% ghinh' 1 0.00% khameini 1 0.00% saud 1 0.00% faisal 1 0.00% cupper 1 0.00% (incompetent 1 0.00% hector 1 0.00% (uae 1 0.00% (singing 1 0.00% court 1 0.00% (accusations 1 0.00% abul 1 0.00% bianchi 1 0.00% (1974 1 0.00% 'sinh 1 0.00% strom 1 0.00% versus 1 0.00% valeo 1 0.00% ernest 1 0.00% (break 1 0.00% columbus 1 0.00% julio 1 0.00% thurmond 1 0.00% 637 1 0.00% tcpv 1 0.00% (tcpv 1 0.00% reactor 1 0.00% canberra 1 0.00% tantilo 1 0.00% dehli 1 0.00% (fast 1 0.00% breeder 1 0.00% stans 1 0.00% (fair 1 0.00% share 1 0.00% (containment 1 0.00% brewster 1 0.00% maurice 1 0.00% (esa 1 0.00% envisat 1 0.00% hamamatsu 1 0.00% (slide 1 0.00% (james 1 0.00% cook 1 0.00% madagascar

1 0.00% amylose 1 0.00% tru5quô1c 1 0.00% now 1 0.00% lynne 1 0.00% râ 1 0.00% logic 1 0.00% (ta3ng 1 0.00% (maritime 1 0.00% (cotton 1 0.00% vàâ 1 0.00% date 1 0.00% (british 1 0.00% (southampton 1 0.00% xe… 1 0.00% vest 1 0.00% british 1 0.00% afar 1 0.00% (belfast 1 0.00% cingular 1 0.00% wireless 1 0.00% m8 1 0.00% (great 1 0.00% (giô1ng 1 0.00% panasonic 1 0.00% (near 1 0.00% vie 1 0.00% andes 1 0.00% ddddst 1 0.00% (perfect 1 0.00% 15 1 0.00% lu´a 1 0.00% anderson 1 0.00% exmovere 1 0.00% hadjiis 1 0.00% (hàm 1 0.00% (nhe5 1 0.00% soyuz 1 0.00% hiê 1 0.00% bae 1 0.00% giô´ng 1 0.00% bluetooth 1 0.00% pc 1 0.00% farnborough 1 0.00% (21dd 1 0.00% (37dd 1 0.00% (36 1 0.00% 26dd 1 0.00% 28dd 1 0.00% (44dd 1 0.00% solomon 1 0.00% (31dd 1 0.00% tu'o'1c'

1 0.00% gang… 1 0.00% (35 1 0.00% (34 1 0.00% 31dd 1 0.00% xoa5t 1 0.00% (cddsp 1 0.00% (na(2m 1 0.00% gps 1 0.00% 5b 1 0.00% (phu5c 1 0.00% 000km 1 0.00% 42dd 1 0.00% 29dd 1 0.00% 492 1 0.00% 405 1 0.00% liê5u… 1 0.00% 19001785 1 0.00% (chi5 1 0.00% ka'aba 1 0.00% bengal 1 0.00% cbc 1 0.00% 9333 1 0.00% (1801 1 0.00% 1865 1 0.00% vecto' 1 0.00% 60km 1 0.00% (gd 1 0.00% tncs 1 0.00% nghu'4ng 1 0.00% stabilimenta 1 0.00% 14dd 1 0.00% diêu 1 0.00% (26dd 1 0.00% 'hâ2u 1 0.00% sphecidae 1 0.00% (tiê1p 1 0.00% 780 1 0.00% làâ 1 0.00% 16dd 1 0.00% 13dd 1 0.00% 11dd 1 0.00% rubinokia 1 0.00% cha(n… 1 0.00% ayef 1 0.00% stormeye 1 0.00% vmc 1 0.00% (pr 1 0.00% (hongtrung1987hnd 1 0.00% ddi…la(n 1 0.00% kv2 1 0.00% exciter 1 0.00% (mã

1 0.00% (minh 1 0.00% goldsun 1 0.00% (vaa 1 0.00% coca 1 0.00% cola 1 0.00% tgdd 1 0.00% tru'o'2n…to'1i 1 0.00% quebéc 1 0.00% bates 1 0.00% golden 1 0.00% advertising 1 0.00% tru'o'2n… 1 0.00% unilever 1 0.00% dentsu 1 0.00% (iata 1 0.00% gtvt 1 0.00% (ca3ng 1 0.00% fibrin 1 0.00% (ddào 1 0.00% 364 1 0.00% airport 1 0.00% consultants 1 0.00% naco 1 0.00% kv3 1 0.00% nhipcautinhban20042003 1 0.00% netherlands 1 0.00% (akhuong1967 1 0.00% (esc_kenvin_matnick 1 0.00% so'1t 1 0.00% nghê2… 1 0.00% (kv2 1 0.00% la5i…và 1 0.00% op 1 0.00% sx 1 0.00% perionyx 1 0.00% excavatus 1 0.00% rân 1 0.00% chô3ng 1 0.00% (ddiê3m 1 0.00% iaculor 1 0.00% injection 1 0.00% sherbrooke 1 0.00% (contrast 1 0.00% (brightness 1 0.00% mm 1 0.00% bdi 1 0.00% 'thu3y 1 0.00% clor 1 0.00% mini 1 0.00% helium 1 0.00% (cnw 1 0.00% wthr

1 0.00% 390m 1 0.00% bars 1 0.00% sintef 1 0.00% zdnets 1 0.00% sonic 1 0.00% boom 1 0.00% pips 1 0.00% volume 1 0.00% (color 1 0.00% gâ2n… 1 0.00% dden' 1 0.00% lpr 1 0.00% dvorak 1 0.00% vaughan 1 0.00% hull 1 0.00% avenir 1 0.00% khòm 1 0.00% schubert 1 0.00% (bourbo 1 0.00% (naintraco 1 0.00% vinagimex 1 0.00% mahler 1 0.00% bruckner 1 0.00% sibelius 1 0.00% sciences 1 0.00% (chip 1 0.00% niclosamite 1 0.00% mb 1 0.00% endosulfan 1 0.00% dden… 1 0.00% freescale 1 0.00% kyoto 1 0.00% châ3u 1 0.00% sò 1 0.00% (ram 1 0.00% ram 1 0.00% kivas 1 0.00% (vsattp 1 0.00% vsattp 1 0.00% (cpi 1 0.00% ba5nh 1 0.00% resort 1 0.00% ssh 1 0.00% o'3… 1 0.00% tu'o'm 1 0.00% nhu'g 1 0.00% ku… 1 0.00% tuýp 1 0.00% hiê5u… 1 0.00% xxxxii 1 0.00% 897 1 0.00% 983

1 0.00% 64 1 0.00% 513 1 0.00% 608 1 0.00% 176 1 0.00% catalogue 1 0.00% valentine 1 0.00% xnk 1 0.00% 862 1 0.00% click 1 0.00% lóm 1 0.00% pétersburg 1 0.00% khè 1 0.00% caramen 1 0.00% se3o 1 0.00% mastercard 1 0.00% pha3i…cu3a 1 0.00% ro'o'o' 1 0.00% boss 1 0.00% xi5ch 1 0.00% ddá…cây 1 0.00% gucci 1 0.00% armani 1 0.00% bu'o'1m… 1 0.00% và…gâ2n 1 0.00% (chiê1c 1 0.00% hydrocarbure 1 0.00% soài 1 0.00% ilulissat 1 0.00% gâ2n…và 1 0.00% nielsen 1 0.00% skov 1 0.00% joern 1 0.00% dzo5t 1 0.00% buo'1c 1 0.00% da5m 1 0.00% (bddhq 1 0.00% 334 1 0.00% chu'1…câ3n 1 0.00% ddê1n…hê1t 1 0.00% dwt 1 0.00% 000dwt 1 0.00% (gtcc 1 0.00% kìn 1 0.00% du4i 1 0.00% vê2…ki 1 0.00% (vietcombank 1 0.00% giu'o'2ng…không 1 0.00% tra(5c 1 0.00% (lào 1 0.00% (bank 1 0.00% forget 1 0.00% (bidina

1 0.00% (eximbank 1 0.00% bidiphar 1 0.00% (ktm 1 0.00% kém… 1 0.00% ho'i… 1 0.00% (hsbc 1 0.00% champasak 1 0.00% cbf 1 0.00% nói… 1 0.00% risk 1 0.00% nâ1c…cu5c 1 0.00% leavitt 1 0.00% (ita 1 0.00% wildlife 1 0.00% schiver 1 0.00% ttgdck 1 0.00% 414 1 0.00% (bosf 1 0.00% bosf 1 0.00% hn 1 0.00% câ5u… 1 0.00% b6 1 0.00% hiê5 1 0.00% ny 1 0.00% gtcc 1 0.00% c17 1 0.00% p3 1 0.00% tu'o'2ng…và 1 0.00% ngu3… 1 0.00% (vhtt 1 0.00% (bhxh 1 0.00% nghe…rô2i 1 0.00% 13h 1 0.00% zizou 1 0.00% continental 1 0.00% classe 1 0.00% 1901 1 0.00% citroen 1 0.00% à…vâng 1 0.00% amg 1 0.00% sa5ch…bông 1 0.00% didier 1 0.00% 645 1 0.00% gìn… 1 0.00% cls55 1 0.00% slr 1 0.00% maserati 1 0.00% lancia 1 0.00% ying 1 0.00% (157 1 0.00% (cospar 1 0.00% (tùy

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 236 1 0.00% carrera 1 0.00% 1 1 0.00% mclaren 1 0.00% vw 1 0.00% jaguar 1 0.00% renault 1 0.00% ddình… 1 0.00% vâm 1 0.00% (tro'2i 1 0.00% myfinances 1 0.00% webgiadinh 1 0.00% lascaux 1 0.00% hebrew 1 0.00% (chênh 1 0.00% thang… 1 0.00% hình… 1 0.00% (1m79 1 0.00% (ngay 1 0.00% ùmmm 1 0.00% rx 1 0.00% rs6 1 0.00% (mô2ng 1 0.00% ùmmmmm 1 0.00% sl65 1 0.00% lexus 1 0.00% 155 1 0.00% (51 1 0.00% khú 1 0.00% alexanrda 1 0.00% 599 1 0.00% gtb 1 0.00% phuuu 1 0.00% (60 1 0.00% ma(5n… 1 0.00% calais 1 0.00% ônh 1 0.00% hhà 1 0.00% a8l 1 0.00% nogaard 1 0.00% vietsovpetro 1 0.00% diesel 1 0.00% cay…bánh 1 0.00% cassano 1 0.00% pas 1 0.00% (chô1ng 1 0.00% sh 1 0.00% maixo 1 0.00% (ma5 1 0.00% crôm 1 0.00% titan 1 0.00% mu'o'ng 1 0.00% 5m2

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 237 1 0.00% nord 1 0.00% chát…miê1ng 1 0.00% wave 1 0.00% yoshii 1 0.00% colleen 1 0.00% ha(4ng 1 0.00% nho'1n 1 0.00% cadillac 1 0.00% escalade 1 0.00% x5 1 0.00% fe 1 0.00% (191 1 0.00% (160 1 0.00% enzo 1 0.00% hmmm 1 0.00% béo…su'4a 1 0.00% a8 1 0.00% 4x4 1 0.00% (ceerd 1 0.00% ddu'o'2ng…bánh 1 0.00% h2 1 0.00% (thô1t 1 0.00% navigato 1 0.00% rolls 1 0.00% scout 1 0.00% phantom 1 0.00% laboratory 1 0.00% gallardo 1 0.00% arnage 1 0.00% (diem 1 0.00% micro 1 0.00% (vui 1 0.00% 2011 1 0.00% (ddê2 1 0.00% (cvtv 1 0.00% (trái 1 0.00% mém 1 0.00% xii 1 0.00% (ddô1i 1 0.00% ya 1 0.00% ua 1 0.00% lích 1 0.00% botshabelo 1 0.00% (kcx 1 0.00% 15g30 1 0.00% huflit 1 0.00% gary 1 0.00% eum 1 0.00% ô3ng 1 0.00% (ffi 1 0.00% cathay 1 0.00% chu5p…hình

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 238 1 0.00% ksnd 1 0.00% (bâ1m 1 0.00% ici 1 0.00% cho'5… 1 0.00% krông 1 0.00% ea 1 0.00% spagetti 1 0.00% châ4m 1 0.00% (ttn 1 0.00% rông 1 0.00% tu'… 1 0.00% 16h 1 0.00% kar 1 0.00% (svtn 1 0.00% qua… 1 0.00% tôr 1 0.00% vinhempich 1 0.00% (189 1 0.00% granite 1 0.00% riu 1 0.00% ddo5i 1 0.00% nó…quá 1 0.00% 1388 1 0.00% samosa 1 0.00% trung… 1 0.00% (145 1 0.00% (4a 1 0.00% ta(1p 1 0.00% 699 1 0.00% cu'1u…con 1 0.00% 504 1 0.00% d70s 1 0.00% olympus 1 0.00% e300 1 0.00% zi 1 0.00% forum 1 0.00% 384 1 0.00% toitim 1 0.00% muabanraovat 1 0.00% ttvnol 1 0.00% nikon 1 0.00% trê4… 1 0.00% du'4… 1 0.00% tccn 1 0.00% jessy 1 0.00% bich 1 0.00% litani 1 0.00% 249 1 0.00% vogvn 1 0.00% 455 1 0.00% 247 1 0.00% novosti

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 239 1 0.00% 7m 1 0.00% how 1 0.00% (i'm 1 0.00% glad 1 0.00% 267 1 0.00% vo'1i…kiê1ng 1 0.00% hello 1 0.00% boyfriend 1 0.00% shì 1 0.00% mas 1 0.00% meet 1 0.00% trít 1 0.00% konnichiwa 1 0.00% gtgt 1 0.00% 471 1 0.00% xiú 1 0.00% 60m2 1 0.00% garfield 1 0.00% nâ4ng 1 0.00% malaysia(a3nh 1 0.00% (cbcc 1 0.00% penang 1 0.00% (tncn 1 0.00% (qtkd 1 0.00% lén…công 1 0.00% commonwealth 1 0.00% 8215969 1 0.00% jubilo 1 0.00% iwata 1 0.00% vasco 1 0.00% gama 1 0.00% (1987 1 0.00% sandfield 1 0.00% 8214444 1 0.00% ddô5… 1 0.00% lúc…tí 1 0.00% bernabeu 1 0.00% 8214730 1 0.00% marlene 1 0.00% luxemburgo 1 0.00% vào…nhu'ng 1 0.00% autuori 1 0.00% teixeira 1 0.00% 2014 1 0.00% vanderlei 1 0.00% sul 1 0.00% corinthians 1 0.00% (1984 1 0.00% joachim 1 0.00% rio 1 0.00% grande 1 0.00% kenneth

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 240 1 0.00% lâ5y 1 0.00% hành… 1 0.00% 17h 1 0.00% 23h20 1 0.00% fa 1 0.00% tét 1 0.00% (nmnth 1 0.00% 22g 1 0.00% huyyyyy 1 0.00% (tro'5 1 0.00% kennet 1 0.00% merseyside 1 0.00% (wipo 1 0.00% spike 1 0.00% spikelee 1 0.00% 220m2 1 0.00% (226 1 0.00% 153 1 0.00% ddo'm 1 0.00% cruise 1 0.00% tomcruise 1 0.00% (saigontourist 1 0.00% bbc1 1 0.00% bbc2 1 0.00% khoa(1n 1 0.00% câ5p… 1 0.00% bô1p 1 0.00% kampoo 1 0.00% 429 1 0.00% (co'2 1 0.00% thê3u 1 0.00% (14 1 0.00% la3m 1 0.00% tanimex 1 0.00% 9h 1 0.00% 30 1 0.00% panna 1 0.00% quá…khó 1 0.00% khuây 1 0.00% quá…cao 1 0.00% trum 1 0.00% (hàng 1 0.00% (cu'5u 1 0.00% 771 1 0.00% fatehah 1 0.00% mustapa 1 0.00% vlxd 1 0.00% qua3… 1 0.00% (t 1 0.00% andorra 1 0.00% karlsruhe 1 0.00% ahhhhh

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 241 1 0.00% livorno 1 0.00% macedonia 1 0.00% estonia 1 0.00% giàng 1 0.00% paletta 1 0.00% gonzalez 1 0.00% phew 1 0.00% teery 1 0.00% bellamy 1 0.00% asanovic 1 0.00% (rumani 1 0.00% emila 1 0.00% rô2i…pháo 1 0.00% oana 1 0.00% madalina 1 0.00% florea 1 0.00% blance 1 0.00% la(1m…nhu'ng 1 0.00% aljosa 1 0.00% ngon… 1 0.00% zlatko 1 0.00% kranjcar 1 0.00% (áo 1 0.00% telegraph 1 0.00% hcmcgj 1 0.00% emb 1 0.00% testosterone 1 0.00% vienna 1 0.00% nelson 1 0.00% (eju 1 0.00% (jasso 1 0.00% (wb 1 0.00% klaus 1 0.00% rohland 1 0.00% jp 1 0.00% 8225314 1 0.00% …pro'2 1 0.00% brewery 1 0.00% (cúp 1 0.00% tô2 1 0.00% 417 1 0.00% apb 1 0.00% (tiê1n 1 0.00% seagate 1 0.00% (apb 1 0.00% nou 1 0.00% (ddt 1 0.00% (cách 1 0.00% (jds 1 0.00% test 1 0.00% thê1…tiê2n 1 0.00% (vgsv

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 242 1 0.00% mext 1 0.00% (nghiên 1 0.00% (bia 1 0.00% winston 1 0.00% (mext 1 0.00% (ddi5nh 1 0.00% hbv 1 0.00% (lâ1y 1 0.00% tamiflu 1 0.00% jice 1 0.00% (nu'o'1c 1 0.00% (muô1i 1 0.00% nghê3u 1 0.00% (gakushushoreihi 1 0.00% (la5nh 1 0.00% 20g 1 0.00% 30g 1 0.00% xoang 1 0.00% (gió 1 0.00% (na(1ng 1 0.00% 300dd 1 0.00% cô2 1 0.00% hi5ên 1 0.00% b17 1 0.00% tru'o'1c… 1 0.00% 800dd 1 0.00% (gdgt 1 0.00% (gv 1 0.00% gmat 1 0.00% b19 1 0.00% (trô2ng 1 0.00% schriver 1 0.00% câ2u… 1 0.00% (tddc 1 0.00% 564 1 0.00% ttptqdd 1 0.00% (ttptqdd 1 0.00% 530 1 0.00% cardinal 1 0.00% khêu 1 0.00% (ma5o 1 0.00% (khách 1 0.00% 568 1 0.00% 538 1 0.00% 469 1 0.00% maùi 1 0.00% troáng 1 0.00% vòt 1 0.00% coâng 1 0.00% nghieäp 1 0.00% hoaøng 1 0.00% maya

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 243 1 0.00% chão 1 0.00% va(2m 1 0.00% (tsc 1 0.00% lovelygirl4687 1 0.00% toserco 1 0.00% pha3i… 1 0.00% (totem 1 0.00% nào… 1 0.00% toefl 1 0.00% kolbe 1 0.00% (lu'o'5c 1 0.00% ielts 1 0.00% tröôùc 1 0.00% (ñ 1 0.00% naøy 1 0.00% cánh… 1 0.00% loaïi 1 0.00% (cec 1 0.00% ddó…bác 1 0.00% qua5nh 1 0.00% unicode 1 0.00% (nguyenthithuydung712 1 0.00% ngáng 1 0.00% bíc 1 0.00% reap 1 0.00% sihanoukville 1 0.00% sites 1 0.00% font 1 0.00% capumchia 1 0.00% siem 1 0.00% (mosnews 1 0.00% xuô1ng…liê2n 1 0.00% xác… 1 0.00% vu4a 1 0.00% (huybinhmt88 1 0.00% (fareasttravel 1 0.00% (yourangel172 1 0.00% (hddba 1 0.00% (giâ1u 1 0.00% loi 1 0.00% (lat 1 0.00% (nv1 1 0.00% (fta 1 0.00% 976 1 0.00% (xinhuanet 1 0.00% to3m 1 0.00% bettina 1 0.00% lat 1 0.00% awards 1 0.00% bhumibol 1 0.00% adulyadej 1 0.00% vejjajiva

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 244 1 0.00% ill 1 0.00% lybia 1 0.00% (sepa 1 0.00% (li 1 0.00% 402 1 0.00% sahara 1 0.00% uganda 1 0.00% (wfp 1 0.00% wfp 1 0.00% 577 1 0.00% (aei 1 0.00% smh 1 0.00% cooke 1 0.00% eritrea 1 0.00% kenya 1 0.00% allafrica 1 0.00% tuy3 1 0.00% (hùynh 1 0.00% (tvdddd 1 0.00% mri 1 0.00% c7 1 0.00% c3 1 0.00% 8mm 1 0.00% (tran 1 0.00% (ttl 1 0.00% lâu… 1 0.00% churchill 1 0.00% nhâ1t… 1 0.00% hcm… 1 0.00% geffen 1 0.00% (aids 1 0.00% yê1u… 1 0.00% (dhs 1 0.00% 7kg 1 0.00% (nutraingredients 1 0.00% nghiê5m… 1 0.00% (cbt 1 0.00% bergen 1 0.00% (thx 1 0.00% clo 1 0.00% (times 1 0.00% dang_vanhai2001 1 0.00% (phân 1 0.00% smecta 1 0.00% cxiii 1 0.00% eximer 1 0.00% sâ5m 1 0.00% (243a 1 0.00% alterman 1 0.00% (hong_quyen99 1 0.00% antibio 1 0.00% neopeptine

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 245 1 0.00% (truongbangxdag 1 0.00% ec 1 0.00% 'có 1 0.00% rosenlund 1 0.00% (ttytdp 1 0.00% chu'o'2ng 1 0.00% massege 1 0.00% thuyê1n 1 0.00% lu'o'3ng 1 0.00% healthcare 1 0.00% excimer 1 0.00% (nguoikhungbohientu_121171 1 0.00% 551 1 0.00% attp 1 0.00% epic 1 0.00% encounter 1 0.00% 'cái 1 0.00% yom 1 0.00% kippur 1 0.00% 3b 1 0.00% 705 1 0.00% rabinovich 1 0.00% koms 1 0.00% transformed 1 0.00% middle 1 0.00% traivenguon2006 1 0.00% dayan 1 0.00% mig 1 0.00% gíó 1 0.00% này' 1 0.00% hayward 1 0.00% siôn 1 0.00% moreau 1 0.00% aaron 1 0.00% yariv 1 0.00% moshe 1 0.00% (lê4 1 0.00% (ta5m 1 0.00% 16km 1 0.00% blu 1 0.00% 9306737 1 0.00% roma 1 0.00% crai 1 0.00% haaretz 1 0.00% dua 1 0.00% 9302127 1 0.00% 'bô1p' 1 0.00% shevardnadze 1 0.00% salvatore 1 0.00% striano 1 0.00% 66 1 0.00% co5t

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 246 1 0.00% l'express 1 0.00% skyhawk 1 0.00% mirage 1 0.00% quid 1 0.00% 'cha(1c 1 0.00% (tuâ2n 1 0.00% ddân 1 0.00% galilê 1 0.00% voi… 1 0.00% 8225540 1 0.00% ubnviet 1 0.00% sadate 1 0.00% 'cha(3ng 1 0.00% salát 1 0.00% 'tôi 1 0.00% oa(m 1 0.00% maariv 1 0.00% (israel 1 0.00% jenin 1 0.00% bethlehem 1 0.00% ddu5n 1 0.00% tô1i… 1 0.00% hobson 1 0.00% yurczyszyn 1 0.00% 'to'1i 1 0.00% ô2ng 1 0.00% gurion 1 0.00% …ì 1 0.00% adn 1 0.00% hagana 1 0.00% ô5c 1 0.00% gershom 1 0.00% gorenberg 1 0.00% thô5p 1 0.00% …xì 1 0.00% vâ1p( 1 0.00% sõi 1 0.00% 1923 1 0.00% nasser 1 0.00% uthant 1 0.00% (kibbutz 1 0.00% degania 1 0.00% tiberia 1 0.00% l'arche 1 0.00% 474 1 0.00% juin 1 0.00% akaba 1 0.00% jacket 1 0.00% gilê 1 0.00% samakh 1 0.00% salim 1 0.00% inside

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 247 1 0.00% savannah 1 0.00% (jamal 1 0.00% mansour 1 0.00% jamal 1 0.00% (hô1t 1 0.00% yomiuri 1 0.00% stark 1 0.00% (3d 1 0.00% headphone 1 0.00% danna 1 0.00% ý… 1 0.00% lu'ng… 1 0.00% ignacio 1 0.00% schweimler 1 0.00% pelvis 1 0.00% nhâ5t… 1 0.00% fetlock 1 0.00% grovel 1 0.00% yongjian 1 0.00% boneshaker 1 0.00% vertigo 1 0.00% bashthumb 1 0.00% rushforth 1 0.00% 482 1 0.00% assam 1 0.00% smartypantws 1 0.00% queens 1 0.00% tru'3ng 1 0.00% ……………………………………………………………………… 1 0.00% compost 1 0.00% 278 1 0.00% ahn 1 0.00% dda5p… 1 0.00% quo'1i 1 0.00% giâ1c… 1 0.00% ddu'o'2ng…chàng 1 0.00% overmars 1 0.00% seaman 1 0.00% luô1n 1 0.00% …không 1 0.00% ngu'o'2i…là 1 0.00% zola 1 0.00% (mercosur 1 0.00% nu'4a…thê1 1 0.00% o3 1 0.00% thâ2n…và 1 0.00% gianfranco 1 0.00% tô1i…ngô2i 1 0.00% 2051 1 0.00% 437 1 0.00% ru'4a 1 0.00% yongfeng

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 248 1 0.00% 4… 1 0.00% swimbladder 1 0.00% cddv 1 0.00% 423 1 0.00% a(n…nha3y 1 0.00% chét 1 0.00% swashbucke 1 0.00% nu'4a…cho 1 0.00% ………………fax 1 0.00% christie's 1 0.00% mozart 1 0.00% estée 1 0.00% lauder 1 0.00% chocolate 1 0.00% rockland 1 0.00% (massachusetts 1 0.00% u'2m…con 1 0.00% (baby 1 0.00% boomer 1 0.00% vì… 1 0.00% loews 1 0.00% last 1 0.00% sharia 1 0.00% xa5o 1 0.00% fukuyama 1 0.00% end 1 0.00% history 1 0.00% …………………………ddtddd 1 0.00% tomahawk 1 0.00% tisch 1 0.00% …tháng… 1 0.00% ……………………………………… 1 0.00% táo… 1 0.00% lillian 1 0.00% ……………no'i 1 0.00% bòng 1 0.00% phiê1m 1 0.00% matxa 1 0.00% ngày……… 1 0.00% 8g10 1 0.00% 9km 1 0.00% 40oc 1 0.00% sabato 1 0.00% …………………………………………………………………… 1 0.00% …………………… 1 0.00% (tu' 1 0.00% ……………ngày……… 1 0.00% rondon 1 0.00% ………… 1 0.00% harley 1 0.00% ngày…………… 1 0.00% cùng…

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 249 1 0.00% 70m 1 0.00% 'ddâ3y' 1 0.00% bull 1 0.00% mu5t 1 0.00% 'kéo' 1 0.00% mdairej 1 0.00% 0kg 1 0.00% 250g 1 0.00% 8kg 1 0.00% gam 1 0.00% (3200 1 0.00% (5x600 1 0.00% 12cm 1 0.00% 42cm 1 0.00% 46cm 1 0.00% 25cm 1 0.00% 3cm 1 0.00% 1cm 1 0.00% 50gr 1 0.00% quinolon 1 0.00% cimetidin 1 0.00% sox 1 0.00% chlotetracyclin 1 0.00% trimoxazol 1 0.00% spiramycin 1 0.00% fructose 1 0.00% 4kg 1 0.00% 2kg 1 0.00% (cocain 1 0.00% methadon 1 0.00% arginin 1 0.00% calcimex 1 0.00% pô1p 1 0.00% ly5 1 0.00% ths 1 0.00% êkip 1 0.00% fv 1 0.00% 1x2cm 1 0.00% 145mmol 1 0.00% huddersfield 1 0.00% sô5t 1 0.00% (vân 1 0.00% gorontlo 1 0.00% (tim 1 0.00% mòi 1 0.00% (tlt 1 0.00% yaourt 1 0.00% 49cm 1 0.00% 13cm 1 0.00% 48cm 1 0.00% antigen 1 0.00% dên

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 250 1 0.00% cobalt 1 0.00% coleman 1 0.00% (prostate 1 0.00% specific 1 0.00% nha(2n… 1 0.00% khu5t 1 0.00% raisio 1 0.00% kaiser 1 0.00% goldstein 1 0.00% 580 1 0.00% women 1 0.00% co'4n 1 0.00% brigham 1 0.00% valio 1 0.00% (sxh 1 0.00% melinda 1 0.00% bóng…ra 1 0.00% ocytocin 1 0.00% phenylethylamine 1 0.00% amphetamin 1 0.00% (dehydroepiandrosteron 1 0.00% ddê2m 1 0.00% oxtocin 1 0.00% norepinephrine 1 0.00% vasopressine… 1 0.00% nhiê3u 1 0.00% (du5c 1 0.00% du5c… 1 0.00% serotonin 1 0.00% vòng…bessie 1 0.00% testosteron 1 0.00% rda 1 0.00% dietary 1 0.00% prolactin 1 0.00% recommended 1 0.00% erythromycin 1 0.00% nitrofurantoin 1 0.00% gentamycin 1 0.00% 63gr 1 0.00% lô3n 1 0.00% nhô3n 1 0.00% allowances 1 0.00% albopictus 1 0.00% ddá… 1 0.00% cortisol 1 0.00% digest 1 0.00% reader's 1 0.00% aegypti 1 0.00% 1gr 1 0.00% lán 1 0.00% caffeine 1 0.00% loa(ng

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 251 1 0.00% khóc…vì 1 0.00% jacksonville 1 0.00% chuchaisaengrat 1 0.00% panom 1 0.00% meesiriphan 1 0.00% apphich 1 0.00% (dld 1 0.00% boonriang 1 0.00% (who 1 0.00% (idf 1 0.00% ba(ngrôn 1 0.00% dld 1 0.00% yukol 1 0.00% limlamthong 1 0.00% librex 1 0.00% mogadishu 1 0.00% abdullahi 1 0.00% yusuf 1 0.00% tnhk 1 0.00% húp 1 0.00% baidoa 1 0.00% tošenovský 1 0.00% moravskoslezsko 1 0.00% châng 1 0.00% ctv 1 0.00% con' 1 0.00% zedník 1 0.00% chí… 1 0.00% …xì…ì 1 0.00% 'trong 1 0.00% (aman 1 0.00% footspa 1 0.00% flo 1 0.00% eichmann 1 0.00% vitebsk 1 0.00% halperin 1 0.00% working 1 0.00% ss 1 0.00% (nails 1 0.00% xu'3… 1 0.00% ynetnews 1 0.00% môto' 1 0.00% to'2i 1 0.00% 5g40 1 0.00% hanan 1 0.00% nho5c…' 1 0.00% debkafile 1 0.00% 'ho5 1 0.00% safety 1 0.00% 70x70cm 1 0.00% '…qua 1 0.00% 15m

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 252 1 0.00% irvine 1 0.00% ndonesia 1 0.00% thích…ddu'2ng 1 0.00% toby 1 0.00% (rlc 1 0.00% phán… 1 0.00% ngu'o'2i… 1 0.00% birmingham 1 0.00% under 1 0.00% 596 1 0.00% fragrances 1 0.00% xánh 1 0.00% tiê5c… 1 0.00% intelligencer 1 0.00% yêu… 1 0.00% sêry 1 0.00% urê 1 0.00% (xinhua 1 0.00% u'2m 1 0.00% 'nhu'4ng 1 0.00% 21và 1 0.00% (kyodo 1 0.00% khéo… 1 0.00% nha… 1 0.00% ernesto 1 0.00% em…' 1 0.00% 'tu'2 1 0.00% …ba 1 0.00% haret 1 0.00% hreik 1 0.00% rô1ckét 1 0.00% sùm 1 0.00% yanjin 1 0.00% (ddài 1 0.00% shibuya 1 0.00% aqsa 1 0.00% aljazeera 1 0.00% ab 1 0.00% hottest 1 0.00% (iom 1 0.00% 'em 1 0.00% newscastle 1 0.00% shiefield 1 0.00% entrepreneurs 1 0.00% 2214 1 0.00% (bernama 1 0.00% (liba(ng 1 0.00% (voa 1 0.00% 'mô4i 1 0.00% 'vào 1 0.00% cu'o'2m 1 0.00% hàn…

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 253 1 0.00% doang 1 0.00% 300mw 1 0.00% bot 1 0.00% ipp 1 0.00% 150mw 1 0.00% 916 1 0.00% bê1n… 1 0.00% chuy5ên 1 0.00% highway 1 0.00% croissiere… 1 0.00% lo'1n… 1 0.00% cu4…mô5t 1 0.00% hinh 1 0.00% baby 1 0.00% you'll 1 0.00% mssu 1 0.00% (if 1 0.00% tutor 1 0.00% level 1 0.00% southern 1 0.00% chô3i… 1 0.00% never 1 0.00% arc… 1 0.00% (cph 1 0.00% anansi 1 0.00% 400usd 1 0.00% company 1 0.00% boxing 1 0.00% (lcci 1 0.00% bibbidi 1 0.00% commerce 1 0.00% bibibidi 1 0.00% hannibal 1 0.00% (cif 1 0.00% 440usd 1 0.00% alps 1 0.00% (vô 1 0.00% 839 1 0.00% judo 1 0.00% wushu… 1 0.00% plongeon 1 0.00% v… 1 0.00% (karate 1 0.00% hu'2m 1 0.00% atlanta 1 0.00% olympics 1 0.00% menu 1 0.00% karate 1 0.00% 415 1 0.00% ngoác 1 0.00% celtic 1 0.00% ddâ2u…và

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 254 1 0.00% hornpipe 1 0.00% treble 1 0.00% jigs 1 0.00% oireachtas 1 0.00% kelowna 1 0.00% mà…vì 1 0.00% lord 1 0.00% nhâ2u 1 0.00% riverside 1 0.00% ddi…bé 1 0.00% dancing 1 0.00% pttn 1 0.00% burnaby 1 0.00% solo 1 0.00% championships 1 0.00% tu'3…hay 1 0.00% surrey 1 0.00% lu5m 1 0.00% nè…bé 1 0.00% nu'4a…nhanh 1 0.00% li3a 1 0.00% kha3nh 1 0.00% cho'2… 1 0.00% ranchomirage 1 0.00% reply 1 0.00% béng 1 0.00% khê3nh 1 0.00% falls 1 0.00% verison 1 0.00% so'5i…mô5t 1 0.00% nhim 1 0.00% blogweb 1 0.00% kha3ng 1 0.00% mode 1 0.00% woodbridge 1 0.00% nhanh… 1 0.00% (women 1 0.00% need 1 0.00% gio'1i… 1 0.00% (vl 1 0.00% nhà… 1 0.00% lp 1 0.00% dâ2n… 1 0.00% 620 1 0.00% schwartzeneggar 1 0.00% joan 1 0.00% phuong 1 0.00% sachen 1 0.00% anhalt 1 0.00% xong… 1 0.00% vieteuro 1 0.00% (horea

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 255 1 0.00% 42 1 0.00% mercer 1 0.00% romeo 1 0.00% hyon 1 0.00% otc 1 0.00% engle 1 0.00% (prubf1 1 0.00% (bavik 1 0.00% 6h45 1 0.00% 10h45 1 0.00% broadway 1 0.00% mcneff 1 0.00% trê5ch 1 0.00% wilkins 1 0.00% 19h 1 0.00% pha3i…ta 1 0.00% ogoniok 1 0.00% baseball 1 0.00% 3x4 1 0.00% vodka 1 0.00% 600dd 1 0.00% (nl 1 0.00% là…tô1i 1 0.00% gio'4n 1 0.00% 900dd 1 0.00% brunno 1 0.00% oooh 1 0.00% (hkd 1 0.00% (so'2 1 0.00% (kéo 1 0.00% 717 1 0.00% 830 1 0.00% le5m 1 0.00% elliot 1 0.00% info 1 0.00% bedart 1 0.00% pak 1 0.00% mukilteo 1 0.00% giava 1 0.00% mongthong 1 0.00% (htv 1 0.00% 150m 1 0.00% northwest 1 0.00% thâ1ygus 1 0.00% weekly 1 0.00% norbert 1 0.00% fame 1 0.00% dearborn 1 0.00% sue 1 0.00% unger 1 0.00% automotive 1 0.00% thousand

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 256 1 0.00% oaks 1 0.00% detroit 1 0.00% newcomb 1 0.00% hiawatha 1 0.00% tulane 1 0.00% ag 1 0.00% (dinh 1 0.00% reithofer 1 0.00% menchika 1 0.00% panke 1 0.00% helmut 1 0.00% boola 1 0.00% romulus 1 0.00% thu'2 1 0.00% chrysler 1 0.00% doola 1 0.00% salaga 1 0.00% ldm 1 0.00% cao… 1 0.00% khuy 1 0.00% u'3i 1 0.00% (vnvnonn 1 0.00% hatta 1 0.00% visa… 1 0.00% collins 1 0.00% 992 1 0.00% register 1 0.00% aladin 1 0.00% nathan 1 0.00% price 1 0.00% vuô1ng 1 0.00% caro 1 0.00% controls 1 0.00% shao 1 0.00% xán 1 0.00% mowgli 1 0.00% qiwei 1 0.00% (tsunami 1 0.00% crain 1 0.00% radjasa 1 0.00% whittington 1 0.00% control 1 0.00% (ddtddd 1 0.00% oleguer 1 0.00% changed 1 0.00% earth 1 0.00% mình…nhu'ng 1 0.00% place 1 0.00% bronkhorst 1 0.00% mutu 1 0.00% không…và 1 0.00% mun

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 257 1 0.00% hãi… 1 0.00% heaven 1 0.00% when 1 0.00% gudjohnsen 1 0.00% giâ5n… 1 0.00% ddêm…mà 1 0.00% na5t… 1 0.00% mu3ng 1 0.00% (chuyê3n 1 0.00% nhu'…ngo5n 1 0.00% nhê2n 1 0.00% hê… 1 0.00% eidur 1 0.00% stone 1 0.00% bridge 1 0.00% olivier 1 0.00% dde5p… 1 0.00% riêng… 1 0.00% (afc 1 0.00% broadcasting 1 0.00% (ffa 1 0.00% 'và 1 0.00% billy 1 0.00% plimp 1 0.00% suleco 1 0.00% (câ5u 1 0.00% moore 1 0.00% o'neill 1 0.00% (innovative 1 0.00% ventures 1 0.00% guillou 1 0.00% arsene 1 0.00% c1 1 0.00% josie 1 0.00% four 1 0.00% hiddink 1 0.00% hungaria 1 0.00% ti5… 1 0.00% wenger 1 0.00% nó… 1 0.00% socceroos 1 0.00% almeida 1 0.00% 50m 1 0.00% 3421 1 0.00% bo'i…bo'i… 1 0.00% ddó…kitty 1 0.00% khu'o'1u 1 0.00% nghe…nhu'ng 1 0.00% intouch 1 0.00% vnaemb 1 0.00% lu'3 1 0.00% bô1n…

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 258 1 0.00% gì…nhu'ng 1 0.00% 823 1 0.00% mofa 1 0.00% (paraguay 1 0.00% mfa 1 0.00% (venezuela 1 0.00% cairô 1 0.00% nicanor 1 0.00% tròi 1 0.00% gddt 1 0.00% 6928 1 0.00% vasquez 1 0.00% cls 1 0.00% ntn 1 0.00% salomon 1 0.00% feyenoord 1 0.00% ba(2m 1 0.00% tung… 1 0.00% andriy 1 0.00% shevchenko 1 0.00% atletico 1 0.00% bilbao 1 0.00% achilles 1 0.00% sô3ng 1 0.00% buddhist 1 0.00% (zoning 1 0.00% nói…piê1p 1 0.00% gu5 1 0.00% 1189 1 0.00% obi 1 0.00% lo'1p… 1 0.00% 8612 1 0.00% 336 1 0.00% ô2n… 1 0.00% claude 1 0.00% luôn… 1 0.00% old 1 0.00% trafford 1 0.00% chelea 1 0.00% ta…và 1 0.00% (ubtvqh 1 0.00% vizag 1 0.00% luzon 1 0.00% pradesh 1 0.00% andhra 1 0.00% trét 1 0.00% guide 1 0.00% nhình 1 0.00% buô2m… 1 0.00% tvqh 1 0.00% (xác 1 0.00% messenger

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 259 1 0.00% pclb 1 0.00% tro5ng… 1 0.00% su'5… 1 0.00% 57250099 1 0.00% 70 1 0.00% anh…ta5i 1 0.00% (mèo 1 0.00% (jean 1 0.00% newswires 1 0.00% xiu 1 0.00% soo 1 0.00% (khóa 1 0.00% 268 1 0.00% thoa(n 1 0.00% tra3o 1 0.00% nhiêm 1 0.00% (vùng 1 0.00% va(5t…trong 1 0.00% tính…chê1t 1 0.00% tay…tro'2i 1 0.00% tí…con 1 0.00% nha…tro'2i 1 0.00% 342ha 1 0.00% (ddêm 1 0.00% nhu'…mang 1 0.00% xa(n 1 0.00% chân… 1 0.00% xoàn 1 0.00% nhu'ng…dâ4m 1 0.00% phô5ng 1 0.00% 144ha 1 0.00% (nông 1 0.00% nè…xê1p 1 0.00% dozen 1 0.00% 747ha 1 0.00% nè…chuô1t 1 0.00% farm 1 0.00% ôi… 1 0.00% avenal 1 0.00% mâ2y 1 0.00% doping 1 0.00% asiad 1 0.00% zeo 1 0.00% ddó…mô5t 1 0.00% nga(2m 1 0.00% vôn 1 0.00% ru'o'2i 1 0.00% ddu'2 1 0.00% 01m 1 0.00% pân 1 0.00% 57km 1 0.00% 1g8'56

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 260 1 0.00% osca 1 0.00% creusot 1 0.00% montceau 1 0.00% mines 1 0.00% alberto 1 0.00% (safa 1 0.00% lddtbxh 1 0.00% sceaux 1 0.00% antony 1 0.00% news24 1 0.00% billiards 1 0.00% snooker 1 0.00% xéc 1 0.00% khoa3ng150 1 0.00% sa(1c… 1 0.00% 9h30 1 0.00% 943045 1 0.00% héo… 1 0.00% 57250095 1 0.00% ddây… 1 0.00% quê2u 1 0.00% 943048 1 0.00% (bùi 1 0.00% lb 1 0.00% (bvd 1 0.00% (tn 1 0.00% vladimirovich 1 0.00% vladimia 1 0.00% 961 1 0.00% webcam 1 0.00% 'k' 1 0.00% vích 1 0.00% 'a' 1 0.00% ai… 1 0.00% 15h20 1 0.00% bót 1 0.00% ddéc 1 0.00% (khoa 1 0.00% 234 1 0.00% 920 1 0.00% 377 1 0.00% hoán 1 0.00% su'1c…cho'5t 1 0.00% 7p 1 0.00% 'trô5m 1 0.00% kê1… 1 0.00% âmthâ2m 1 0.00% ra(1n…nho'2 1 0.00% bu'o'1i 1 0.00% ddâ2t 1 0.00% bô1… 1 0.00% sentani

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 261 1 0.00% lu'4a 1 0.00% shangye 1 0.00% ra(5t 1 0.00% thiê1t…kim 1 0.00% cha(5t…kim 1 0.00% bi5n 1 0.00% ri5n 1 0.00% mình…mô5t 1 0.00% giang…gia(5c 1 0.00% eiu 1 0.00% 400kg 1 0.00% djik 1 0.00% tandtc 1 0.00% xính 1 0.00% aid 1 0.00% co3n 1 0.00% 136 1 0.00% (ca(1t 1 0.00% vksndtc 1 0.00% khong 1 0.00% go3i 1 0.00% (la(ng 1 0.00% cha5y…bo5n 1 0.00% gi 1 0.00% vinaconex 1 0.00% 480 1 0.00% (làng 1 0.00% te 1 0.00% (636 1 0.00% pham 1 0.00% ''gio'1i 1 0.00% tre3'' 1 0.00% (dssgdd 1 0.00% nao… 1 0.00% (isp 1 0.00% websites 1 0.00% (bài 1 0.00% (chô2ng 1 0.00% theo… 1 0.00% (vdc 1 0.00% 174 1 0.00% (tnhh 1 0.00% ftp 1 0.00% telnet 1 0.00% isp 1 0.00% 520198 1 0.00% su3i 1 0.00% da5… 1 0.00% kìa… 1 0.00% (techcombank 1 0.00% (dda(1c 1 0.00% phên

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 262 1 0.00% ilizarov 1 0.00% luyn 1 0.00% 240 1 0.00% phêng 1 0.00% pheng 1 0.00% ama 1 0.00% 166 1 0.00% 964 1 0.00% ê4nh 1 0.00% 797 1 0.00% 980 1 0.00% 326 1 0.00% haryanto 1 0.00% ro3i 1 0.00% ratna 1 0.00% (papua 1 0.00% antara 1 0.00% ô5p… 1 0.00% 914 1 0.00% 778 1 0.00% 722 1 0.00% 195 1 0.00% 32 1 0.00% (1997 1 0.00% ra5t… 1 0.00% shiu 1 0.00% kee 1 0.00% simcard 1 0.00% 12g 1 0.00% hóavâ4n 1 0.00% châ4u 1 0.00% san… 1 0.00% bolzano 1 0.00% 200er 1 0.00% thù…thây 1 0.00% ri3nh 1 0.00% (vna 1 0.00% 325 1 0.00% khua… 1 0.00% 'cây 1 0.00% ddo3…dde5p 1 0.00% size 1 0.00% flint 1 0.00% (pw 1 0.00% viê1t… 1 0.00% 711 1 0.00% bu'o'ng 1 0.00% (cuô1n 1 0.00% twddcsvn 1 0.00% nùng 1 0.00% full 1 0.00% gospel

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 263 1 0.00% nhk 1 0.00% vanh 1 0.00% made 1 0.00% (igfm 1 0.00% ddô5ng' 1 0.00% ddaì 1 0.00% rô2i…nhà 1 0.00% thu'5o'ng 1 0.00% 'ngôn 1 0.00% (ntk 1 0.00% vaò 1 0.00% cnet 1 0.00% nay2 1 0.00% certificate 1 0.00% education 1 0.00% ddaò 1 0.00% ngoaì 1 0.00% ngoaí 1 0.00% ro5 1 0.00% (link 1 0.00% 674m2 1 0.00% bql 1 0.00% nu'o'1c…tin 1 0.00% opendocument 1 0.00% kkt 1 0.00% vàng…chính 1 0.00% uniflashes 1 0.00% nsf 1 0.00% 783b2f92a5b73014c12571ae005592ca 1 0.00% 0l 1 0.00% chùc 1 0.00% ch0 1 0.00% bô1c… 1 0.00% 1g 1 0.00% 22459955 1 0.00% (mu4i 1 0.00% dy 1 0.00% dìa 1 0.00% quy0t 1 0.00% ke5 1 0.00% health 1 0.00% 193 1 0.00% opaque 1 0.00% 570 1 0.00% mác… 1 0.00% mún 1 0.00% chùa…trong 1 0.00% 488 1 0.00% ngoay 1 0.00% qua5ch 1 0.00% 520 1 0.00% ______

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 264 1 0.00% (ec 1 0.00% dower 1 0.00% na(m1981 1 0.00% (dành 1 0.00% 2142 1 0.00% rubber 1 0.00% stamp 1 0.00% mo'5 1 0.00% (www 1 0.00% thtndc 1 0.00% (giu'4a 1 0.00% tóc… 1 0.00% ucraina 1 0.00% 2150 1 0.00% 'chu'a 1 0.00% ghim 1 0.00% (1287 1 0.00% (pghh 1 0.00% (â1p 1 0.00% tay… 1 0.00% 30cm 1 0.00% (ca 1 0.00% vàm 1 0.00% xoè 1 0.00% 395 1 0.00% lpdd 1 0.00% 475 1 0.00% chtaura 1 0.00% masnaa 1 0.00% jalala 1 0.00% békaa 1 0.00% litami 1 0.00% anti 1 0.00% (tân 1 0.00% taiba 1 0.00% shyam 1 0.00% taanayel 1 0.00% arménia 1 0.00% anjar 1 0.00% (bâ5c 1 0.00% siniori 1 0.00% aitaroune 1 0.00% jbail 1 0.00% sumantiawan 1 0.00% villepine 1 0.00% faoud 1 0.00% atlantis 1 0.00% tre3… 1 0.00% (ngo5n 1 0.00% ( 1 0.00% (nasa 1 0.00% 1ngu'o'2i

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 265 1 0.00% tree 1 0.00% yucca 1 0.00% mà… 1 0.00% (los 1 0.00% caifornia 1 0.00% joshua 1 0.00% karachi 1 0.00% mohan 1 0.00% bhattari 1 0.00% (beiruth 1 0.00% sâ1u…â2m 1 0.00% (kathmandu 1 0.00% nai…cu4ng 1 0.00% talat 1 0.00% masood 1 0.00% (seatlle 1 0.00% saran 1 0.00% chandrasekharan 1 0.00% là…là… 1 0.00% bieng 1 0.00% 500m2 1 0.00% stralsund 1 0.00% farnboough 1 0.00% ''trong 1 0.00% mòn'' 1 0.00% xoa3i 1 0.00% ddùi… 1 0.00% o'1n 1 0.00% vedeno 1 0.00% chê1t… 1 0.00% (muô1n 1 0.00% o3m 1 0.00% chô2ng… 1 0.00% (djakarta 1 0.00% gú 1 0.00% ddó… 1 0.00% búng 1 0.00% 24gio'2 1 0.00% phê2 1 0.00% hôm19 1 0.00% eke 1 0.00% lomonosov 1 0.00% bourj 1 0.00% barajneh 1 0.00% gusmao 1 0.00% mick 1 0.00% ichkeria 1 0.00% (dili 1 0.00% nheo 1 0.00% imam 1 0.00% malachenko 1 0.00% kirton

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 266 1 0.00% tê1t… 1 0.00% (palestine 1 0.00% mccomack 1 0.00% 30kg 1 0.00% junaidi 1 0.00% cilacap 1 0.00% dody 1 0.00% jeltsin 1 0.00% (pangandaran 1 0.00% dudi 1 0.00% ghei 1 0.00% sunda 1 0.00% djakarta 1 0.00% alon 1 0.00% nurdina 1 0.00% 05g00 1 0.00% gmt 1 0.00% bachar 1 0.00% ngay… 1 0.00% bánh… 1 0.00% friedman 1 0.00% freidman 1 0.00% hzbollah 1 0.00% k'ho 1 0.00% 735 1 0.00% manifesto 1 0.00% all 1 0.00% created 1 0.00% equal 1 0.00% luyn(dâ2u 1 0.00% cu'óp 1 0.00% nhiê2u… 1 0.00% la5… 1 0.00% tha(1ng… 1 0.00% fulrô 1 0.00% evident 1 0.00% up 1 0.00% true 1 0.00% meaning 1 0.00% deeply 1 0.00% rooted 1 0.00% rise 1 0.00% these 1 0.00% truths 1 0.00% self 1 0.00% its 1 0.00% creed 1 0.00% hold 1 0.00% 524 1 0.00% plds 1 0.00% hp 1 0.00% hoa(2ng

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 267 1 0.00% nga3nh 1 0.00% su5a 1 0.00% room 1 0.00% tho5t 1 0.00% (acb 1 0.00% gáo 1 0.00% nai… 1 0.00% (room 1 0.00% huyê4n 1 0.00% cuo'1p 1 0.00% nu'óc 1 0.00% ddo'3 1 0.00% du'4ng 1 0.00% vis 1 0.00% vi3nh 1 0.00% pl 1 0.00% vu'o'4ng 1 0.00% nga5 1 0.00% gìo' 1 0.00% ne5p 1 0.00% a…râ1t 1 0.00% (bqldaddtltddqt 1 0.00% (kho'3i 1 0.00% a…a…nu'o'1c 1 0.00% zulfeqar 1 0.00% fayyaz 1 0.00% tayyaba 1 0.00% ptdcvn 1 0.00% chu'ng…vua 1 0.00% phau 1 0.00% (ddê1n 1 0.00% me5c 1 0.00% xa(1p 1 0.00% zabiuddin 1 0.00% karki 1 0.00% gulam 1 0.00% chimma 1 0.00% nepak 1 0.00% dhak 1 0.00% bahadur 1 0.00% la3i 1 0.00% nha3i 1 0.00% sayyad 1 0.00% aftab 1 0.00% muhaddin 1 0.00% siddiqui 1 0.00% xòa 1 0.00% xum 1 0.00% even 1 0.00% pleikrông 1 0.00% yaly 1 0.00% tuabin

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 268 1 0.00% difficulties 1 0.00% tomorrow 1 0.00% still 1 0.00% though 1 0.00% ddi5u 1 0.00% face 1 0.00% lumpua 1 0.00% brussel 1 0.00% (bi3 1 0.00% mandelson 1 0.00% (toàn 1 0.00% danlentieng 1 0.00% (geneva 1 0.00% 1gio'2 1 0.00% 2t5 1 0.00% (amm 1 0.00% neon 1 0.00% la(1m… 1 0.00% a…a…trên 1 0.00% nhu5t 1 0.00% 92 1 0.00% 704 1 0.00% communes 1 0.00% lãu3nh 1 0.00% (thô1ng 1 0.00% pha(1c 1 0.00% sichuan 1 0.00% according 1 0.00% elena(mia 1 0.00% (fredy 1 0.00% maestro 1 0.00% guizhou 1 0.00% repressio 1 0.00% (387 1 0.00% 483 1 0.00% (212 1 0.00% aventure 1 0.00% tíê1p 1 0.00% vât 1 0.00% 26 1 0.00% repression 1 0.00% selective 1 0.00% wolfgang 1 0.00% cha5n 1 0.00% gulags 1 0.00% 1960s 1 0.00% 1980s 1 0.00% amdo 1 0.00% rossum 1 0.00% (emmy 1 0.00% jenifer 1 0.00% (kurt

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 269 1 0.00% inner 1 0.00% bennett 1 0.00% (qinghai 1 0.00% russel 1 0.00% turkestan 1 0.00% californie 1 0.00% ngát…qua3 1 0.00% vogel 1 0.00% ru5c 1 0.00% semi 1 0.00% open 1 0.00% elections 1 0.00% singapour 1 0.00% trô1t 1 0.00% a3… 1 0.00% ri5ch 1 0.00% christian(mike 1 0.00% 6300 1 0.00% regnery 1 0.00% publishing 1 0.00% (giê1t 1 0.00% babbin 1 0.00% na(2ng 1 0.00% timperlake 1 0.00% wen 1 0.00% gia(5c… 1 0.00% po 1 0.00% 502 1 0.00% hòanh 1 0.00% storm 1 0.00% jed 1 0.00% lujan 1 0.00% pdn 1 0.00% ddòi… 1 0.00% mcavoy 1 0.00% protosevich 1 0.00% jesse 1 0.00% thâ2h 1 0.00% states 1 0.00% (cha5m 1 0.00% showdown 1 0.00% why 1 0.00% wants 1 0.00% (1996 1 0.00% irwin 1 0.00% xu3ng 1 0.00% csth 1 0.00% (tài 1 0.00% (1982 1 0.00% neame 1 0.00% npc 1 0.00% far

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 270 1 0.00% brezhnev 1 0.00% princelings 1 0.00% tay…qua3 1 0.00% duo'1i 1 0.00% yellow 1 0.00% (spratly 1 0.00% constructive 1 0.00% ca(1t…chi3 1 0.00% thu'2o'ng 1 0.00% brunei 1 0.00% ni5u 1 0.00% meo…meo 1 0.00% fire 1 0.00% engagement 1 0.00% force 1 0.00% outbreak 1 0.00% tho3ng 1 0.00% doàn 1 0.00% mackeno 1 0.00% chopin 1 0.00% ddõ 1 0.00% phu'ong 1 0.00% mu'o'1c 1 0.00% (quy 1 0.00% arraras 1 0.00% kitô 1 0.00% tiatanic 1 0.00% (danh 1 0.00% vinci 1 0.00% qinghai 1 0.00% chê1t…chê1t…chê1t 1 0.00% ho3ang 1 0.00% ft 1 0.00% josh 1 0.00% brookhart 1 0.00% lustgarten 1 0.00% fortune 1 0.00% thành…mèo 1 0.00% 'su'1 1 0.00% shigatze 1 0.00% abrahm 1 0.00% bu'ã 1 0.00% baò 1 0.00% mí 1 0.00% federation 1 0.00% (bwaf 1 0.00% bwaf 1 0.00% varsovie 1 0.00% sàch 1 0.00% bông…trông 1 0.00% (rfa 1 0.00% koa

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 271 1 0.00% tonight 1 0.00% workers' 1 0.00% enterprise 1 0.00% uô1ng…dù 1 0.00% ngênh 1 0.00% ddâ5i 1 0.00% celeste 1 0.00% aei 1 0.00% u'1o'c 1 0.00% traò 1 0.00% beijing 1 0.00% 1250 1 0.00% dominica 1 0.00% mundial 1 0.00% (mongol 1 0.00% 995 1 0.00% barrett 1 0.00% conor 1 0.00% region 1 0.00% 586 1 0.00% sifang 1 0.00% locomotive 1 0.00% invites 1 0.00% (power 1 0.00% bombardier 1 0.00% nortel 1 0.00% (inner 1 0.00% construction 1 0.00% corps 1 0.00% (xpcc 1 0.00% (jimmy 1 0.00% wikimedia 1 0.00% production 1 0.00% 919 1 0.00% kazakhs 1 0.00% manchuria 1 0.00% 630 1 0.00% uyghur 1 0.00% (uy 1 0.00% hòai 1 0.00% thermosiphon 1 0.00% (feet 1 0.00% macarthur 1 0.00% zero 1 0.00% foot 1 0.00% railway 1 0.00% tangula 1 0.00% sq 1 0.00% ammonia 1 0.00% 930 1 0.00% xining 1 0.00% zhang

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 272 1 0.00% invasion 1 0.00% tibet 1 0.00% sót… 1 0.00% (jacinda 1 0.00% firms 1 0.00% join 1 0.00% móp 1 0.00% brenda 1 0.00% cherry 1 0.00% macalister 1 0.00% lucas 1 0.00% (josh 1 0.00% (golnaz 1 0.00% farmani 1 0.00% (ho5c 1 0.00% circle' 1 0.00% (2000 1 0.00% sô1t… 1 0.00% mung 1 0.00% súyt 1 0.00% ro5c 1 0.00% (nha 1 0.00% cap 1 0.00% ddo3m 1 0.00% 'the 1 0.00% lo5at 1 0.00% jafar 1 0.00% balloon 1 0.00% 431 1 0.00% múôn 1 0.00% ihrc 1 0.00% 'crimson 1 0.00% gold' 1 0.00% pizza 1 0.00% circle 1 0.00% (silver 1 0.00% 'offside' 1 0.00% xi5ch… 1 0.00% sa(4ng 1 0.00% sticker 1 0.00% nhõng 1 0.00% (vddck 1 0.00% vddck 1 0.00% (m 1 0.00% hi3nh 1 0.00% nghe5o 1 0.00% tràosau 1 0.00% hóm 1 0.00% theo…ddêm 1 0.00% nhu5a 1 0.00% (drd 1 0.00% kha(n…

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 273 1 0.00% (hxhcnvn 1 0.00% re5c 1 0.00% sòn 1 0.00% (ptqd 1 0.00% xình 1 0.00% nhe3o 1 0.00% ba(1t…quân 1 0.00% viê5cthành 1 0.00% 428 1 0.00% ddau… 1 0.00% (ta(1t 1 0.00% ipr 1 0.00% intellectual 1 0.00% xa5m 1 0.00% 'mo'2i' 1 0.00% (nó 1 0.00% fda 1 0.00% (food 1 0.00% drug 1 0.00% ghém 1 0.00% 301 1 0.00% 'catfish' 1 0.00% quyê2 1 0.00% agreement 1 0.00% bta 1 0.00% most 1 0.00% 215 1 0.00% vao 1 0.00% bilateral 1 0.00% cafe 1 0.00% cu'òi 1 0.00% mo5 1 0.00% favored 1 0.00% cu4' 1 0.00% schlect 1 0.00% mi4 1 0.00% kelly 1 0.00% dent 1 0.00% (trùng 1 0.00% xe' 1 0.00% chxnchvn 1 0.00% tunit 1 0.00% indonesian 1 0.00% hiê2u 1 0.00% panarub 1 0.00% doãi 1 0.00% f50 1 0.00% roi… 1 0.00% tier 1 0.00% va5y 1 0.00% churches 1 0.00% administration

Pham, Kohnert, Carney, 2008, Corpora of Vietnamese Texts, Page 274 1 0.00% (human 1 0.00% trafficking 1 0.00% xbmdd 1 0.00% 'lái 1 0.00% khé 1 0.00% (written 1 0.00% testimonies 1 0.00% (cd 1 0.00% (ha5t 1 0.00% seine 1 0.00% christophe 1 0.00% leningrad 1 0.00% stalingrad 1 0.00% shakespeare 1 0.00% pour 1 0.00% française 1 0.00% j'ai 1 0.00% phô1c 1 0.00% udf 1 0.00% (union 1 0.00% volgograd 1 0.00% mehdi 1 0.00% gorelik 1 0.00% 1264 1 0.00% (thuâ5n 1 0.00% 1874 1 0.00% (tphcm 1 0.00% tiê3n 1 0.00% emma 1 0.00% 1284 1 0.00% (1258 1 0.00% 1232 1 0.00% tru'2u 1 0.00% ghe5o 1 0.00% califoocnia 1 0.00% cu5c…cu5c 1 0.00% victimes 1 0.00% communisme 1 0.00% (toronto 1 0.00% táhi 1 0.00% iiss 1 0.00% audrey 1 0.00% lo5an 1 0.00% cho'2i 1 0.00% xo'1 1 0.00% aux 1 0.00% est 1 0.00% vô2… 1 0.00% criminel 1 0.00% changé 1 0.00% rue 1 0.00% car

1 0.00% députés 1 0.00% veulent 1 0.00% hommage 1 0.00% figaro 1 0.00% guillaume 1 0.00% perrault 1 0.00% seok 1 0.00% (radio 1 0.00% kiê4ng 1 0.00% thóang 1 0.00% xiê2ng 1 0.00% du'1o'i 1 0.00% lôn 1 0.00% mân 1 0.00% 1306 1 0.00% (delayed 1 0.00% replay 1 0.00% tru'1o'1c 1 0.00% íck 1 0.00% (north 1 0.00% lama 1 0.00% (myamar 1 0.00% nho'1p 1 0.00% (ta5p 1 0.00% …nê1u 1 0.00% petrus 1 0.00% thi5ch 1 0.00% nhúa 1 0.00% ra(1n… 1 0.00% nigieria 1 0.00% dalai 1 0.00% (gouverneur 1 0.00% général 1 0.00% élysée 1 0.00% 1787 1 0.00% 1790 1 0.00% 1887 1 0.00% krasne 1 0.00% lucille 1 0.00% ghadimi 1 0.00% 253 1 0.00% 519 1 0.00% cánh…cáo 1 0.00% 1731 1 0.00% 1620 1 0.00% sóm 1 0.00% meregue 1 0.00% (cai 1 0.00% 1613 1 0.00% 1635 1 0.00% 1623 1 0.00% (funan

1 0.00% 1698 1 0.00% 1618 1 0.00% jive 1 0.00% vô2…tru'o'5t 1 0.00% yim 1 0.00% foon 1 0.00% judita 1 0.00% milinic 1 0.00% 1m82 1 0.00% cheong 1 0.00% thuli 1 0.00% sithole 1 0.00% kha(1c… 1 0.00% hrubyova 1 0.00% natasa 1 0.00% pinoza 1 0.00% nada 1 0.00% samurai 1 0.00% jerris 1 0.00% công… 1 0.00% chacao 1 0.00% margarita 1 0.00% (globebeauties 1 0.00% ho5a… 1 0.00% mu4m 1 0.00% mi4m 1 0.00% …ò 1 0.00% dâ2n…nga 1 0.00% alternative 1 0.00% ddô5ng…không 1 0.00% (ddâ1t 1 0.00% à…o'i 1 0.00% belmar 1 0.00% (vâ5t 1 0.00% theo…qua3 1 0.00% fiction 1 0.00% kill 1 0.00% prime 1 0.00% (le4 1 0.00% heroine 1 0.00% pulp 1 0.00% herman 1 0.00% sascha 1 0.00% ngoa(1t 1 0.00% ngô2ng 1 0.00% elisabeth 1 0.00% jacqueline 1 0.00% fernandez 1 0.00% hilliman 1 0.00% shivern 1 0.00% peters 1 0.00% rodney

1 0.00% hàon 1 0.00% gisella 1 0.00% gâ4u 1 0.00% jictzad 1 0.00% 1m85 1 0.00% 1m67 1 0.00% ling 1 0.00% là…con 1 0.00% asiaweek 1 0.00% aaa 1 0.00% thuy 1 0.00% mofya 1 0.00% chisenga 1 0.00% 1m72 1 0.00% moss 1 0.00% been 1 0.00% 1m63 1 0.00% tsymbaliuk 1 0.00% kirazli 1 0.00% sik 1 0.00% shaveena 1 0.00% jetaime 1 0.00% cerge 1 0.00% 1m69 1 0.00% liong 1 0.00% doherty 1 0.00% fatimih 1 0.00% chùy 1 0.00% amparo 1 0.00% chúng… 1 0.00% viên… 1 0.00% hongsakula 1 0.00% sirikit 1 0.00% khóc… 1 0.00% irene 1 0.00% sáez 1 0.00% fedorova 1 0.00% ncgd 1 0.00% rombin 1 0.00% sinh… 1 0.00% phâ3m… 1 0.00% (hhhv 1 0.00% tru'1ng… 1 0.00% quoc 1 0.00% soàn 1 0.00% ddô3i… 1 0.00% kuusela 1 0.00% (1975 1 0.00% vu5t… 1 0.00% betbeze 1 0.00% mills 1 0.00% ga(2n

1 0.00% gáy… 1 0.00% (2004 1 0.00% (vienna 1 0.00% robina 1 0.00% me5…o'…o' 1 0.00% ngâ1n 1 0.00% (jba 1 0.00% outfitters 1 0.00% …tôi 1 0.00% osce 1 0.00% 20m2 1 0.00% (european 1 0.00% akutagawa 1 0.00% penelope 1 0.00% lo'1n…không 1 0.00% naomi 1 0.00% su'1a 1 0.00% ddi…rô2i 1 0.00% lencii 1 0.00% fukuoka 1 0.00% kato 1 0.00% sarkae 1 0.00% female 1 0.00% kyushu 1 0.00% nu'4a…ngày 1 0.00% (ma3ng 1 0.00% vo…vo…vo…vo 1 0.00% co5c…co5c…co5c… 1 0.00% nhuô1c 1 0.00% globals 1 0.00% 25m2 1 0.00% (ddo'n 1 0.00% digital 1 0.00% (spam 1 0.00% wanadoo 1 0.00% chíp… 1 0.00% qua5… 1 0.00% suê 1 0.00% tometa 1 0.00% pte 1 0.00% consultans 1 0.00% eidhr 1 0.00% cu…cu 1 0.00% (tunisia 1 0.00% lel 1 0.00% shaer 1 0.00% goonline 1 0.00% blitz 1 0.00% watchguard 1 0.00% kpro 1 0.00% tre5o 1 0.00% halle

1 0.00% berry 1 0.00% n4120 1 0.00% jessica 1 0.00% alba 1 0.00% pirates 1 0.00% 1m83 1 0.00% 35m 1 0.00% xoe5t 1 0.00% mrs 1 0.00% o'i… 1 0.00% warbird 1 0.00% manhattan 1 0.00% airplane 1 0.00% 250cc 1 0.00% a(ngten 1 0.00% saunders 1 0.00% (luke 1 0.00% (airplane 1 0.00% 700m 1 0.00% 160km 1 0.00% (methanol 1 0.00% (helicopter 1 0.00% cõ 1 0.00% cover 1 0.00% vâ5t…mô5t 1 0.00% rain 1 0.00% spears 1 0.00% hard 1 0.00% (câ1t 1 0.00% '70s 1 0.00% mila 1 0.00% kunis 1 0.00% deryx 1 0.00% se3…bà 1 0.00% creative 1 0.00% britney 1 0.00% way 1 0.00% oh 1 0.00% kiss 1 0.00% lo5i 1 0.00% scale 1 0.00% canto 1 0.00% chò 1 0.00% soco 1 0.00% hiê1t 1 0.00% happy 1 0.00% crazy 1 0.00% ha(5c 1 0.00% 381 1 0.00% 660m3 1 0.00% cha(5p 1 0.00% (isro

1 0.00% mâ4m 1 0.00% 389 1 0.00% bakker 1 0.00% gamespot 1 0.00% brigette 1 0.00% murkowski 1 0.00% hudson 1 0.00% 030m3 1 0.00% 10m 1 0.00% (ddoan 1 0.00% go3ng 1 0.00% nghen 1 0.00% strecker 1 0.00% này…na(m 1 0.00% ping 1 0.00% bei 1 0.00% zo' 1 0.00% shangdong 1 0.00% xa(1t 1 0.00% oách 1 0.00% wang 1 0.00% enoch 1 0.00% nghi5… 1 0.00% vàl 1 0.00% maastricht 1 0.00% 600m3 1 0.00% 8ha 1 0.00% santomauro 1 0.00% quynh 1 0.00% vu'o'2i 1 0.00% lenbanon 1 0.00% nu'o'1i 1 0.00% universee 1 0.00% 661 1 0.00% elysee 1 0.00% fete 1 0.00% musique 1 0.00% koblenz 1 0.00% 373 1 0.00% sofa 1 0.00% capelle 1 0.00% aan 1 0.00% ijssel 1 0.00% mitterrand 1 0.00% jumna 1 0.00% hollande 1 0.00% 960 1 0.00% arturo 1 0.00% wbc 1 0.00% (sawaco 1 0.00% slovakia 1 0.00% nu'4a…nhu'ng

1 0.00% dello 1 0.00% sport 1 0.00% giovanna 1 0.00% zab 1 0.00% (100ha 1 0.00% gazzetta 1 0.00% katarina 1 0.00% rijkaard 1 0.00% inacio 1 0.00% ha(3ng 1 0.00% 18g30 1 0.00% nwankwo 1 0.00% kanu 1 0.00% (chcnnb 1 0.00% patty 1 0.00% schnyder 1 0.00% vazquez 1 0.00% vera 1 0.00% 3kg 1 0.00% abbey 1 0.00% cho'5i 1 0.00% thô1c 1 0.00% almeyda 1 0.00% 16m50 1 0.00% suhaili 1 0.00% (tra3 1 0.00% rudebox74 1 0.00% (hddnd 1 0.00% tolga 1 0.00% biê1t10 1 0.00% be…be 1 0.00% kavaratti 1 0.00% (bdd 1 0.00% 'me5' 1 0.00% 5m50 1 0.00% melandri 1 0.00% eurowindow 1 0.00% ssss 1 0.00% ekaphan 1 0.00% yuuech 1 0.00% mitsustar 1 0.00% aniekan 1 0.00% –con 1 0.00% (trình 1 0.00% jason 1 0.00% statham 1 0.00% transporter 1 0.00% avenue 1 0.00% joaquin 1 0.00% walk 1 0.00% daredevil 1 0.00% booth…

1 0.00% piven 1 0.00% jamie 1 0.00% foxx 1 0.00% farrell 1 0.00% 6th 1 0.00% gic 1 0.00% (lê 1 0.00% nordin 1 0.00% htv7 1 0.00% thiê1c… 1 0.00% najib 1 0.00% fong 1 0.00% ngu'1a…thân 1 0.00% flower 1 0.00% (hddnt 1 0.00% onn 1 0.00% torre 1 0.00% ballet 1 0.00% opera 1 0.00% prilly 1 0.00% là…bông 1 0.00% 918 1 0.00% tôi…thô1i 1 0.00% hoáy 1 0.00% yoga 1 0.00% ceyla 1 0.00% onwarin 1 0.00% kenisha 1 0.00% thom 1 0.00% nho'1t…con 1 0.00% 784 1 0.00% 1 0.00% vellu 1 0.00% entourage 1 0.00% sêri 1 0.00% hbo 1 0.00% carmen 1 0.00% tròn…trái 1 0.00% 683 1 0.00% samy 1 0.00% idol 1 0.00% luvoo 1 0.00% ddê3…phu5t 1 0.00% phu4m 1 0.00% caribe 1 0.00% shakira 1 0.00% hâ5u…hà 1 0.00% (puerto 1 0.00% asare 1 0.00% trêm 1 0.00% quiê1t 1 0.00% 2015

1 0.00% uy2nh 1 0.00% u4m 1 0.00% adriana 1 0.00% turbay 1 0.00% paula 1 0.00% betancourt 1 0.00% tru'o' 1 0.00% lu´c 1 0.00% paola 1 0.00% ru´t 1 0.00% ba´c 1 0.00% du'o'1i…nu'o'1c 1 0.00% nhên 1 0.00% 272 1 0.00% daza 1 0.00% kapur 1 0.00% magali 1 0.00% romitelli 1 0.00% adrienn 1 0.00% bende 1 0.00% neha 1 0.00% (hq 1 0.00% khê5 1 0.00% havard 1 0.00% tra3m 1 0.00% nát…mô5t 1 0.00% nê5 1 0.00% hungary 1 0.00% faurbiye 1 0.00% dina 1 0.00% fekadu 1 0.00% chiu5 1 0.00% 21h30 1 0.00% betina 1 0.00% (so5t 1 0.00% buô4i 1 0.00% ngâu 1 0.00% các… 1 0.00% rafaella 1 0.00% zanella 1 0.00% (commission 1 0.00% ddoì 1 0.00% ronie 1 0.00% guyer 1 0.00% saigonforsaigon 1 0.00% cuôc 1 0.00% hôp 1 0.00% nghe…và 1 0.00% (gddpt 1 0.00% fry 1 0.00% lorreta 1 0.00% ddâ1n

1 0.00% (hòa 1 0.00% ddu'o'1i 1 0.00% allard 1 0.00% maaskroon 1 0.00% rahir 1 0.00% jamais 1 0.00% coeurs 1 0.00% (tín 1 0.00% 6g30 1 0.00% inches 1 0.00% nãn 1 0.00% liege 1 0.00% calif 1 0.00% (little 1 0.00% nù 1 0.00% 75w 1 0.00% cato 1 0.00% stephany 1 0.00% dawson 1 0.00% nhâ4y 1 0.00% (tuyên 1 0.00% ngày…mô5t 1 0.00% (centre 1 0.00% carpenter 1 0.00% choviê5t 1 0.00% makarkine 1 0.00% cu'ú 1 0.00% (ldd 1 0.00% (hùng 1 0.00% (tin 1 0.00% thiê2ng 1 0.00% (mlnqvn 1 0.00% (cddvn 1 0.00% cung…lúc 1 0.00% ddìa 1 0.00% vâ1y 1 0.00% crystal 1 0.00% gi3a 1 0.00% gìo'1i 1 0.00% ddâu… 1 0.00% démocratique 1 0.00% humanisme 1 0.00% d'avroy 1 0.00% (monument 1 0.00% résistance 1 0.00% solidarité 1 0.00% 200kg 1 0.00% giu'1p 1 0.00% caritas 1 0.00% croix 1 0.00% bô4 1 0.00% (liège

1 0.00% toòng 1 0.00% hamburgers 1 0.00% qua5c 1 0.00% (hu'o'1ng 1 0.00% shirt 1 0.00% cung…nhu'ng 1 0.00% ghê1ch 1 0.00% ddúa 1 0.00% 0g 1 0.00% (chicago 1 0.00% uptown 1 0.00% 1106 1 0.00% vivre 1 0.00% cet 1 0.00% espace 1 0.00% nous 1 0.00% sommes 1 0.00% heureux 1 0.00% cùng…thu3y 1 0.00% ancêtres 1 0.00% restera 1 0.00% paix 1 0.00% liberté 1 0.00% 23g57 1 0.00% d'accueil 1 0.00% ót 1 0.00% tro'2 1 0.00% tu'…thâ1y 1 0.00% en 1 0.00% souvenir 1 0.00% l'exode 1 0.00% remercient 1 0.00% belgique 1 0.00% le5p 1 0.00% monde 1 0.00% réfugiés 1 0.00% vietnamiens 1 0.00% vody 1 0.00% putlizer 1 0.00% mineralnye 1 0.00% diversity 1 0.00% olmstead 1 0.00% (caucasus 1 0.00% sos 1 0.00% ga5o…ngoài 1 0.00% hollen 1 0.00% representative 1 0.00% ni5ch 1 0.00% weisel 1 0.00% ankara 1 0.00% matsukawa 1 0.00% liê5t…phâ5t

1 0.00% sanjay 1 0.00% (ha(1n 1 0.00% kashiwahara 1 0.00% lori 1 0.00% civil 1 0.00% social 1 0.00% justice 1 0.00% gupta 1 0.00% npr 1 0.00% mãi…cuô1i 1 0.00% supyan 1 0.00% escondido 1 0.00% (thí 1 0.00% sandwishes 1 0.00% mira 1 0.00% mesa 1 0.00% (vatican 1 0.00% benedicto 1 0.00% háu 1 0.00% (super 1 0.00% splitting 1 0.00% (tro'1 1 0.00% vista 1 0.00% khasbulatov 1 0.00% bayside 1 0.00% ruslan 1 0.00% gâu…gâu 1 0.00% nâ4n 1 0.00% lofgren 1 0.00% trannh 1 0.00% height 1 0.00% linda 1 0.00% tamarov 1 0.00% trê3n 1 0.00% sd 1 0.00% (oasis 1 0.00% meili 1 0.00% faille 1 0.00% parliamentary 1 0.00% 1 0.00% networks 1 0.00% soulanges 1 0.00% québec 1 0.00% satelitte 1 0.00% (bloc 1 0.00% budennovsk 1 0.00% vaudreuil 1 0.00% montréal 1 0.00% rob 1 0.00% (conservative 1 0.00% calgary 1 0.00% block

1 0.00% quen…chàng 1 0.00% (ndp 1 0.00% derek 1 0.00% (scarborough 1 0.00% cung…ít 1 0.00% tonks 1 0.00% waal 1 0.00% weston 1 0.00% waikiki 1 0.00% (aaja 1 0.00% journalist 1 0.00% gagra 1 0.00% (hawaii 1 0.00% sheraton 1 0.00% ddom 1 0.00% tritia 1 0.00% toyata 1 0.00% nhau…sô1ng 1 0.00% mi5nh 1 0.00% achievement 1 0.00% arlington 1 0.00% 7300 1 0.00% soldier 1 0.00% black 1 0.00% leyna 1 0.00% dzu4ng 1 0.00% orchid 1 0.00% valleyjo 1 0.00% furniture 1 0.00% galleries 1 0.00% april 1 0.00% gmc 1 0.00% realty 1 0.00% district 1 0.00% knowledge 1 0.00% 203 1 0.00% congresswoman 1 0.00% i'm 1 0.00% voter 1 0.00% permanent 1 0.00% gru…còn 1 0.00% status 1 0.00% gru… 1 0.00% consider 1 0.00% granting 1 0.00% congressman 1 0.00% (parliamentary 1 0.00% (rà 1 0.00% nhoa5ng 1 0.00% (countries 1 0.00% particular 1 0.00% concern

1 0.00% xeo 1 0.00% dear 1 0.00% senator 1 0.00% (nàng 1 0.00% ítn 1 0.00% trô1ng… 1 0.00% amended 1 0.00% should 1 0.00% include 1 0.00% relationship 1 0.00% both 1 0.00% countries 1 0.00% labor 1 0.00% unions 1 0.00% viê2n 1 0.00% vietnam's 1 0.00% chòe 1 0.00% form 1 0.00% sustainable 1 0.00% amend 1 0.00% cung…món 1 0.00% attach 1 0.00% days 1 0.00% would 1 0.00% ask 1 0.00% relevant 1 0.00% principle 1 0.00% fabrics 1 0.00% few 1 0.00% conditions 1 0.00% dài… 1 0.00% có… 1 0.00% lông… 1 0.00% vuô1t… 1 0.00% (wa 1 0.00% afraid 1 0.00% bô4ng… 1 0.00% phi5ch 1 0.00% wanneroo 1 0.00% de4 1 0.00% sse 1 0.00% (btc 1 0.00% phcg 1 0.00% nhoay 1 0.00% investor 1 0.00% statement 1 0.00% expression 1 0.00% ecosoc 1 0.00% itu 1 0.00% undp 1 0.00% larue 1 0.00% nhoáy

1 0.00% (phcg 1 0.00% 420ha 1 0.00% (president 1 0.00% chi4nh 1 0.00% fontelles 1 0.00% ddu5p 1 0.00% (ma5ng 1 0.00% sharer 1 0.00% josep 1 0.00% borrell 1 0.00% (ntu 1 0.00% thô3n 1 0.00% nanyang 1 0.00% hu5p 1 0.00% singarpore 1 0.00% mãi…nàng 1 0.00% (syrie 1 0.00% ouzbékistan 1 0.00% arabie 1 0.00% saoudite 1 0.00% oa(1t 1 0.00% libye 1 0.00% népal 1 0.00% motjaba 1 0.00% saminejad 1 0.00% (tunisie 1 0.00% tunisie 1 0.00% turkménistan 1 0.00% u'o'4n 1 0.00% journey 1 0.00% from 1 0.00% fall 1 0.00% kearney 1 0.00% he3 1 0.00% ddu'o'1c 1 0.00% lính…khiê1n 1 0.00% (quyê2n 1 0.00% mâ1p 1 0.00% mâ3m 1 0.00% nu'o'1 1 0.00% ganizhev 1 0.00% butcher 1 0.00% rar 1 0.00% 1200 1 0.00% dunnett 1 0.00% vremya 1 0.00% thích…ddang 1 0.00% urals 1 0.00% mcnulty 1 0.00% kushtov 1 0.00% isa 1 0.00% chài…thê1

1 0.00% challenge 1 0.00% brass 1 0.00% jurapa 1 0.00% (riverside 1 0.00% chê1t…ngô2i 1 0.00% 1735 1 0.00% ca3…may 1 0.00% ekazhevo 1 0.00% la(1n 1 0.00% califorina 1 0.00% 5g 1 0.00% janet 1 0.00% la(1m…ddôi 1 0.00% gw 1 0.00% du'o'1c 1 0.00% saconnex 1 0.00% interfax 1 0.00% thê2u 1 0.00% me5…me5…e5 1 0.00% tarkhan 1 0.00% vietact 1 0.00% zakayev 1 0.00% akhmed 1 0.00% bankstown 1 0.00% cabramatta 1 0.00% 'dã 1 0.00% laurie 1 0.00% mont 1 0.00% khattab 1 0.00% ore 1 0.00% ho'5 1 0.00% (tiêu 1 0.00% dduma 1 0.00% tu3i… 1 0.00% tass 1 0.00% tro3… 1 0.00% surikov 1 0.00% anton 1 0.00% tra(ng… 1 0.00% file 1 0.00% export 1 0.00% hu'o'ng… 1 0.00% beliefs 1 0.00% producers 1 0.00% cultural 1 0.00% products 1 0.00% un's 1 0.00% declaration 1 0.00% marcom 1 0.00% expressions 1 0.00% other 1 0.00% basic

1 0.00% yevloyev 1 0.00% rorahbacher 1 0.00% nu'1o'1c 1 0.00% nazran 1 0.00% komsomolskaya 1 0.00% no'i…mô5t 1 0.00% xu'a…vào 1 0.00% kê3… 1 0.00% novosty 1 0.00% cac 1 0.00% bu'o'1c… 1 0.00% nazir 1 0.00% núc 1 0.00% 100kg 1 0.00% manufacturing 1 0.00% (amtac 1 0.00% itar 1 0.00% hoi3 1 0.00% kháp 1 0.00% taì 1 0.00% aò 1 0.00% hai3 1 0.00% toan… 1 0.00% (ingushetia 1 0.00% vui…
