Paper
Document
Submit new version
Download
Flag content
0

Complete sequencing and characterization of 21,243 full-length human cDNAs

Authors
Toshio Ota,Yutaka Suzuki
Tetsuo Nishikawa,Tetsuji Otsuki,Tomoyasu Sugiyama,Ryotaro Irie,Ai Wakamatsu,Kôji Hayashi,Hiroyuki Sato,Keiichi Nagai,Kouichi Kimura,Hiroshi Makita,Mitsuo Sekine,Masaya Obayashi,Tatsunari Nishi,Toshikazu Shibahara,Toshihiro Tanaka,Shizuko Ishii,Junichi Yamamoto,Kaoru Saito,Yuri Kawai,Yuko Isono,Yoshitaka Nakamura,Kenji NAGAHARI,Katsuhiko Murakami,Kei Yura,Takao Iwayanagi,Masako Wagatsuma,Akiko Shiratori,Hiroaki Sudo,Takehiko Hosoiri,Yoshiko Kaku,Hiroyo Kodaira,Hiroshi Kondo,Masanori Sugawara,M. Takahashi,Katsuhiro Kanda,Tatsuya Yokoi,Tetsuo Furuya,Emiko Kikkawa,Yoko Omura,Kumi Abe,Kumiko Kamihara,Naoko Katsuta,Kazuomi Sato,Machiko Tanikawa,Makoto Yamazaki,Ken Ninomiya,Tadashi Ishibashi,Hiromichi Yamashita,Katsuji Murakawa,Kiyoshi Fujimori,Hiroyuki Tanai,Manabu Kimata,Motoji Watanabe,Susumu Hiraoka,Yoshiyuki Chiba,Shinichi Ishida,Yukio Ono,Sumiyo Takiguchi,Susumu Watanabe,Makoto Yosida,Tomoko Hotuta,Junko Kusano,Keiichi Kanehori,Asako Takahashi-Fujii,Hiroto Hara,Tomo-o Tanase,Yoshiko Nomura,Sakae Togiya,Fukuyo Komai,Reiko Hara,Kazuha Takeuchi,Miho Arita,Nobuyuki Imose,Kaoru Musashino,Hisatsugu Yuuki,Akira Oshima,Naokazu Sasaki,Satoshi Aotsuka,Yoko Yoshikawa,Hiroshi Matsunawa,Tatsuo Ichihara,Namiko Shiohata,Sanae Sano,Shogo Moriya,Hiroko Momiyama,Noriko Satoh,Sachiko Takami,Yuko Terashima,Osamu Suzuki,Satoshi Nakagawa,Akihiro Senoh,Hiroshi Mizoguchi,Yoshihiro Gotō,Fumio Shimizu,H Wakebe,Haretsugu Hishigaki,Takeshi Watanabe,Akio Sugiyama,Makoto Takemoto,Bunsei Kawakami,Masaaki Yamazaki,Koji Watanabe,Akira Kumagai,Shoko Itakura,Yasuhito Fukuzumi,Yoshifumi Fujimori,Megumi Komiyama,Hiroyuki Tashiro,Akira Tanigami,Tsutomu Fujiwara,Toshihide Ono,Katsue Yamada,Yuka Fujii,Kouichi Ozaki,Maasa Hirao,Yoshihiro Ohmori,Ayako Kawabata,Takeshi Hikiji,Naoko Kobatake,Hiromi Inagaki,Yasuko Ikema,Sachiko Okamoto,Rie Okitani,Takuma Kawakami,Satoko Noguchi,Tomoko Itoh,Keiko Shigeta,Tadashi Senba,Kyoka Matsumura,Yoshie Nakajima,Takae Mizuno,Misato Morinaga,Masahide Sasaki,Takushi Togashi,Masaaki Oyama,Hiroko Hata,Manabu Watanabe,Takami Komatsu,Junko Mizushima‐Sugano,Tadashi Satoh,Yuko Shirai,Yukiko Takahashi,K. Nakagawa,Koji Okumura,Takahiro Nagase,Nobuo Nomura,Hisashi Kikuchi,Yasuhiko Masuho,Riu Yamashita,Kenta Nakai,Tetsushi Yada,Yusuke Nakamura,Osamu Ohara,Takao Isogai
+154 authors
,Sumio Sugano
Published
Dec 21, 2003
Show more
Save
TipTip
Document
Submit new version
Download
Flag content
0
TipTip
Save
Document
Submit new version
Download
Flag content

Abstract

As a base for human transcriptome and functional genomics, we created the “full-length long Japan” (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at ∼58% compared with a peak at ∼42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at ∼42%, relatively low compared with that of protein-coding cDNAs.

Paper PDF

This paper's license is marked as closed access or non-commercial and cannot be viewed on ResearchHub. Visit the paper's external site.