ARTICLE Received 25 Dec 2014 | Accepted 29 Jul 2015 | Published 10 Sep 2015

DOI: 10.1038/ncomms9212

OPEN

Outbred genome sequencing and CRISPR/Cas9 gene editing in butterflies Xueyan Li1,*, Dingding Fan2,*, Wei Zhang3,*, Guichun Liu1,*, Lu Zhang2,4,*, Li Zhao1,5, Xiaodong Fang2, Lei Chen1,6, Yang Dong1,w, Yuan Chen1,w, Yun Ding1,w, Ruoping Zhao1, Mingji Feng2, Yabing Zhu2, Yue Feng2, Xuanting Jiang2, Deying Zhu2,7, Hui Xiang1, Xikan Feng2,4, Shuaicheng Li4, Jun Wang2,**, Guojie Zhang2,8,**, Marcus R. Kronforst3,** & Wen Wang1,**

Butterflies are exceptionally diverse but their potential as an experimental system has been limited by the difficulty of deciphering heterozygous genomes and a lack of genetic manipulation technology. Here we use a hybrid assembly approach to construct high-quality reference genomes for Papilio xuthus (contig and scaffold N50: 492 kb, 3.4 Mb) and Papilio machaon (contig and scaffold N50: 81 kb, 1.15 Mb), highly heterozygous species that differ in host plant affiliations, and adult and larval colour patterns. Integrating comparative genomics and analyses of gene expression yields multiple insights into butterfly evolution, including potential roles of specific genes in recent diversification. To functionally test gene function, we develop an efficient (up to 92.5%) CRISPR/Cas9 gene editing method that yields obvious phenotypes with three genes, Abdominal-B, ebony and frizzled. Our results provide valuable genomic and technological resources for butterflies and unlock their potential as a genetic model system.

1 State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China. 2 BGIShenzhen, Shenzhen 518083, China. 3 Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA. 4 City University of Hongkong, Hongkong, China. 5 Department of Evolution and Ecology, University of California Davis, Davis, California 95616, USA. 6 University of Chinese Academy of Sciences, Beijing 100049, China. 7 School of Bioscience and Biotechnology, South China University of Technology, Guangzhou 510641, China. 8 Centre for Social Evolution, Department of Biology, Universitetsparken 15, University of Copenhagen, Copenhagen DK-2100, Denmark. * These authors contributed equally to this work. ** These authors jointly supervised this work. w Present addresses: College of Life Science, Kunming University of Science and Technology, Kunming 650093, China (Dong Y.); Department of Medicine, Duke University Medical Center, Durham, North Carolina 27710, USA (C.Y.); Janelia Research Campus, Howard Hughes Medical Institute, Ashbum, VA 20147, USA (Ding Y.). Correspondence and requests for materials should be addressed to W.W. (email: [email protected]) or to M.R.K. (email: [email protected]).

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

1

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

B

utterflies are famous for their extraordinarily diverse wing patterns, which differ not only among species but also among populations, sexes and even seasonal forms1,2. Wing patterns are highly variable, because they are multifunctional, involved in roles from crypsis to warning colouration, mimicry, thermoregulation and mate selection2,3. Beyond wing pattern, butterflies are also diverse in virtually all aspects of their biology, ranging from behaviour and biogeography to cellular biology and biochemistry, with decades of study having placed much of this variation in a well-resolved ecological context4. These features make butterflies a promising system to explore the genetics, evolution and development of morphological diversification and speciation5,6. Although butterflies have notable strengths in terms of natural variation and ecology, they also suffer critical shortcomings in the quest to characterize the genetic basis of organismal phenotypes, in particular difficulty in deciphering usually heterozygous genomes and a lack of functional genetics methodology. For instance, although there are estimated 18,000 butterfly species, there are currently only 6 butterfly genome sequences7–11. If additional, high-quality genomes from closely related species were available, it would be feasible to comprehensively trace genetic changes responsible for phenotypic change over evolutionary time. However, the nature of high heterozygosity in most wild insects, including butterflies, has hampered efforts to obtain high-quality reference genomes7,12,13. In addition, efficient and precise genetic manipulation technologies are indispensable for a model organism. However, general and efficient genetic manipulation of butterflies has not been reported, which, together with limited genomic resources, has greatly restricted their application as model organisms. The only case of genome editing in a butterfly is the use of zinc-finger nucleases in the monarch butterfly14. Recently, clustered regulatory interspaced short palindromic repeat (CRISPR)-associated (Cas)-based RNAguided DNA endonucleases such as the Streptococcus pyogenes Cas9 nuclease (CRISPR/Cas9) has emerged as an efficient tool for gene editing across a wide spectrum of organisms15,16, including insects such as fruitfly Drosophila melanogaster17,18 and silkworm Bombyx mori19,20. P. xuthus and P. machaon are two closely related swallowtail butterfly species from the most basal lineage of butterflies, the family Papilionidae. Despite being closely related, P. xuthus and P. machaon differ in many aspects of their biology, including adult and larval colour pattern, larval host plants and geographic distribution, with P. xuthus mainly distributed in East Asia and P. machaon widely distributed across Asia, Europe and North America. Papilionidae is one of the most historically significant groups of butterflies; P. machaon was named as the type species

for all butterflies by Linnaeus21 and since then the group has been a long-term focus in the study of mimicry, vision and learning, and pigmentation3. Here we present high-quality reference genomes for the two highly heterozygous and closely related butterflies P. xuthus and P. machaon. Integrating comparative genomics and analyses of gene expression yields multiple insights into butterfly evolution. We develop an efficient and widely applicable CRISPR/Cas9 gene editing method that results in obvious phenotypes with three genes, Abdominal-B (Abd-B), ebony and frizzled (fz). Our results provide valuable genomic and technological resources for butterflies and unlock their potential as a genetic model system. Results Genome sequencing and assembly. We collected P. xuthus and P. machaon samples from the suburb of Ya’an (Sichuan, China) (Supplementary Fig. 1). The heterozygosity of the two species was high, 1.008% for P. xuthus and 1.229% for P. machaon, and initial assembly using standard Illumina next-generation sequencing data resulted in poor assemblies (Supplementary Note 1, Supplementary Tables 1–3 and Supplementary Fig. 2). Therefore, we used 454FLXPlus long reads (Z700 base pair (bp)) in combination with a large quantity of Illumina short reads (100–150 bp, 487  ), to assemble contigs, and we then used Illumina long-insert mate-pair sequencing data to generate scaffolds (Supplementary Note 1, Supplementary Tables 1–11 and Supplementary Figs 2–6). This hybrid assembly approach eliminated issues due to high heterozygosity, yielding a final assembly comprising 244 Mb with contig N50 of 492 kb and scaffold N50 of 3.4 Mb for P. xuthus and a 281-Mb assembly with contig N50 of 81 kb and scaffold N50 of 1.15 Mb for P. machaon (Table 1 and Supplementary Tables 6 and 7). Three evaluation methods show that our assemblies are quite complete and reliable (Supplementary Note 1 and Supplementary Tables 8–11). The assembled genome sizes are consistent with estimates by both k-mer analysis (Supplementary Table 3) and by flow cytometry (Supplementary Table 5). Notably, the contig N50 sizes for both genomes are large compared with those of all published Lepidoptera genomes (Fig. 1a), and the contig N50 size of P. xuthus genome is the largest among all published animal genomes excluding such classical models as fruitfly D. melanogaster, human Homo sapiens, mouse Mus musculus and rat Rattus norvegicus (Supplementary Table 12). We generated a linkage map using 74 offspring from a P. xuthus cross and used it to assemble 87% of the P. xuthus genome into 30 chromosomes (Supplementary Note 2, Supplementary Table 13 and Supplementary Fig. 7). Based on the P. xuthus chromosomal assembly, we compared syntenic

Table 1 | Comparison of reference genomes among Lepidopterans.

Chromosome number (n) Genome size (Mb) based on the bases in contigs and in scaffolds Genome size (Mb) measured by C-value N50 of contig (kb)/scaffold (Mb) TE (% genome) Number of protein-coding genes

Pm 31* 265/281 234B256 81/1.15 21.10 15,499

Px 30* 231/244 218B238 492/3.4 20.26 15,322

Hm 21w 274/269w 300# 51/0.2w 28.47w 15,984**

Dp 29–30z 242z/249z 290# 51z,/0.7z 12.21z 16,866z

Bm 28y 432/481y 520# 13/3.7y 21.1y 14,623y

PLX 31|| 385/394|| NA 49/0.72|| 33.97|| 18,071||

Bm, B. mori; Dp, D. plexippus; Hm, H. melpomene; Plx, P. xylostella; Pm, P. machaon; Px, P. xuthus. *From Maeki59. wFrom Dasmahapatra et al.8 zFrom Zhan et al.7 yFrom Xia et al.60 ||From You et al.13 zFrom Zhan et al.61 #From Gregory et al.62 **Predicted in this study using the same pipeline as in Pm and Px.

2

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

that of B. mori, although a large fragment of P. xuthus chrZ is homologous to a region of B. mori chr5. The assembled genomes of P. xuthus and P. machaon have similar composition of repetitive element (Supplementary Note 3,

relationships among P. xuthus, P. machaon and B. mori (Fig. 1b), and found that chromosome 8 (chr8) of P. xuthus resulted from a fusion of ancestral lepidopteran chr8 and chr31. The Z chromosome (that is, chr30) of P. xuthus is highly homologous to

a

b Outgroup (moths)

8

Papilionidae

B. mori

10

9

11

12 13

7

14

6

15

5

16 4

P. xuthus P. machaon P. glaucus P. polytes

3

17

21

27 28 29

20

30(Z )

1(Z

19

)

2

18

24

24

Hesperiidae

23

25 26

22

Hedylidae

27

21

26

22

25

23

Pieridae

20 28

Riodinidae

17

1

18

31

8

19

16

Lycaenidae

2 15

3

14

4

D. plexippus H. melpomene M. cinxia

Px02642.FPPS Pm07701.FPPS Cf.AY95 4919.FP PS Bm.A B27499 Bm0 7.FPPS 0 3 Px0 1927.FP PS Pm 2635.F P 0 PS Px 7193 Pm 0263 .FPP S Px 076 8.FP PS 95 P 02 Pm m07 637 .FPP 07 192 .FPP S 69 .F S 4. PP FP S PS

3.7 2.5 1.2 0.0 –1.2 –2.5 –3.7

Px_02638_Fpps

Pm_07190_Fpps

Px_02636_Fpps

Pm_16572_ qm

Px_02640_Fpps

4.2 2.8 1.4 0.0 –1.4 –2.8 –4.2

Pm_07696_Fpps Pm_07701_Fpps

Px_02639_Fpps Pm_07192_Fpps Px_02641_Fpps Pm_07694_Fpps Px_09247_Fpps Pm_07695_Fpps Px_00691_Fpps Pm_07698_Fpps Px_03398_qm Px_00692_Fpps

6

7

Dm0086586.A21 Bm001573.AU1 Bm001 Px0745419.B29 Pm02999.B46 9.B29 Px0 Pm 3088.B Px 13631. 46 Pm03679 B46 Pm 0715 .B46 Px0 1363 2.B50 0.B P 2 P x00 302.B 46 21 P m0 43 P x0 70 2.B P x0 22 95.B 1 Px x02 227 82.B 1 P 3 9 1 Pm x0 023 01. .B1 2 07 30 00.B B20 11 3.B 21 3. 21 B1

B1 4. .B1 30 14 B1 V2 02 71 5. B3 Px m0 378377. .B1 P x0 07 78 21 P m 63 7.B 21 P m0 08 9.B P x03 62 21 P m13 86.B 21 P x030 84.B 21 P 030 88.B Px 022 2.B21 Pm 1362 .B21 Pm 02287 21 Pm 286.B Pm02085.B16 Px03 627.B21 Pm13 9.B17 Pm0228 Pm00559.B1 Pm01812.B47 Pm13096.B21 Pm13624.B21 BM006785.AY 1 Dp2182 Px17 8.AY1 Pm 588.N9 Px0 19507. Pm01130.ABC2 E 8 12 Bm 14 B 013 6.AE12 B m01 237 B m01 3236 .AE2 B m0 323 .AE B m0 132 8.A 2 Bmm01132 41.A E7 4 E H H m 01 323 0.A 7 D m 08 15 9. E7 D p1 087 751 45.A AE7 p1 44 5 .A E 44 25 0.A E1 22 23 .A E1 2 .A E3 2 E3 2 2

H H m1 Hm m1 501 1 50 2 Hm Dp0 50 13. .AB 1 A 7 H 000 56 4.A B 3 Hm m02 53. 6.ABB3 3 A Dp 0202024. B1 3 Bm 0116 5.ABAB53 Pm 012081.AB 13 1 9 1 Px1 5911.A.AB53 7 Hm0 814.ABB13 Hm044341.A 13 N 34 Hm06 2.AN55 376.AN Hm0434 5 3. Hm04344. AN5 AN Dp15918.AN25 Pm14020.AN5

Pm_07194_Fpps

g

Px eg Px g L1 Px L2 Px L3 Px L4 Px L5 Px pu Px pa F p Px upa M ad Px ult F ad ult M

Px00691.FPPS

Px_02637_Fpps

f

Pm e Pm gg L Pm 1 L Pm 2 L Pm 3 L Pm 4 L Pm 5 p Pm upa F p Pm upa M a Pm dult ad F ult M

Rp

Rp

Px eg Px g L1 Px L2 Px L3 Px L4 Px L5 Px pu Px pa F pu Px pa M ad Px ult F ad ult M

1.8 1.2 0.6 0.0 –0.6 –1.2 –1.8

8

Pm_07700_Fpps Pm_10221_Fpps

Px_02643_Fpps

Pm_17866_Fpps

Px_02635_Fpps

Pm_07702_Fpps

Px_09248_Fpps

Pm_07193_Fpps

Px_02642_Fpps

Pm_12194_Fpps

32 AE 2 3. E1 2 28 1.A E1 2 10444 2.A E1 2 A Hmp1 444 24. .AE3 7 D p1 44 73 E 8 D 1 8 .A 2 Dpm0102035.AE 12 H m1 020 .AE 8 H m1 204 E2 H m10 06.A B4 H 102 91.A Hm 0066 B4 Bm06369.A.AB4 Px 13205 2 Pm 6368.AN Px0 4.AN2 Pm1320 8.AN5 Hm0215 .AN5 Dp00371 Bm010854.AN2 Px13056.AN5

e

9

P. machaon

D D m0 Dm m0 07 Dp 00 083 183 6 Hm163 887 97 9.D Pm 01 01. 07. .D4 2 Px 165 574 BQ1D5 15 24 .A1 2 75 .B 4 8.B Q5 Q5 Dp 11 Dp0 529.A B 3 Dp1 899.A 14 3 Dp134404.ABB4 Hm12 03.AB 4 25 5 Hm1225 4.AB14 5.A Hm09213. B14 Hm00991.AAB4 B14 Pm18236.AB4 Px06728.AB14 Pm02415.AB4 Px06780.AB144 .AB Bm012385 86.AB4 Bm0123 6.AB14 4 73 Pm18 361.AB114 2 B Pm0 2367.A B4 .A Pm0 18737 B14 Pm 8708.A B144 .A 1 1 6 Pm 0236 7.AB 14 Pm 1699 8.ABB14 4 Px 699 4.A B1 4 1 2 A B Px 049 23. 2.A 14 4 9 B Pm 04 492 1.A B1 14 Pm m0 92 6.A AB P 04 99 5. Pm x16 699 P x1 P

S PP PS .F P 04 8.F 71 00 PS 00 966 .FP P 7 A AY 50 PS G . 3 P A Ng 95 PS A .AY 257.F 2.FP Ip 009 0978 S TC .KC1 6.FPP 5 S Pc 092 GPP 0 TC 1521.G .GPPS Dp1 953508 PS Ip.AY 40.FP PS2 Px026 4996.FP Bm.AB27 Hm02284.FPPS

S PP G PPS 9. 63 6.F PS 02 69 .FP S Px m07 247 .FPP P 09 66 PPS Px 178 21.F S Pm 102 .FPP Pm 04052 .FPPS Px 194 PPS 2 1 Pm 7698.F Pm0 1.FPPS Px0264 0.FPPS Pm0770 FPPS GB12859. Px02636.FPPS

Dp .H 212 Q 6 Dp 850 3.G 15 37 G P .H Q8 275. 3.FP PS G TC 5 00 037 GP PS 49 2.F PS Ap.A Mp.E Y96 78.G PPS GP U33 8 5 8 4430 P Mp.A .FPP 5.FPP S Y9685 S S/G P 86.G PPS/F PS PPS Hm0147 5.GGPPS Rs.AB5483 55.GGPPS Pm07194.FPPS

00

Px02643.FPPS Bm.AB Bm00 274995.FPPS 1 1926 .FPP Mu.A S Ai.A Y9549 21.F PPS Cf. J0099 Px AY95 62.FP PS CC 0069 4920 .FP C G0 2.F PS C CG 09 PPS CG CG 009 927. 12 009 627 2.FP PS 38 92 .1. 9. 8.1 FP PS FP . PS FPP S

Pm07702.FPPS PS Dp17841.FP PS 4.GGP S Hm0147 GPP PS 291.G Hm04 74.GGP S P 60 GP B26 5.G PPS Nt.A G S 607 B26 079.G GPP S 6 Nt.A P .G 6 1 2 GP PS 08 AB Nt. B266 90.G GP PS G P S 2 A 04 62. G PP Nt. Hm 212 01.G G G Dp 024 72. 0 65 Bm m1 P

03 3 68 98. G Dm C 9 .AF G85 4.G GP G P 9 04 96 3.GG PPS S 59 .G PPS GP PS GB 107 56.F Pm0 PPS 71 GB18 90.FPP S 623.F Dp02463.G PPS GPPS Hm04293.G GPPS GB19361.FPPS Hm00118.GGPPS GGPPS Hm04292. PS 6.GGP Hm0147 PS 866.FP S GB11 .GGPP S 6 868 5.FPP S 1 B G 8 P 123 7.FP S P GB 3 .FP S 153 GB 9248 GPP S P 0 Px 85.G .FP PS 9 8 P S 15 038 GG PP p D 95 4. /G 6 1 S 2 00 21 PP M p .X D 1.F 3 Ap 44 33 EU p. M AP

10

Dm0086587.A9 Dm0085466.A2 553.A18 Dm0084 RA 585.A8A Dm0086 A19-R 86583. A20 Dm00 0086584. .A17 Dm 86581 3-RA 0 Dm0 582.A20.A223 6 658 .A1 008 Dm Dm008 7770 4-RA1 008 .A1 6.U Dm 78148551 1.T31 008 00 710 8.T 1 Dm Dm 008 69068.VG1 7 2 Dm 00 769 00. .G 1 Dm 00 871 117 2.W Dm 00 087 33 Dm m0 085 D m0 D

d

Px

11

P. xuthus

c

AG

5 12

Px_02282_Cyp6a2 Px_02279_Cyp6a2 Px_03087_Cyp6a2 Px_02298_Cyp6a23 Px_06780_Cyp6a2 Px_15758_Cyp6a13 Px_02303_Cyp6a2 Px_02242_Cyp6a18 Px_00432_Cyp6a2 Px_02302_Cyp6a2 Px_06369_Cyp6a9 Px_17814_Cyp6a2 Px_03679_Cyp6a8 Px_06368_Cyp6a18 Px_02301_Cyp6a23 Px_03084_Cyp6a2 Px_02300_Cyp6a2 Px_03785_Cyp6a2 Px_03086_Cyp6a8 Px_16996_Cyp6a2 Px_03088_Cyp6a13 Px_03085_Cyp6a2 Px_16997_Cyp6a2 Px_07459_Cyp6a2 Px_16998_Cyp6a2 Px_16995_Cyp6a2 Px_01130_Cyp6a2 Px_13056_Cyp6a2 Px_02304_Cyp6a2 Px_06782_Cyp6a2 Px_17588_Cyp6a13

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

h 4.8 3.2 1.6 0.0 –1.6 –3.2 –4.8

Pm e Pm gg L Pm 1 L Pm 2 L Pm 3 L4 Pm L5 Pm p Pm upa F p Pm upa ad M Pm ult ad F ult M

13

Nymphalidae

Pm_00559_Cyp6a13 Pm_13205_Cyp6a13 Pm_01812_Cyp6a2 Pm_15911_Cyp6a18 Pm_07095_Cyp6a2 Pm_04921_Cyp6a2 Pm_04922_Cyp6a13 Pm_07113_Cyp6a2 Pm_02361_Cyp6a2 Pm_06378_Cyp6a2 Pm_02289_Cyp6a18 Pm_02288_Cyp6a2 Pm_13622_Cyp6a2 Pm_02287_Cyp6a2 Pm_04924_Cyp6a2 Pm_07152_Cyp6a2 Pm_13630_Cyp6a13 Pm_02286_Cyp6a2 Pm_02999_Cyp6a2 Pm_02366_Cyp6a2 Pm_02367_Cyp6a2 Pm_04923_Cyp6a2 Pm_18736_Cyp6a2 Pm_18737_Cyp6a18 Pm_13627_Cyp6a2 Pm_13096_Cyp6a2 Pm_13624_Cyp6a2 Pm_18708_Cyp6a13 Pm_13631_Cyp6a13 Pm_13629_Cyp6a2 Pm_07377_Cyp6a2 Pm_13204_Cyp6a18 Pm_18236_Cyp6a13 Pm_19507_Cyp6a13 Pm_02415_Cyp6a2 Pm_07112_Cyp6a17 Pm_16524_Cyp6a13 Pm_08146_Cyp6a2 Pm_07114_Cyp6a2 Pm_14020_Cyp6a2

3

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

Supplementary Tables 14–16 and Supplementary Fig. 8) and are predicted to contain 15,322 and 15,499 protein-coding genes, respectively (Table 1, Supplementary Note 4, Supplementary Tables 17 and 18, and Supplementary Figs 9–11). More than 80% of gene models are supported by evidence from at least two prediction methods (ab initio, homology and RNA sequencing (RNA-seq); Supplementary Fig. 9) and both genomes show similar gene features to those of other Lepidoptera (Supplementary Table 17 and Supplementary Figs 10 and 11). About 50% of gene families are conserved in butterflies (Supplementary Table 18 and Supplementary Fig. 12). We inferred a phylogenetic tree among representative taxa spanning holometabolous insects including seven lepidopteran species (P. xuthus, P. machaon, Heliconius melpomene, Danaus plexippus, Melitaea cinxia, B. mori and Plutella xylostella) using 1,071 single-copy orthologues (Supplementary Note 5 and Supplementary Fig. 13). We analysed lineage-specific genes and gene families, and identified positively selected genes (Supplementary Note 5, Supplementary Tables 19–26 and Supplementary Figs 13 and 14). We also obtained genome-wide gene expression profiles for P. xuthus and P. machaon at each of ten development stages: egg, each of the five larval instars, male pupa, female pupa, male adult and female adult (Supplementary Note 6, Supplementary Tables 27–30 and Supplementary Figs 15–19). By integrating these multiple layers of information, we inferred important genes and gene pathways in butterfly evolutionary history and then we developed an efficient and widely applicable genome-editing method for butterfly functional genetics. Assembly, gene annotations, gene family and developmental transcriptome can be obtained from ftp://ftp.genomics.org.cn/ pub/papilio. Comparative genomics. Comparison with other insect genomes revealed interesting evolution patterns, in particular evolution in the juvenile hormone (JH) pathway in tandem with butterfly morphological diversification. JHs are key hormones in insect development. There are six different JH forms in insects and, interestingly, Lepidoptera have five of the six JH forms, while other insects generally have only one JH form22. Short-chain isoprenyl diphosphate synthases (scIPPSs) are key enzymes in the JH synthesis pathway23 (Supplementary Fig. 20). Based on comparisons across holometabolous insects, all butterflies experienced scIPPS gene expansion (Fig. 1c, Supplementary Note 7 and Supplementary Tables 31 and 32). Furthermore, scIPPSs can be classified into farnesyl pyrophosphate synthases (FPPSs) and geranylgeranyl pyrophosphate synthases (GGPPSs). In butterflies, FPPS genes expanded from 1 copy to 14 copies in Papilio, while GGPPS genes expanded from 1 copy to 7–8 copies in nymphalid butterflies (Fig. 1c and Supplementary Table 32). It has been suggested that in moths it is alternative splicing of the scIPPS genes that contributes to diverse JH forms24. In addition, notable expansions of genes encoding JH epoxide hydrolase and

JH diol kinase in Papilio (Supplementary Table 31 and Supplementary Fig. 21), combined with the prominent expansion of scIPPS genes in butterfly JH biosynthesis, suggest significant diversification of JH metabolism in butterflies, which may be related to the evolution of their numerous morphological forms. Interestingly, the scIPPS genes may have also played roles in the differentiation among Papilio species, because we observed that although P. machaon and P. xuthus share the same 14 FPPS genes, these orthologues were differentially expressed between P. machaon and P. xuthus (Fig. 1e,f and Supplementary Table 33). Collectively, the above observations suggest that scIPPSs might have played roles in both the deep evolutionary history of butterflies as well as very recent divergence between closely related species. Comparative genomics also revealed evidence for genes critical in host plant adaptation. Although P. machaon and P. xuthus are congeneric species, their host plants are different. As larvae, P. xuthus feeds on plants in the family Rutaceae, such as citrus, whereas P. machaon feeds on plants in the family Apiaceae. Both groups of plants contain toxic furanocoumarins, but Apiaceae have higher levels of furanocoumarins than Rutaceae25. By comparing P. xuthus and P. machaon genomes with those of D. plexippus, H. melpomene, B. mori and D. melanogaster, we found CYP6, a cytochrome P450 monooxygenase (P450) gene family, expanded in swallowtail butterflies (Supplementary Note 8, Supplementary Table 34 and Supplementary Fig. 22). CYP6, together with CYP9 and a-esterase families, are thought to contribute to xenobiotic detoxification in insects26. We observed that 50% of CYP6 members in the two Papilio genomes belong to the CYP6B subfamily, while there is no CYP6B in the monarch or passion-vine butterfly genomes, species whose host plants have no toxic furanocoumarins (Fig. 1d, Supplementary Table 35 and Supplementary Fig. 22). Higher expression of CYP6 genes in larvae of P. machaon and P. xuthus, compared with other developmental stages (Fig. 1g,h), suggests that CYP6 genes may play an important role in larval feeding. CYP6AB is another P450 subfamily found only in Lepidoptera. We found additional expansions of CYP6B and CYP6AB in P. machaon only (Fig. 1d and Supplementary Fig. 22), which may account for this species’ ability to feed on Apiaceae plants with a higher content of furanocoumarins. Genomic divergence among swallowtail species. There are striking morphological differences between P. xuthus and P. machaon, especially at the fifth-instar larval ontogenetic stage (Supplementary Fig. 1). Similar to caterpillars of hundreds of other tropical butterfly species, the fifth instar larvae of P. xuthus display false eyespots on their metathorax, which permit them to escape predation by mimicking venomous snakes27,28. On the other hand, fifth instar larvae of P. machaon exhibit disruptive colouration characterized by highly contrasting patterns of black and green stripes with yellow spots, providing camouflage that conceals body shape at a distance, while serving as warning

Figure 1 | Butterfly comparative genomics. (a) Phylogeny of butterfly families58 showing relationship of P. xuthus and P. machaon to D. plexippus, H. melpomene, M. cinxia, P. glaucus and P. polytes. (b) Chromosome mapping of P. xuthus (n ¼ 30) to B. mori (n ¼ 28). For P. machaon (n ¼ 31), we only plot chromosome 8 (chr8) and chr31, which were fused in P. xuthus. (c) Maximum-likelihood tree showing strong expansion of the scIPPS genes in the genomes of swallowtail butterflies. Included are all scIPPS genes identified in the genomes of ten holometabolous insects (P. xuthus, P. machaon, D. plexippus, H. melpomene, B. mori, P. xylostella, A. gambiae, D. melanogaster, T. castaneum and A. mellifera). The clades of P. xuthus and P. machaon are highlighted by light blue and deep blue, respectively. (d) Maximum-likelihood tree showing expansions of CYP6B genes in the genomes of P. xuthus and P. machaon, and of the CYP6AB genes in the genome of P. machaon, as compared with CYP6 genes of D. plexippus, H. melpomene, B. mori and D. melanogaster. In c,d, the clades of P. xuthus and P. machaon are highlighted by light blue and deep blue, respectively. (e,f) Expression profiles of FPPS and GGPPS genes at all development stage of P. xuthus (e) and P. machaon (f). (g,h) Expression profiles of CYP6 genes at all development stage of P. xuthus (g) and P. machaon (h). Expression measured in reads per kilobase of transcript per million reads mapped (RPKM). 4

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

colouration at close range29,30. We found a number of genes related to morphological traits that were expanded, positively selected, or differentially expressed in P. xuthus and P. machaon (Supplementary Notes 9–11, Supplementary Tables 36–40 and Supplementary Figs 22–36). Butterfly larvae, swallowtails’ in particular, frequently have marked green pigmentation, which consists of blue bile binding to bilin-binding protein and a yellow component binding to proteins encoded by yellow-related gene31,32. Interestingly, we observed an expansion of the bilinbinding protein gene family in butterflies, especially in swallowtails (Supplementary Table 36 and Supplementary Fig. 32), which may be related to the exceptionally green caterpillars in swallowtails. We also identified a Papilio-specific single-copy yellow-related gene and found it was expressed at a much higher level in the fifth instar larvae of P. machaon compared with that of P. xuthus (Supplementary Table 36 and Supplementary Figs 33 and 34), corresponding to the stronger yellow marking in P. machaon (Supplementary Fig. 1). To better characterize patterns of genome evolution among closely related swallowtail species, we performed a genome-wide sliding window analysis of DNA sequence divergence among P. xuthus, P. machaon and a third Papilio species, Papilio polytes (Fig. 2, Supplementary Note 12, Supplementary Tables 41–43 and Supplementary Fig. 37) with its resequencing data available33. A total of 915 genes showed strong divergence in all pairwise comparisons. By integrating this with results from our analyses of recent positive selection and differential gene expression, and looking for overlap among data sets, we found a small subset of genes that might have played an important role in recent diversification among these three closely related Papilio species (Supplementary Table 43). One particular gene was frizzled (fz). We found that the genome window containing the 50 -untranslated region of fz emerged as a highly divergent

0.1

segment in all pairwise comparisons among the three Papilio species, but the coding region itself did not. This gene also showed strong differential expression between P. xuthus and P. machaon across all developmental stages (Supplementary Note 11, Supplementary Table 39 and Supplementary Fig. 36), 1 of only 32 genes to do so (and 1 of only 4 highly divergent genes to do so) (Supplementary Fig. 17). Frizzled is well-known for its role in polarizing epithelia during development (planar cell polarity, PCP), but it also plays a distinct role patterning the larval cuticle during embryogenesis34. In Drosophila, fz interacts with wingless, armadillo and other genes, to determine the distribution of naked cuticle versus denticle on larvae35. Given the highly divergent larval morphology among swallowtails, we hypothesized that fz may play a similar role in influencing larval cuticle development, in which case DNA sequence and expression divergence at this gene could be a result of natural selection on larval anatomy. Furthermore, another PCP pathway (Fat/Dachsous) that also influences larval denticle patterning in Drosophila36 showed evidence of recent, positive selection in P. machaon (Supplementary Tables 38–40). Experimentally, testing genes’ potential role in morphological diversification among Papilio species required functional genetics methodology not available in butterflies. CRISPR/Cas9 system. Given the recent, widespread utility of CRISPR/Cas9 (refs 15,16), we decided to implement a CRISPR/ Cas9 genome-editing approach in P. xuthus (Table 2, Supplementary Note 13, Supplementary Tables 44–50 and Supplementary Figs 38–55). Before targeting candidate diversification genes, such as fz, we developed our pipeline (Supplementary Fig. 38) by focusing on the gene Abd-B and we then verified it with ebony (Supplementary Table 44 and Supplementary Fig. 39), because these two genes are expected to have predictable and

fz

0.05

0

1

2

3

5

4

6

7

8

9

10

0.1

df 0.05

0

11

12

13

15

14

16

17

18

19

20

0.1 10 Mb

0.05

0

21

22

23

24

25

26

Genes differentially expressed throughtout development Positively selected genes in P. xuthus and P. machaon Highly divergent genomic regions Xuthus vs. polytes Xuthus vs. machaon Polytes vs. machaon

27

28

P. xuthus

29

30

P. machaon

P. polystes

Figure 2 | Genome-wide divergence among Papilio species. Pairwise genetic divergence among P. xuthus, P. machaon and P. polytes was calculated in 50-kb windows. This yielded 70 highly divergent genomic regions, windows in the upper 95th percentile in all three comparisons, encompassing 915 genes. When overlaid with signatures of positive selection and differential expression throughout development, 11 genes (highlighted in yellow) emerged as strong candidates for a role in recent diversification, including frizzled. Degree of freedom (df) is the density of different single-nucleotide polymorphisms (SNPs) per bp of each 50-kb window. NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

5

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

Table 2 | Mutagenesis efficiency of knocking out genes Abdominal-B, ebony and frizzled in P. xuthus. Gene Px_03961_Abd-B

Target sites* T42 T42/T95

Px_01073_e Px_15230_fz

T2/T303 T454/T6 T268/T283

RNA con. (ng ll  1)w 200; 300 600; 600 50/50; 100 566/416; 1,200 158/159; 1,200 200/150; 1200 500/500; 1,,000

Phynotype rate (%)z 7.95 (7/88) 8.26 (10/121) 6 (6/100) 91.79 (123/134) 31.25 (5/16) 88 (22/25) 4.12 (4/96)

Mutation rate (%) 35.38 25 18.33 90.85 66.67 29.92 45.16

Mutation type D: 3–51; I: 4–18 D: 3–66; I: 4–5; M: 40 D: 5–59; I: 3–19 D: 3–73; I: 3–64; M: 3–29 D: 5–33; I: 3–10; M: 3 D: 6–62; I: 3–15; M: 3–8 D: 3–64; I: 3–15; M: 4–5

D, delete fragment (bp); I, insert fragment (bp); M, mutated fragment (bp); sgRNA, single guide RNA. *‘/’ denotes injection of mixed targets. wThe number before and after semicolon denotes injected concentration of sgRNA and Cas9 mRNA, respectively. zThe number in the bracket denotes mutant/observed injected individuals.

visible morphological phenotypes. Abd-B is a HOX gene that plays a critical role in determining cell fate in the tail end of the organism. For instance, mutation or decreased expression of AbdB causes development of extra prolegs on all segments posterior to the sixth abdominal segment (A6) in silkworm larvae37. We began our Abd-B gene-editing experiment with low concentrations of single guide RNAs (sgRNAs) (150 ng ml  1) and Cas9 messenger RNA (300 ng ml  1) but failed to observe morphologically mutated individuals, although sequencing yielded evidence of lowfrequency disruptions (Supplementary Table 45 and Supplementary Fig. 40). Increasing injection concentrations (sgRNAs:Cas9 (ng ml  1), 200:300, 600:600) or the ratio (sgRNAs/Cas9 mRNA (ng ml  1): 100/150) gave rise to a low frequency (7.4%) of morphologically mutated larvae with varying abnormalities (Supplementary Tables 45 and 46, and Supplementary Figs 41–44). Remarkably, further increasing injection concentration of sgRNAs (982 ng ml  1) and Cas9 mRNA (1,200 ng ml  1), together with shortening the time period from egg laying to injection from 4 to 2 h, resulted in both a high frequency (92.5%) of morphologically mutated individuals and a majority (90%) of mutated individuals with severe expected phenotypic abnormalities (Fig. 3a–d, Supplementary Tables 45 and 46, and Supplementary Fig. 43a). For example, all mutated individuals had a curled abdomen resulting from abnormal terga on segments A3 and after, and most individuals had prolegs on all or part of segments A7–A10, segments that do not normally have prolegs (Supplementary Tables 45 and 46, and Supplementary Fig. 43). Sequencing of the targeted region showed a high frequency of gene disruption (Supplementary Table 46 and Supplementary Fig. 42). In addition, we also tested whether expression of the target gene (Abd-B) at RNA or protein level disappeared or was extremely reduced in mutants using both the quantitative reverse transcription–PCR (qRT–PCR) and western blotting. Both qRT–PCR and western blotting results suggested that the level of Abd-B protein in mutants was extremely low compared with that of the wild-type individuals (Supplementary Note 13 and Supplementary Fig. 45a,b), further providing evidence of Abd-B gene disruption. Based on our experience with Abd-B, we conclude that the key factors for successful gene editing in butterflies include high concentrations of sgRNA and Cas9 mRNA, an appropriate ratio, mixed injection of two or more sgRNAs with close targeting sites and timing egg injection to target early embryogenesis (Supplementary Note 13). It is noteworthy that all the mutant test experiments used individuals that developed from injected eggs (G0), and DNA, RNA, as well as protein were extracted from G0 whole body samples. Having established the methodology, we further tested it on a second proof-of-principle gene, ebony. Ebony encodes the enzyme N-b-alanyl dopamine synthetase, a central component of melanin biosynthesis (Supplementary Fig. 25), and expresses not only in 6

pupa and adult but also larvae of P. xuthus and P. machaon, including the green region of both larvae (Supplementary Figs 26, 28c and 30). Swallowtail butterflies have a unique class of wing pigments, papiliochromes, that derive in part from melanin precursors such as tyrosine and b-alanine via the action of ebony38. Ebony is also known to influence behaviour39. We obtained many ebony mutants of P. xuthus with the CRISPR/Cas9 system (Fig. 3e–j, Supplementary Note 13, Supplementary Tables 47 and 48 and Supplementary Figs 46–51). Mutated fifth instar larvae displayed enhanced melanic pigmentation (Fig. 3e and Supplementary Fig. 46), consistent with ebony mutants in other insects40–42, and they also showed an absence of orange colour in the false eyespot (Fig. 3f and Supplementary Fig. 46). Interestingly, two mutated fifth instar larvae also showed abnormal curticular structure in the location of eversible osmeterium, a swallowtail’s specific defensive organ, and one or both of tubular arms of their osmeterium, unlike those of wild-type individuals, could not evert when irritated by abrupt contact (Supplementary Fig. 47). Adult mutants showed brown pigmentation across the body and the regions of wings that were normally yellow (Supplementary Fig. 48). We also verified the decreased expression of ebony in the fifth instar larvae of mutants compared with that of wild type by qRT–PCR (t-test, P ¼ 0.001) (Supplementary Note 13 and Supplementary Fig. 51), suggesting that ebony gene was largely disrupted in mutants. Finally, we turned to the candidate diversification gene fz. Consistent with our hypothesis that fz plays a role in generating divergent larval morphology, we obtained mutants showing a variety of larval anatomy phenotypes, including asymmetric appendages and cuticular structures (Fig. 3k–n, Supplementary Tables 49 and 50, and Supplementary Figs 52–55). For example, mutated larvae had smaller prolegs on one side of the body, smooth and colourless dorsal cuticle, or vestigial tubercles on the prothorax or metathorax. These results suggest that divergent evolution at the fz gene, as well as differential expression, may be related to the distinct larval anatomy between species. To exclude the possibility that these morphological mutants were induced by off-target cleavage events during the genomeediting process, we further identified all possible off-target sites in the genome using three methods (CasOT43, Cas-OFFinder44 and COSMID45) with the mismatches up to 5 bp. For the two target sites (Px_10703_e-T454 and -T6) of ebony, the three methods identified 197 possible off-target sites, but none of them is completely identical to the target sites (Supplementary Note 13 and Supplementary Table 51). Next, we performed wholegenome resequencing for an ebony fifth instar larva mutant (Supplementary Note 13). Our resequencing data confirmed disruption of the two target sites of the ebony gene (Supplementary Fig. 56), but there was no disruption of any other functional gene in the mutant (Supplementary Table 52).

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

a

b

c

d

Abdominal-B

e

f

g

h

i

j

Ebony

k

l

m

n

Frizzled

Figure 3 | CRISPR/Cas9-induced morphological mutants of Abdominal-B, ebony and frizzled in P. xuthus. (a–d) Mutations induced by the injection of Cas9 mRNA and Abdominal-B sgRNA with prolegs on segment A7–A10 (a), segments that do not normally have prolegs (c), or with a curled abdomen resulting from abnormal terga on segments A3 and after (b), compared with wild type (d). Green dots in a and c show normal prolegs of A6 and A10, while red stars in a denote the redundant prolegs on A7–A10. (e–j) Mutations induced by the injection of Cas9 mRNA and ebony sgRNA with enhanced melanic pigmentation (e) and an absence of orange colour in the false eyespot (f) in fifth instar larvae, compared with wild type (h,i), and with brown pigmentation (g) across the body in regions that were normally yellow and in wing patches that were normally orange in adult (j). (k–n) A mutant induced by the injection of Cas9 mRNA and frizzled sgRNA with smooth and colourless dorsal cuticle in right side (k), while normal in its left side (m), and its dorsal view (l) compared with wild type (n). Scale bars, 1 mm (a–d,f,i,k–n) and 10 mm (e,g,h,j).

Among the 197 possible off-target sites of ebony T454 (143 sites) and T6 (54 sites), we observed that 33 sites have variation different from the reference genome. Among these 33 sites, 29 are due to single nucleotide variations (SNVs) and 4 are due to indels. Three SNVs are found at the possible off-target sites of three genes, respectively, but these three SNVs are all synonymous. Other 30 sites are all in non-coding regions. Therefore, the observed phenotypes are most possibly caused by the designed target genes’ disruption rather than off targeting. The 33 variation sites that did not disrupt genes could result from polymorphisms given the high heterozygosity of P. xuthus, although the possibility of off targeting could not be completely excluded at this point. High specificity of Cas9 editing and low incidence of

off-target mutations have recently been reported by wholegenome sequencing of human stem cell46,47; in butterflies, future detailed analysis on offsprings from the same parents will allow to test whether some of the variations are from off targeting or not. Nevertheless, our results show that Cas9 is an efficient and reliable tool in butterfly gene editing. It is worth noting that we have only focused on P. xuthus at this point, because we found difficult to raise P. machaon under laboratory conditions. However, our experience suggests that this Cas9 method will be widely applicable to any butterfly species as long as one can collect a sufficient number of eggs and then rear them to an appropriate developmental stage to observe mutants’ phenotypes.

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

7

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

Discussion The amazing diversity of butterflies provides a rich, natural experiment with which to explore the molecular mechanisms of morphological and species diversification, but doing so with butterflies has been hindered by complications associated with assembling genome sequences from highly heterozygous, outbred insects and an absence of widely applicable gene-editing technology. By focusing on a biologically significant group of butterflies, we have developed widely applicable methods to overcome these challenges, thereby enabling the scientific community to fully harness the research potential of this remarkable insect radiation. Furthermore, our results lend unique insight into the evolutionary history of butterfly diversification. Much attention is often paid to adult butterflies, and their wing patterns in particular. However, similar to all holometabolous insects, the butterfly life cycle is a two-stage process consisting of larva and adult. Perhaps surprisingly, our genomic analyses point repeatedly to the caterpillar stage as a major driver of natural selection with many of the genes that emerge from our analyses connecting to the larval stage through effects on larval morphology or host plant use. This enriches our view of insect evolution, and with these new genetic and genomic tools in hand we can now test the generality of these findings. Methods Butterfly husbandry and sample collection. Pupae of P. xuthus Linnaeus (Px) and P. machaon Linnaeus (Pm) were mainly purchased from butterfly insectariums in the suburbs of Ya’an, Sichuan, China. A proportion of pupae and adults were also acquired by raising the larvae collected by Xueyan Li and Guichun Liu in the suburb of Kunming, Yunnan, China. Pupae were reared under the conditions of 26 °C, 75% relative humidity and 18 h/6 h light/darkness. Emerged adults were crossed via hand pairing48,49. After mating, females were placed in net rooms with host plants for oviposition and then the mated adults were frozen at  80 °C for DNA extraction or kept as dried specimens. Eggs were collected to be used for experiments such as microinjection or allowed to develop on host plant at room temperature. Px larvae were fed with a horticulturally rutaceous plant (Zanthxylum piperitum) found by Xueyan Li, which appears to be a new host plant due to the species expansion and local adaptation. Pm larvae were fed with Umbelliferae plants fennel (Foeniculum vulgare) or carrot (Daucus carota). To get high quality and high volume of DNA for library construction, head and thorax tissues (excluding wings) of six homogametric male (ZZ) adults of P. xuthus and P. machaon both from Sichuan, China, were dissected and used for DNA isolation using a Gentra Puregene Blood kit (Qiagen, Germany) following manual instructions. A single individual was used to construct short-insert libraries (150, 250 and 500 bp); each of other five individuals was used to construct long-insert libraries (2, 5, 10 and 20 kb) and 454-library, respectively. RNA samples from different development stages of both species including egg, larvae from first instar (L1) to fifth instar (L5), pupa (female and male) and adult (female and male) were collected for transcriptome sequencing. Owing to small body of egg and early development larvae, ten eggs, ten L1 and five L2 were used for constructing single RNA library; for other development stages, single individual was used to construct RNA library. For linkage mapping, P. xuthus pupae were obtained from a butterfly breeder, and one 3-day old male and one newly emerged female were used to set up a cross. The two parents (adults) and their 94 F1 offspring (second to fourth instar larvae) of P. xuthus were collected for DNA isolation and restriction site associated (RAD) sequencing. The eggs of P. xuthus for microinjection of Cas9 were collected in our greenhouse. The related protocols involved with butterfly in this study have been reviewed and approved by the internal review board of Kunming Institute of Zoology, Chinese Academy of Sciences. Genome sequencing and assembly. All short reads from short- and long-insert libraries for genome assembly were produced with the Illumina Hiseq2000 platform at BGI (Shenzhen, China) (Supplementary Note 1). Long reads were produced with the Roche 454FLXPlus Titanium platform at Duke University (USA) using the 454 Life Science/Roche protocol on the GS-FLX þ System. We carried out several rounds of assembly using high-quality data (Supplementary Note 1) from different sequencing platforms, that is, Illumina short reads, 454 long reads or both combined (Supplementary Table 1). Considering that there exist a larger number of short contigs in the assemblies only by Illumina short reads (Supplementary Table 1), we took a hybrid assembly strategy (Supplementary Fig. 5) of combining Illumina short reads with 454 long reads for the final assembly. First, about 30  Illumina reads from short insert libraries were used to correct amplicon pyrosequencing errors in 454 reads. To do the correction, Coral 1.4 (http://www.cs.helsinki.fi/u/lmsalmel/coral/)50 was used to correct sequencing 8

errors by forming multiple alignments. We employed 454 model with options ‘  f  k 21  e 0.07  p 20’ during correction. For P. xuthus and P. machaon, 1.60% and 1.10% errors in 454 read bases were corrected, respectively. Second, corrected 454 reads were then fed into Newbler 2.6 (ref. 51) in DataAnalysis_2.6 toolkit with options ‘  mi 90  ml 40  nrm  het  m  cpu 20  l 500’ to build contigs. Third, paired-end sequencing Illumina short reads were employed to combine contigs by SSPACE-1.2 basic52 (http://www.baseclear.com/landingpages/ sspacev12/). The reads from short insert size libraries (o2 kb) were first used to construct scaffolds (with parameters  a 0.6  x 0  g 1  k 5). Next, reads from long insert size libraries (^2 kb) were added in with parameters ‘  a 0.6  x 0  g 1  k 3’. Finally, we used short paired-end (150, 250 and 500 bp libraries) information to retrieve gap-crossed read pairs with one end mapped to a unique contig and the other located in a gap, and performed a local assembly for these collected reads to fill the gaps by GapCloser v1.12 (ref. 53) (http://soap.genomics.org.cn/soapdenovo.html). We further estimated genome size using flow cytometry (Supplementary Note 1) and then made haplotype separation and excluded microsporidia genome reads (Supplementary Note 1). Assembly quality was evaluated by three methods, that is, aligning short reads to assemblies, aligning transcripts to evaluate completeness of assemblies and evaluating completeness by CEGMA (Core Eukaryotic Genes Mapping Approach) (Supplementary Note 1). RAD-based linkage mapping of scaffolds. RAD library preparation was performed mainly according to Etter et al.54 with some modifications and sequencing was performed with an Illumina Hiseq 2000, and EcoRI site was used for RAD tag extraction (Supplementary Note 2). Linkage maps were constructed using both regression (Kosambi mapping function) and maximum likelihood algorithms (Supplementary Note 2). Syntenic relationship of chromosomal linkage between P. xuthus and P. machaon was analysed based on their orthologues (Supplementary Note 2). Genome annotation. Repetitive sequences and transposable elements in P. xuthus, P.machaon and two nymphalid butterflies (D. plexippus and H. melpomene) were annotated using a combination of homology to Repbase sequences, de novo prediction approaches and Tandem Repeats Finder (Supplementary Note 4). Protein-coding genes in Papilio genomes were predicted using homology-based methods, ab initio gene prediction and RNA-seq data (Supplementary Note 5). Gene function information, motifs and domains of their proteins were assigned by comparing with public databases including SwissProt, TrEMBL, KEGG, InterPro and Gene Ontology (Supplementary Note 5). To avoid biased comparisons of gene sets among butterfly species used in this study and two former studies (D. plexippus and H. melpomene), we conducted all these analyses for the four butterfly genomes. Gene evolution. To explore gene evolution patterns among butterflies, gene cluster was analysed including the genomes of five butterflies (P. xuthus, P. machaon, D. plexippus, H. melpomene and M. cinxia), moths (B. mori and P. xylostella), mosquito (Anopheles gambiae), fruit fly (D. melanogaster), beetle (Tribolium castaneum), bee (Apis mellifera), which cover all orders of holometabolous insects (Supplementary Note 5). One thousand and seventy-one single-copy genes were used for constructing phylogenetic tree of 11 species (Supplementary Note 5). Gene orthologous relationship among Lepidopteran insects (P. xuthus, P. machaon, D. plexippus, H. melpomene and B. mori) was determined (Supplementary Note 5) and positively selected genes were identified in P. xuthus, P. machaon and both of them (Supplementary Note 5). Transcriptome sequencing and analysis. Transcriptome sequencing for different developmental stages was performed with Illumina RNA-seq protocols and two methods, that is, de novo assembly of clean RNA reads and mapping them back to the assembled genomes, were carried out for transcriptome assembly (Supplementary Note 6). RPKM (reads per kb per million reads) was used to measure gene expression abundance. We analysed gene expression dynamics in development (Supplementary Note 6), differentially expressed genes (Supplementary Note 6), expression patterns of positively selected genes and lineage-specific genes (Supplementary Note 6). Specific gene families and pathways. To explore possible significance of expansion of farnesyl pyrophosphate synthase (FPPS) gene family in Papilio butterflies, we investigated all genes encoding scIPPS and other enzymes in the pathways of JH biosynthesis and degradation, as well as protein prenyltransferases in ten insect species (P. xuthus, P. machaon, D. plexippus, H. melpomene, B. mori, P. xylostella, A. gambiae, D. melanogaster, T. castaneum and A. mellifera), constructed their gene trees and analysed their expression patterns (Supplementary Note 7). We also identified gene families involved in detoxification of various xenobiotics, including cytochrome P450s, glutathione S-transferase and carboxylesterases (Supplementary Notes 8 and 9). In addition, we also analysed the genes related to body colour (Supplementary Note 10) and PCP genes (Supplementary Note 11) playing a fundamental role in morphogenesis of

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

vertebrates and invertebrates. All phylogenetic trees were constructed using Maximum likelihood in PAML55. Analysis of genomic divergence. The divergence regions were analysed based on the assembled genomes of P. xuthus, P. machaon and Illumina genome resequencing data of one individual of a third Papilio species (P. polytes) with its data available. We conducted pairwise local alignments using the P. xuthus genome as a reference and aligned paired-end reads of one male P. polytes individual and simulated paired-end reads of the P. machaon genome. Next, we calculated fixed different single-nucleotide polymorphisms per bp for each 50-kb window across the reference genome for three comparisons (P. xuthus/P. machaon, P. xuthus/ P. polytes and P. machaon/P. polytes) and the highly divergent regions were characterized using 95% smoothed empirical likelihood quantiles among all three comparisons as a cutoff (Supplementary Note 12). Genome editing in butterfly using CRISPR/Cas9 system. We designed sgRNA target sites by seeking sequences corresponding to N20NGG on exon regions of the sense or antisense strand of the DNA by ZiFit Targeter programme56. Next, we BLAST (using Basic Local Alignment Search Tool) these candidate target sequence against the P. xuthus genome to eliminate those with off-target sites using strict criteria, where the candidate-editable site is defined only when the seed region (12 nucleotides (nt) to protospacer adjacent motif NGG) is unique15. From candidate-editable sites, we selected those with the first two bases of GG, GA or AG for sgRNA synthesis. sgRNA can be synthesized by plasmid-based or PCR-based strategies and we used a PCR-based method to synthesize sgRNAs. Briefly, a unique oligonucleotide encoding T7 polymerase-binding site and the sgRNA target sequence N20 was designed as forward primer (CRISPRF 50 GAAATTAATACGACTCACTATAN20GTTTTAGAGCTAGAAATAGC-30 ) and a common oligonucleotide encoding the remaining sgRNA sequence (sgRNAR 50 -AAAAGCACCGACTCGGTGCCACTTTTTCAAGTTGATAACGGACTAGC CT TATTTTAACTTGCTATTTCTAGCTCTAAAAC-30 ) was designed as a reverse primer18. All the oligonucleotides (CRISPRF and sgRNAR) were synthesized by GENEray Company (Shanghai, China). The information of target sites and the primers for analysing target sites of three genes (Px_03961_Abd-B, Px_01073_e and Px_15230_fz) are shown in Supplementary Table 44 and Supplementary Fig. 39. PCR of primer-self amplification was performed with Q5 high-quality DNA polymerase (BioLabs) in 100 ml reaction volumes. Three hundred nanograms of purified PCR product was used as DNA template to perform in-vitro transcription with the MAXIscript T7 kit (Life Technology, USA) for 4 h at 37 °C. The Cas9 mRNA was transcribed using NotI-digested Cas9 expression vector pTD1-T7-Cas9 (ref. 19) and the mMESSAGE mMACHINE T7 ULTRA kit (Life Technologies). After the transcription reaction, the poly (A) tailing reaction and DNase I treatment were performed according to the manufacturer’s instructions. Both the sgRNA and the Cas9-encoding mRNA were then purified by LiCl precipitation and redissolved in RNase-free water. Fresh eggs were collected from the host plant leaves, dipped into clear water and then aligned on the microscope slide with a soft paintbrush and fixed with glue. Two-nanolitre mix of sgRNA(s) and Cas9-encoding mRNA, at varying concentrations (Supplementary Tables 45, 47 and 49), were injected through the chorion into each egg under a dissecting microscope (Nikon SMZ800), using a TransferMan NK2 and FemtoJet microinjection system (Eppendorf, Germany). After injection, eggs were put in a petri dish and then placed in an incubator at 25 °C and 70% relative humidity. When embryos hatched, host plant leaves were placed into the dishes for newly hatched larva feeding. The leaves with larvae were carefully transferred into large dishes. The phenotype of G0 generation larvae was carefully checked using a dissecting microscope and photographed using a digital camera. Pupae were transferred into plastic baskets to eclose. We observed morphologic changes in different developmental stages, using a microscope and by the eye. For Abd-B, we mainly observed variation of the larval abdomen; for ebony, we primarily noted colour variation of larvae and adults. For fz, we mainly observed cuticular structures. For Abd-B mutants of P. xuthus, genomic DNA extraction for the whole body of unhatched G0 larvae dissected from developed eggs and subsequent PCR were carried out using an Animal Tissue Direct PCR kit (Foregene, China), following manual instruction, and genomic DNA of whole body of G0-hatched larvae were extracted using Gentra Puregene Blood Kit (Qiagen), and exTaq polymerase was used in PCR amplification. The primer pairs included F3/R6, F9/R11, and F10/R11 (Supplementary Fig. 39a). For ebony mutants, whole body of larvae, prepupae and adult were used to extract genomic DNA, using Gentra Puregene Blood Kit (Qiagen), and exTaq polymerase was used in PCR amplification. The primer pairs were F2/R2 for amplifying target sites of T2-T303 (Px_01073_e-I and Px_01073_ e-II) or F7/R4 for amplifying T454-T6 (Px_01073_e-III) (Supplementary Fig. 39b). For fz mutants, whole body of larvae were used to extract genomic DNA, using Gentra Puregene Blood Kit (Qiagen) and exTaq polymerase was used in PCR amplification. The primer pair for amplifying target sites is F2/R2 (Supplementary Fig. 39c). PCR products of target sites were detected with T7 endonuclease I (T7EI) as previously described57. T7EI-positive and morphologically mutated individuals were further confirmed by Sanger sequencing 12 TA clones. We also performed qRT–PCR and western blot analysis to check the effect of disruption of Abd-B gene on its expression at RNA or protein levels (Supplementary Note 13). In addition,

we also performed qRT–PCR to check the effect of ebony gene on its expression at RNA level (Supplementary Note 13). For those targets resulting in morphological mutants, to exclude the possibility that these morphological mutants were induced by off-target cleavage events during the genome-editing process, we further analysed possible off-target sites in the genome using three methods CasOT43, Cas-OFFinder44 and COSMID45 with the least stringent parameters and carried out whole-genome next-generation sequencing validation for the ebony gene mutant (Supplementary Note 13). The resequencing reads of mutant have been uploaded to the NCBI Sequence Read Archive and are available via the accession number SRA272356.

References 1. Joron, M. & Mallet, J. L. B. Diversity in mimicry: paradox or paradigm? Trends Ecol. Evol. 13, 461–466 (1998). 2. Brakefield, P. M. & French, V. Butterfly wings: the evolution of development of colour patterns. BioEssays 21, 391–401 (1999). 3. Nijhout, H. F. The Development and Evolution of Butterfly Wing Patterns (Smithsonian Institution Scholarly Press, 1991). 4. Boogs, C. L., Ehrlich, P. R. & Watt, W. B. Butterflies: Ecology and Evolution Taking Flight (University of Chicago Press, 2003). 5. Beldade, P. & Brakefield, P. M. The genetics and evo-devo of butterfly wing patterns. Nat. Rev. Genet. 3, 442–452 (2002). 6. McMillan, W. O., Monteiro, A. & Kapan, D. D. Development and evolution on the wing. Trends Ecol. Evol. 17, 125–133 (2002). 7. Zhan, S., Merlin, C., Boore, J. L. & Reppert, S. M. The monarch butterfly genome yields insights into long-distance migration. Cell 147, 1171–1185 (2011). 8. Dasmahapatra, K. K. et al. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species. Nature 487, 94–98 (2012). 9. Ahola, V. et al. The Glanville fritillary genome retains an ancient karyotype and reveals selective chromosomal fusions in Lepidoptera. Nat. Commun. 5, 4737 (2014). 10. Cong, Q., Borek, D., Otwinowski, Z. & Grishin, N. V. Tiger swallowtail genome reveals mechanisms for speciation and caterpillar chemical defense. Cell Rep. 10, 910–919 (2015). 11. Nishikawa, H. et al. A genetic mechanism for female-limited Batesian mimicry in Papilio butterfly. Nat. Genet. 47, 405–409 (2015). 12. Langley, C. H., Crepeau, M., Cardeno, C., Corbett-Detig, R. & Stevens, K. Circumventing heterozygosity: sequencing the amplified genome of a single haploid Drosophila melanogaster embryo. Genetics 188, 239–246 (2011). 13. You, M. S. et al. A heterozygous moth genome provides insights into herbivory and detoxification. Nat. Genet. 45, 220–225 (2013). 14. Merlin, C., Beaver, L. E., Taylor, O. R., Wolfe, S. A. & Reppert, S. M. Efficient targeted mutagenesis in the monarch butterfly using zinc-finger nucleases. Genome Res. 23, 159–168 (2013). 15. Cong, L. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–823 (2013). 16. Gaj, T., Gersbach, C. A. & Barbas, 3rd C. F. ZFN, TALEN, and CRISPR/Casbased methods for genome engineering. Trends Biotechnol. 31, 397–405 (2013). 17. Gratz, S. J. et al. Genome engineering of Drosophila with the CRISPR RNAguided Cas9 nuclease. Genetics 194, 1029 (2013). 18. Bassett, A. R., Tibbit, C., Ponting, C. P. & Liu, J. L. Highly efficient targeted mutagenesis of Drosophila with the CRISPR/Cas9 system. Cell Rep. 4, 220–228 (2013). 19. Wang, Y. Q. et al. The CRISPR/Cas system mediates efficient genome engineering in Bombyx mori. Cell Res. 23, 1414–1416 (2013). 20. Ma, S. Y. et al. CRISPR/Cas9 mediated multiplex genome editing and heritable mutagenesis of BmKu70 in Bombyx mori. Sci. Rep. 4, 4489 (2014). 21. Linnaeus, C. Systema naturae per regna tria naturae:secundum classes, ordines, genera, species, cum characteribus, differentiis, synonymis, locis (Editio decima, reformata, 1758). 22. Cusson, M. et al. Characterization and tissue-specific expression of two lepidopteran farnesyl pyrophosphate synthase homologs: implications for the biosynthesis of ethyl-substituted juvenile hormones. Proteins 65, 742–758 (2006). 23. Vandermoten, S., Haubruge, E. & Cusson, M. New insights into short-chain prenyltransferases: structural features, evolutionary history and potential for selective inhibition. Cell. Mol. Life Sci. 66, 3685–3695 (2009). 24. Kinjoh, T. et al. Control of juvenile hormone biosynthesis in Bombyx mori: cloning of the enzymes in the mevalonate pathway and assessment of their developmental expression in the corpora allata. Insect Biochem. Mol. Biol. 37, 808–818 (2007). 25. Peroutka, R., Schulzova, V., Botek, P. & Hajslova, J. Analysis of furanocoumarins in vegetables (Apiaceae) anal citrus fruits (Rutaceae). J. Sci. Food Agric. 87, 2152–2163 (2007). 26. Danielson, P. B., MacIntyre, R. J. & Fogleman, J. C. Molecular cloning of a family of xenobiotic-inducible drosophilid cytochrome p450s: evidence for involvement in host-plant allelochemical resistance. Proc. Natl Acad. Sci. USA 94, 10797–10802 (1997).

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

9

ARTICLE

NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9212

27. Janzen, D. H., Hallwachs, W. & Burns, J. M. A tropical horde of counterfeit predator eyes. Proc. Natl Acad. Sci. USA 107, 11659–11665 (2010). 28. Kodandaramaiah, U., Vallin, A. & Wiklund, C. Fixed eyespot display in a butterfly thwarts attacking birds. Anim. Behav. 77, 1415–1419 (2009). 29. Schaefer, H. M. & Stobbe, N. Disruptive coloration provides camouflage independent of background matching. Proc. R. Soc. B Biol. Sci. 273, 2427–2432 (2006). 30. Tullberg, B. S., Merilaita, S. & Wiklund, C. Aposematism and crypsis combined as a result of distance dependence: functional versatility of the colour pattern in the swallowtail butterfly larva. Proc. R. Soc. B Biol. Sci. 272, 1315–1321 (2005). 31. Kawooya, J. K. et al. Why are green caterpillars green? ACS Symp. Ser. 276, 11 (1985). 32. Shirataki, H., Futahashi, R. & Fujiwara, H. Species-specific coordinated gene expression and trans-regulation of larval color pattern in three swallowtail butterflies. Evol. Dev. 12, 305–314 (2010). 33. Kunte, K. et al. doublesex is a mimicry supergene. Nature 507, 229 (2014). 34. Goodrich, L. V. & Strutt, D. Principles of planar polarity in animal development. Development 138, 1877–1892 (2011). 35. Povelones, M., Howes, R., Fish, M. & Nusse, R. Genetic evidence that drosophila frizzled controls planar cell polarity and armadillo signaling by a common mechanism. Genetics 171, 1643–1654 (2005). 36. Repiso, A., Saavedra, P., Casal, J. & Lawrence, P. A. Planar cell polarity: the orientation of larval denticles in Drosophila appears to depend on gradients of Dachsous and fat. Development 137, 3411–3415 (2010). 37. Tomita, S. & Kikuchi, A. Abd-B suppresses lepidopteran proleg development in posterior abdomen. Dev. Biol. 328, 403–409 (2009). 38. ffrench-Constant, R. & Koch, P. B. in: Butterflies: Ecology And Evolution Taking Flight. (eds Boggs, C. L., Watt, W. B. & Ehrlich, P. R.) (Chicago University Press, 2003). 39. Takahashi, A. Pigmentation and behavior: potential association through pleiotropic genes in Drosophila. Genes Genet. Syst. 88, 165–174 (2013). 40. Futahashi, R. et al. Yellow and ebony are the responsible genes for the larval color mutants of the silkworm Bombyx mori. Genetics 180, 1995–2005 (2008). 41. Wittkopp, P. J., True, J. R. & Carroll, S. B. Reciprocal functions of the Drosophila yellow and ebony proteins in the development and evolution of pigment patterns. Development 129, 1849–1858 (2002). 42. Sakai, R. K., Baker, R. H. & Iqbal, M. P. Genetics of ebony, a nonlethal recessive melanotic mutant in a mosquito. J. Hered. 63, 275 (1972). 43. Xiao, A. et al. CasOT: a genome-wide Cas9/gRNA off-target searching tool. Bioinformatics 30, 1180–1182 (2014). 44. Bae, S., Park, J. & Kim, J. S. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics 30, 1473–1475 (2014). 45. Cradick, T. J., Qiu, P., Lee, C. M., Fine, E. J. & Bao, G. COSMID: a web-based tool for identifying and validating CRISPR/Cas off-target sites. Mol. Ther. Nucleic Acids 3, e214 (2014). 46. Veres, A. et al. Low incidence of off-target mutations in individual CRISPRCas9 and TALEN targeted human stem cell clones detected by whole-genome sequencing. Cell Stem Cell 15, 27–30 (2014). 47. Yang, L. H. et al. Targeted and genome-wide sequencing reveal single nucleotide variations impacting specificity of Cas9 in human stem cells. Nat. Commun. 5, 5507 (2014). 48. Arikawa, K., Suyama, D. & Fujii, T. Hindsight by genitalia: photo-guided copulation in butterflies. J. Comp. Physiol. A 180, 295–299 (1997). 49. Lederhouse, R. C., Ayres, M. P. & Scriber, J. M. Evaluation of spermatophore counts in studying mating systems of Lepidoptera. J. Lepidopt. Soc. 43, 9 (1989). 50. Salmela, L. & Schroder, J. Correcting errors in short reads by multiple alignments. Bioinformatics 27, 1455–1461 (2011). 51. Margulies, M. et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437, 376–380 (2005). 52. Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D. & Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–579 (2011). 53. Li, R. et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20, 265–272 (2010). 54. Etter, P. D., Bassham, S., Hohenlohe, P. A., Johnson, E. A. & Cresko, W. A. SNP discovery and genotyping for evolutionary genetics using RAD sequencing. Methods Mol. Biol. 772, 157–178 (2011). 55. Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007). 56. Sander, J. D. et al. ZiFiT (Zinc Finger Targeter): an updated zinc finger engineering tool. Nucleic Acids Res. 38, W462–W468 (2010).

10

57. Guschin, D. Y. et al. A rapid and general assay for monitoring endogenous gene modification. Methods Mol. Biol. 649, 247–256 (2010). 58. Heikkila, M., Kaila, L., Mutanen, M., Pena, C. & Wahlberg, N. Cretaceous origin and repeated tertiary diversification of the redefined butterflies. Proc. Biol. Sci. 279, 1093–1099 (2012). 59. Maeki, K. A use of chromosome numbers in the study of taxonomy of the lepidoptera and notes on the internal reproductive anatomy (With 6 Textfigures). J. Faculty Sci. Hokkaido Univ. Ser. VI Zool. 13, 259–262 (1957). 60. Xia, Q. Y. et al. A draft sequence for the genome of the domesticated silkworm (Bombyx mori). Science 306, 1937–1940 (2004). 61. Zhan, S. & Reppert, S. M. MonarchBase: the monarch butterfly genome database. Nucleic Acids Res. 41, D758–D763 (2013). 62. Gregory, T. R. et al. Eukaryotic genome size databases. Nucleic Acids Res. 35, D332–D338 (2007).

Acknowledgements This project was funded by grants from the National Natural Science Foundation of China (number 31321002), Chinese Academy of Sciences (XDB13000000) and from Yunnan Provincial Science and Technology Department (number 2013HA004) to W.W. M.R.K. and W.Z. were supported by the National Institutes of Health Grant GM10862602. We thank D. Yang and Y. Peng for kindly providing their greenhouse; Y. Wu, C. Zhou., Z. Dong and M. Li for help or assistance in butterfly husbandry; Y. Huang for the kind gift of Cas9 expression vector; A. Tan and X. Huang for helpful suggestions in Cas9 experiments; J. Lyu for fruitful discussion in data analysis; and Q. Li for helping in off-target identification.

Author contributions W.W., X.L., Y. Dong and M.R.K. conceived the study and designed scientific objectives. W.W., X.L. and M.R.K. led the project and manuscript preparation. J.W., G.Z., X. Fang and M.F. supervised genome sequencing, assembly, and annotation and analysis. G.Z., D.F., D.Z. and L. Zhang developed the sequencing and assembling strategy. D.F., L. Zhang, Y.Z., Y.F., X. Feng and X.J. conducted genome and transcriptome assembly and evaluation, and performed genome annotation. D.F., X.L., L. Zhao and H.X. performed data analysis of genome and transcriptome. M.R.K. and W.Z. constructed butterfly cross, conducted RAD sequencing, constructed linkage map and performed analysis of genomic landscape of divergence. X.L., G.L., Y. Ding and Y.C. raised butterflies, provided materials and prepared DNA and RNA samples for genomic and transcriptomic sequencing. G.L. and X.L. performed flow cytometry analysis. X.L., G.L., L.C. and R.Z. performed CRISPR/Cas9 gene editing. X.L., W.W., M.R.K., L. Zhao, D.F. and W.Z. wrote the manuscript.

Additional information Accession codes. The whole-genome shotgun projects for the two swallowtail butterflies P. xuthus and P. machaon have been deposited in DDBJ/EMBL/GenBank under the accession codes LADI00000000 and LADJ00000000, respectively. The version described in this paper is the first version LADI01000000 and LADJ01000000, respectively. The genome assembly and annotation is available at GenBank as BioProject ID PRJNA270384 and PRJNA270386, respectively. The whole-genome sequence reads have been deposited in the NCBI Sequence Read Archive (SRA) under the accession codes SRA230220 and SRA230221, respectively. The RAD reads have been deposited in the NCBI Sequence Read Archive (SRA) under the accession code SRA272689. The RNAseq data have been deposited in GenBank GEO under the accession codes GSE65280 and GSE65281, respectively. Supplementary Information accompanies this paper at http://www.nature.com/ naturecommunications Competing financial interests: The authors declare no competing financial interests. Reprints and permission information is available online at http://npg.nature.com/ reprintsandpermissions/ How to cite this article: Li, X. et al. Outbred genome sequencing and CRISPR/Cas9 gene editing in butterflies. Nat. Commun. 6:8212 doi: 10.1038/ncomms9212 (2015). This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

NATURE COMMUNICATIONS | 6:8212 | DOI: 10.1038/ncomms9212 | www.nature.com/naturecommunications

& 2015 Macmillan Publishers Limited. All rights reserved.

Cas9 gene editing in butterflies.

Butterflies are exceptionally diverse but their potential as an experimental system has been limited by the difficulty of deciphering heterozygous gen...
2MB Sizes 3 Downloads 13 Views

Recommend Documents


Cas9 Gene Editing.
The CRISPR/Cas9 system has emerged as an important tool in biomedical research for a wide range of applications, with significant potential for genome engineering and gene therapy. In order to achieve conditional control of the CRISPR/Cas9 system, a

Cas9-mediated gene editing in human tripronuclear zygotes.
Genome editing tools such as the clustered regularly interspaced short palindromic repeat (CRISPR)-associated system (Cas) have been widely used to modify genes in model systems including animal zygotes and human cells, and hold tremendous promise fo

Cas9 Technology.
Until very recently, the prospect of introducing mutations or exogenous DNA sequences at precise locations in the genomes of plants and animals was difficult, if not impossible. This rapidly changed with the demonstration that the type II CRISPR-Cas

Cas9 gene editing technology studies.
The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein 9 (Cas9) system is an adaptive immune defense system that resists the invasion of viruses and plasmids heterologous genetic material in bacteria and archa

Cas9 and rAAV Templates for Efficient Gene Editing.
Altering endogenous genes in cells is an integral tool of modern cell biology. The ease-of-use of the CRISPR/Cas9 system to introduce genomic DNA breaks at specific sites in vivo has led to its rapid and wide adoption. In the absence of a DNA templat

Cas9 and genome editing in Drosophila.
Recent advances in our ability to design DNA binding factors with specificity for desired sequences have resulted in a revolution in genetic engineering, enabling directed changes to the genome to be made relatively easily. Traditional techniques for

CRISPR-Cas9 Genome Editing in Drosophila.
The CRISPR-Cas9 system has transformed genome engineering of model organisms from possible to practical. CRISPR-Cas9 can be readily programmed to generate sequence-specific double-strand breaks that disrupt targeted loci when repaired by error-prone

Cas9 system produces specific and homozygous targeted gene editing in rice in one generation.
The CRISPR/Cas9 system has been demonstrated to efficiently induce targeted gene editing in a variety of organisms including plants. Recent work showed that CRISPR/Cas9-induced gene mutations in Arabidopsis were mostly somatic mutations in the early

Photoactivatable CRISPR-Cas9 for optogenetic genome editing.
We describe an engineered photoactivatable Cas9 (paCas9) that enables optogenetic control of CRISPR-Cas9 genome editing in human cells. paCas9 consists of split Cas9 fragments and photoinducible dimerization domains named Magnets. In response to blue