<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Paris, France
∗Corresponding author.
sarah.lang@uni-graz.at (S. A. Lang); bernhard.liebl@uni-leipzig.de (B. Liebl);
burghardt@informatik.uni-leipzig.de (M. Burghardt)
https://sarahalang.com (S. A. Lang);
https://www.mathcs.uni-leipzig.de/ifi/forschung/computational-humanities (B. Liebl);
https://www.mathcs.uni-leipzig.de/personenprofil/mitarbeiter/juniorprof-dr-manuel-burghardt (M. Burghardt)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Toward a Computational Historiography of Alchemy: Challenges and Obstacles of Object Detection for Historical Illustrations of Mining, Metallurgy and Distillation in 16th-17th Century Print</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sarah A. Lang</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bernhard Liebl</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Burghardt</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computational Humanities Research Group, University of Leipzig</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department Centre for Information Modelling (ZIM), University of Graz</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>This study explores the use of modern computer vision methods for object detection in historical images extracted from 16th-17th century printed books containing illustrations of distillation, mining, metallurgy, and alchemical apparatus. We found that the transfer of knowledge from contemporary photographic data to historical etchings proves less effective than anticipated, revealing limitations in current methods like visual feature descriptors, pixel segmentation, representation learning, and object detection with YOLOv8. These findings highlight the stylistic disparities between modern images and early print illustrations, suggesting new research directions for historical image analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>computer vision</kwd>
        <kwd>object detection</kwd>
        <kwd>alchemy</kwd>
        <kwd>chymistry</kwd>
        <kwd>early-modern print</kwd>
        <kwd>metallurgy</kwd>
        <kwd>mining</kwd>
        <kwd>distillation</kwd>
        <kwd>annotation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        the norm in the kitchens and makeshift laboratories of the past has been seen by Smith as the precursor to the natural sciences and chemistry as we know them today [55, p. 292]. Morris argues that chemical laboratories in the modern sense emerged with the replacement of multipurpose or makeshift spaces, which were not specifically designed for carrying out chemical operations, with professionalized work environments for performing chemical and metallurgical operations [41, p. 19–20]. He further states that this rise of chemical laboratories coincides with the boom of a genre of metallurgical technical treatises [40]. Empirical evidence of these first laboratories remains scarce, with only a handful of alchemical laboratories discovered so far [
        <xref ref-type="bibr" rid="ref35">34</xref>
        ]. This is where early modern handbooks on distillation, metallurgy and mining, rich with illustrations, become invaluable. These texts provide unmatched insight into the laboratories, processes, and practices in the artes technicae at the time, illustrating the underpinnings of the era’s chemistry and technology. Despite their significance for the history of technology and the Chemical Humanities [4
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], these books remain relatively understudied to this day.
      </p>
    </sec>
    <sec id="sec-2">
      <title>1.1. Depicting mining, metallurgy, and distillation</title>
      <p>
        During the proto-industrial revolution, mining and metallurgy flourished, leading to the emergence of encyclopedic compendia of technological apparatus and processes. These include works such as Georgius Agricola’s De re metallica libri XII [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], Vannoccio Biringucci’s De la pirotechnia Libri X [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Lazarus Ercker’s Aula subterranea [
        <xref ref-type="bibr" rid="ref24">23</xref>
        ], and Giambattista della Porta’s De distillatione libri IX [
        <xref ref-type="bibr" rid="ref45">44</xref>
        ]. Metallurgical technical treatises became a staple in the genre of didactic manuals and were frequently accompanied by technical illustrations. Beginning with smaller treatises, grander montanist works started appearing by the mid-16th century, such as Vannoccio Biringuccio’s De la Pirotechnia (1540) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and Georg Agricola’s De Re Metallica (1556) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This knowledge, always accessible in books, as Michael Giesecke has emphasized, was so attractive because it replaced the exchange with experts and thus often made expensive and time-consuming journeys unnecessary [2
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Consequently, ease of finding relevant passages, through fitting illustrations or knowledge organization tools such as indices, was pivotal to their success. Besides metallurgy-focused works, distillation treatises also became popular in the 16th century [3
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Particularly influential were Hieronymus Brunschwig’s Liber der Arte Distillandi (Straßburg 1512) [11] and Walther Hermann Ryff’s Distillation Book (Frankfurt am Main 1545) [50]. Brunschwig’s treatises have been published in a bewildering variety of versions, translations, and re-editions [35, p. 284–287].¹
      </p>
    </sec>
    <sec id="sec-3">
      <title>1.2. Research agenda and the case for automatic object detection</title>
      <p>
        Since book illustration was expensive, early modern printers opportunistically reused illustrations from woodcuts and copper plates, thereby separating the images from their original contexts. Thus, illustrations would be commissioned for one specific publication, rendering lots of detail and providing an alternative communication medium for the message expressed in the text of that particular book, and then reused in other contexts where they fitted more or less well, much like modern stock photography [28].
¹The Strasbourg doctor and pharmacist first published his Liber de Arte Distillandi De Simplicibus in 1500. This is referred to by research as the ‘small distillation book’. Twelve years later, the author followed up with a more voluminous Liber de Arte Distillandi De Compositis, known as the ‘large distillation book’ [11].
However, this means that not every image used in early modern print was made specifically to illustrate the exact matter discussed in a text passage. Medical books, herbaries and distillation books are a medium particularly rich in illustration, for which even legal battles over ‘copyright’ are not unheard of. Especially the later richly illustrated encyclopedic works could only be financed due to their reuse of earlier image material. What does this mean for pragmatic literature, though? Do the images faithfully represent the processes being described and the equipment needed to carry them out? We know, for example, that Lazarus Ercker’s Aula Subterranea [
        <xref ref-type="bibr" rid="ref23">22</xref>
        ] (or ‘Bergwercksarten’) is a
true handbook, in the sense that it is detailed enough so that one can replicate the processes
described. But can this be true for all other books from that genre as well, given what we know
about the practices of illustration reuse in historical print?
      </p>
      <p>It is in this context that we propose to apply computer vision techniques to automatically detect the illustrations in these books. Being able to detect relevant objects in digitised book pages is a crucial first step for a quantitative Distant Viewing [5] analysis of such apparatus within early modern chymical and pragmatic literature. In this short paper, we discuss challenges and obstacles we encountered during a first series of experiments in annotating a sample of such illustrations and training different approaches for object detection for historical illustrations of mining, metallurgy, and distillation in 16th–17th century print.</p>
      <sec id="sec-3-1">
        <title>2. Detecting alchemical apparatus</title>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>2.1. Related work</title>
      <p>
        We presume that a computational analysis of illustration practices can yield answers to the questions outlined above. As for related work, there is one branch of works that uses computer vision methods on illustrations in 15th/16th century print [37,
        <xref ref-type="bibr" rid="ref22 ref29">21, 28</xref>
        ]. However, these approaches are less concerned with the recognition of individual objects and more focused on identifying illustrations as a whole, particularly their reuse in different books. Cormier et al. [
        <xref ref-type="bibr" rid="ref18">17</xref>
        ] use machine learning approaches to classify illustrations as either woodcut or copperplate engravings. An interactive Visual Analytics system (VeCHArt) for comparing copies or different states of a print is proposed by Pflüger et al. [
        <xref ref-type="bibr" rid="ref43">42</xref>
        ]. Valleriani et al. [5
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] present an empirical study on the visual similarity of early modern scientific illustrations on cosmology, while Kaoua et al. [30] provide insights from a large-scale study on image collation, as they try to match different illustrations in a large corpus of manuscripts.
      </p>
      <p>What all these approaches have in common is their emphasis on studying illustrations as complete entities, analyzing their style, similarities, or reuse. However, for our specific use case of detecting alchemical apparatus, we require an approach that is able to detect singular objects in a complex scene depicted in an illustration. Since we could not find any existing methods for object detection in 16th/17th century book illustrations, we conducted a series of experiments using various approaches on our own.</p>
    </sec>
    <sec id="sec-5">
      <title>2.2. First experiments with existing methods</title>
      <p>First of all, we experimented with out-of-the-box methods, such as the Distant Viewing Toolkit (Figure 1), Segment Anything (segment-anything.com/) (Figure 2) and image querying using OWL-ViT (Figure 3). While revealing some successes at first glance, after some more testing these algorithms proved largely inadequate at differentiating specific objects of interest in early modern prints. This medium is rich in visually similar etchings and contains typical alchemical objects that algorithms trained on modern data may simply not be familiar with.</p>
    </sec>
    <sec id="sec-6">
      <title>2.3. Training and evaluation corpus</title>
      <p>
        Because of the shortcomings of the above approaches, we proceeded to compile training data providing a representative sample for the book genre defined above, containing books primarily concerned with mining, metallurgy or distillation. Some of them represent different issues or print runs of the same book for standard works such as Hieronymus Brunschwig’s De Arte Distillandi [11], Georgius Agricola’s De re metallica [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] or Vannoccio Biringuccio’s Pirotechnia [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], in which illustrations frequently differ between different editions or print shops. Unlike the training corpus, the evaluation corpus was constructed to also contain books not concerned with mining, metallurgy or distillation. This allows us to verify whether the algorithm actually learned anything and is able to distinguish illustrations not related to our subject (such as workshop scenes not related to alchemy, metallurgy, mining or distillation) from the objects we wish to detect. Accordingly, we first evaluate the ability to detect illustrations of laboratory equipment in early modern book pages, and then look at the performance for classifying specific objects. Our training corpus, thus, only contains books that we know contain a sufficient number of relevant illustrations from the contexts of mining, metallurgy and distillation from the 16th–17th centuries [
        <xref ref-type="bibr" rid="ref1 ref10 ref2 ref21 ref23 ref25 ref3 ref32 ref51 ref58 ref6 ref7 ref8 ref9">11, 12, 50, 51, 10, 13, 24, 57, 31, 20, 22, 3, 1, 2, 6, 7, 9, 8</xref>
        ], while the evaluation corpus contains books from other alchemy-related areas and encyclopedias [56, 32,
        <xref ref-type="bibr" rid="ref19 ref20 ref34 ref4 ref49">33, 19, 18, 48, 4</xref>
        ]. The challenges of annotating the training corpus are described in the next section.
      </p>
      <sec id="sec-6-1">
        <title>3. The alchemy of annotation</title>
        <p>The next step involved the semi-automatic annotation of images using the Supervisely
platform (https://supervise.ly), whereby each component of alchemical apparatus was labeled
individually in the hopes of providing the most useful form of annotations to improve model
training. This process resulted in the creation of pixel-level labels.</p>
        <p>
          We based our annotation on previous work done at the Herzog August Bibliothek
Wolfenbüttel [
          <xref ref-type="bibr" rid="ref27">26</xref>
          ].² In Frietsch’s classification [
          <xref ref-type="bibr" rid="ref27">26</xref>
          ], ‘Alchemistic equipment’ (49E393, https://iconclass.org/49E393) is a subclass of ‘Alchemy’ (49E39) in IconClass and organized as illustrated in Figure 4. As the annotation table (Table 1) shows, we did not incorporate all of the IconClass categories as labels. The classes to be used were selected by the relative frequency of related images in our corpus and depending on whether it made sense to keep subclasses or not (as many of them are neither visually distinctive nor frequent enough in our corpus to be effective to annotate). The goal was to keep the number of necessary annotation labels (and classes) as low as possible for our initial experiments. On the other hand, we introduced a class for ambices (singular ambix, a distillation helmet), which are frequently depicted, yet were lacking from Frietsch’s classification of alchemical equipment.³ This approach represents a compromise between keeping the number of classes as low as possible while still including a sufficient number for making meaningful interpretations later. Had we annotated both the non-explicitly alchemical and the explicitly alchemical tools the same way, we would probably train our algorithm to simply detect tools, regardless of the label assigned to them coming from the IconClass alchemy category.
²Adhering to the alchemy IconClass classification and vocabulary created by Ute Frietsch, which includes most alchemical apparatus, would not only keep a successful object detection model coming out of this work interoperable, but it also provides us with 1,800 tagged images we may re-use for creating ground truth in future work.
³As we had initially planned not to include composite devices, in the hopes of thus providing better training data for the algorithms, some classes that are very visually distinctive for alchemy were not included, such as alembics and Moor’s heads (Figure 5). Notably, within the category of ‘pots’ (ollae), some objects exhibit visually distinct alchemical characteristics, like triangular crucibles (for examples see Table 1), while others can only be interpreted as alchemical within a guaranteed alchemical context, such as cupels, which visually look like simple pots or cups. We further opted to unite a range of furnace types under a single label.
49E3931 alchemistic vessels
49E39311 bottles (ampullae)
• philosophical egg (ovum philosophicum)
• pelican
• phial
• receiver (receptaculum)
49E39312 flasks (cucurbitae)
• alembic
• Moor’s head
• operculum
• retort
• rosenhut
49E39313 pots, jars (ollae)
• aludel
• chalice
• crucible
• cupel
49E3932 alchemistic furnace
• assay furnace
• athanor
• carburizing furnace
• ‘slow Harry’ (piger henricus)
• reverberatory furnace
• smelting furnace
49E3933 alchemistic bath (balneum)
• balneum arenae
• balneum Mariae
49E3939 other alchemistic equipment
        </p>
      </sec>
      <sec id="sec-6-2">
        <title>4. Preliminary results</title>
        <p>
          In the rapidly evolving Digital Humanities (DH) sub-field of Distant Viewing [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], the application of computer vision techniques in diverse research areas has been met with enthusiasm. But despite this enthusiasm, our study reveals that these models may not yet readily adapt to specialized tasks in the DH. We have encountered substantial challenges in deploying these models for object detection in early modern depictions of chemical apparatus. The likely culprits were not solely the unique visual style of these etchings but also the models’ unfamiliarity with the nuances of early modern alchemical equipment and associated terminology. It is apparent that these models, adept at interpreting modern visual styles and contexts, are confounded by the distinct visual style of early modern etchings. In the following subsections, we present preliminary results for the detection of alchemical objects in early modern illustrations that were achieved with a range of different supervised and unsupervised computer vision approaches.
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>4.1. Visual feature descriptors</title>
      <p>
        First, we experimented with an unsupervised clustering approach for visual feature description, namely the ORB (Oriented FAST and Rotated BRIEF [4
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]) method. This approach is tailored to exact image reproduction (cf. the work done on woodcut reuse in chapbooks with VISE [21]). It did not involve any training or the usage of our annotations and was meant to discern whether some intrinsic structures within the data could be utilized. Unfortunately, ORB failed to demonstrate such patterns in our data set.
      </p>
    </sec>
    <sec id="sec-8">
      <title>4.2. Pixel Segmentation</title>
      <p>
        Next, we decided to try pixel segmentation approaches, which allow us to perform object detection by dividing an image into segments and labeling each pixel, trying to map it to an object class. We first deployed approaches where models classify each pixel individually, namely U-Net [
        <xref ref-type="bibr" rid="ref48">47</xref>
        ] (with a ResNet-34 backbone) and the newer SegFormer [5
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Despite being unable to recognize several elements (notably, animals), the U-Net/ResNet deep learning model detected, i.e. segmented, some plants correctly. Overall, however, the classification still proved to be erroneous. With the ResNet-based pixel segmentation, we reached an overall accuracy of 33.0% after fine-tuning for 50 epochs. A similar story unfolded when using the SegFormer B1 [5
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] deep learning model, which occasionally managed to identify the rough area of an object but again without determining the correct category.
      </p>
    </sec>
    <sec id="sec-9">
      <title>4.3. Representation learning</title>
      <p>
        Furthermore, we continued assessing the efficacy of unsupervised models, which operate without annotations to discover structures in the data and thus are supposed to identify similar objects. We employed SimSiam (Simple Siamese Representation Learning) [16] and SimCLR (Contrastive Learning of Visual Representations) [1
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] for unsupervised clustering using Siamese networks. Siamese networks are used in unsupervised visual representation learning to maximize similarity between image augmentations. SimCLR performs unsupervised representation learning from unlabeled images, leveraging data augmentation to contrast different visual representations. SimCLR and SimSiam perform well on ImageNet, yet both methods yielded equally discouraging results on our historical data.
      </p>
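<p>The core of the SimSiam objective mentioned above is a symmetrized negative cosine similarity with a stop-gradient on one branch; the small helper below is a sketch of that loss (our own illustrative code, not taken from any particular implementation).</p>

```python
import torch
import torch.nn.functional as F

def simsiam_loss(p1, p2, z1, z2):
    """Symmetrized negative cosine similarity with stop-gradient on the
    projections z, following the SimSiam formulation: p are predictor
    outputs, z are encoder projections of the two augmented views."""
    def d(p, z):
        # .detach() implements the stop-gradient that prevents collapse
        return -F.cosine_similarity(p, z.detach(), dim=-1).mean()
    return 0.5 * d(p1, z2) + 0.5 * d(p2, z1)

# sanity check: identical embeddings give perfect similarity, i.e. a loss of -1
v = torch.ones(4, 8)
loss = simsiam_loss(v, v, v, v)
```

<p>During training, the two views fed to the network are random augmentations of the same page crop, so minimizing this loss pushes their representations together.</p>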
    </sec>
    <sec id="sec-10">
      <title>4.4. YOLO object detection</title>
      <p>Finally, we turned to the state-of-the-art object detection framework YOLO (You Only Look Once), using its version 8 model⁴, because a predecessor (YOLOv5) had previously been reported as suitable for detecting images in historical print [14]. Unfortunately, the performance of YOLO – like that of the previous approaches – fell short of our expectations. As YOLO is a popular framework and widely known in the Computational Humanities community, we will discuss it in more detail. We based our quality assessment on the model’s ability to correctly detect objects and accurately label them.</p>
      <p>YOLO training was performed using about 50% of each class for training and the rest for validation (figure 10). We initially experimented with 3-fold cross-validation; however, due to the scarceness of our training data, we finally opted for the single train–validation split.</p>
      <p>
        As each image usually contained various labels with different classes (see Figure 6), producing such a stratified sampling was unfortunately not straightforward, as one image must be assigned either to the training set or to the validation set with all the labels it contains, to prevent data leakage. Sometimes a large proportion of all available labels was concentrated on one or two images. A further complication was the presence of partially overlapping label annotations, such as distillation helmets being part of furnace setups. These create potential sources of confusion for both training and validation (figure 6). We found no solution for fixing the overlapping labels, but we partitioned image regions with non-overlapping labels into isolated (non-overlapping and non-leaking) sub-images, thereby producing a larger number of possible assignments to training or validation sets (and, through this, a lower stratification error). The image partitioning was performed using a custom plane-sweep algorithm that produced a hierarchy of either horizontal or vertical axes that subdivided images without cutting across label bounding boxes. To compute the actual training–validation split, we generated 10,000 random splits and picked the one that yielded a label distribution with the lowest mean error in its train–validation ratio over all classes. For future studies, we plan to resort to more robust approaches [
        <xref ref-type="bibr" rid="ref54">53</xref>
        ]. Still, except for the furnace class, our approach produced a good stratification for all classes (figure 10).
      </p>
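<p>The random-split search described above can be sketched as follows; the data structure (a list of image ids with their label lists) and the helper names are our illustrative assumptions, not the exact implementation.</p>

```python
import random
from collections import Counter

def stratification_error(train_labels, val_labels, target=0.5):
    """Mean absolute deviation of each class's train share from the target ratio."""
    tc, vc = Counter(train_labels), Counter(val_labels)
    classes = set(tc) | set(vc)
    errs = [abs(tc[c] / (tc[c] + vc[c]) - target) for c in classes]
    return sum(errs) / len(errs)

def best_random_split(images, n_trials=10000, target=0.5, seed=0):
    """images: list of (image_id, [labels]) pairs. Each image (or isolated
    sub-image) goes wholly to train or val, preventing label leakage.
    Returns (error, train, val) for the best of n_trials random splits."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_trials):
        train, val = [], []
        for img_id, labels in images:
            (train if rng.random() < target else val).append((img_id, labels))
        t = [l for _, ls in train for l in ls]
        v = [l for _, ls in val for l in ls]
        if not t or not v:
            continue  # degenerate split: everything on one side
        err = stratification_error(t, v, target)
        if best is None or err < best[0]:
            best = (err, train, val)
    return best
```

<p>With few images per class, even the best of many random splits can leave rare classes (like our furnace class) poorly balanced, which matches the behaviour reported above.</p>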
      <p>We now report some of the training results. Training a yolov8n model with default parameters yielded a model with a mAP@0.5 of 0.3. Switching to a yolov8s model with a resolution of 1280 pixels (instead of the 640-pixel default) improved this score to 0.37 (discussed below and shown in figure 7). As the confusion matrix (figure 8) and the precision-recall curves (figure 7) show, the classes that were best detected are ‘plants’, ‘ollae’ and ‘animals’. ‘Furnaces’, ‘other-equipment’, ‘cucurbitae-retorte’, ‘cucurbitae-rosenhut’ and ‘ampullae’ are detected considerably less well, having issues with both precision and recall. The classes ‘human’, ‘mineralmetal’ and ‘cucurbitae’ showed very low overall precision. The detection of ‘cucurbitae-ambix’ did not seem to work at all. We also experimented with other resolutions (up to 1,600 pixels), with adding augmentation through mixup and various image transformations, and with tuning the mosaic setting and the box_loss gain. However, we found no improvements in overall performance. Looking at the training curves, it turned out that for all tested YOLO models, resolutions, and settings, from the smallest yolov8n model to the larger yolov8s model, generalization for object localization did not work well, while generalization for object classification seemed to present no issues at all: while the classification loss cls_loss was reduced rather symmetrically for both training and validation sets, and the box_loss for the training data showed a nearly perfect training curve in all regimes, the box_loss for the validation set turned out to be highly unstable and erratic in all cases, implying at least partial overfitting. Upon analyzing why ollae was recognized better than other classes, we noted that the characteristic rounding could potentially account for a somewhat better model performance in this category. Across other label classes, visual variance was higher, which is illustrated in figure 10. For example, the depictions of objects in the ampullae category varied considerably (e.g., the jugs with handles in the ‘training’ set lacked larger openings at the top).</p>
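<p>The training configuration described above corresponds roughly to the following sketch using the ultralytics API; the dataset file name alchemy.yaml and the helper are hypothetical, and the epoch count is an illustrative assumption.</p>

```python
def train_alchemy_detector(data_yaml="alchemy.yaml", epochs=100):
    """Sketch of the YOLOv8 setup discussed in the text. `alchemy.yaml`
    is a hypothetical dataset file listing train/val image folders and
    the class names (ollae, ampullae, furnaces, ...)."""
    # lazy import: ultralytics is an optional, heavyweight dependency
    from ultralytics import YOLO

    model = YOLO("yolov8s.pt")  # the larger of the two variants we tried
    model.train(
        data=data_yaml,
        imgsz=1280,             # 1280 px instead of the 640 px default
        epochs=epochs,
    )
    return model.val()          # validation metrics, incl. mAP@0.5 per class
```

<p>Swapping in "yolov8n.pt" and the default image size reproduces the smaller baseline configuration mentioned above.</p>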
      <p>The overall lack of success was probably due to the ratio of large ‘variance in the data’ to small ‘number of annotations’. The latter pales in comparison to the recommended figure of 1,500 images (and 10,000 labels) per category.⁵ The daunting prospect of manually annotating such a volume of images, however, was contrary to our objectives of automating the task. Annotating 1,500 objects per category would not only be very laborious and potentially nonsensical for our task; this number of examples per class also may simply not exist per object type in our historical data.</p>
      <p>In preliminary experiments we observed that out-of-the-box YOLO models, pre-trained on COCO, showed no advantage in terms of transfer learning for the task at hand.⁶ Not only are COCO images modern, but the differences between their classes also tend to be much bigger than those amongst different types of early modern alchemical laboratory apparatus. Thus, the model probably cannot easily adapt to grappling with historical data, nor distinguish in such detail types of objects it has never seen before and does not know what to call.
⁵https://docs.ultralytics.com/yolov5/tutorials/tips_for_best_training_results/
⁶The COCO dataset consists of 80 distinct object classes (from a modern context) like cats, zebras, or baseball bats.</p>
      <sec id="sec-10-1">
        <title>5. Conclusions and future work</title>
        <p>As part of our endeavour to utilize computer vision techniques for detecting early modern depictions of chemical apparatus, we initially embarked on experimental runs using readily available toolkits. These preliminary efforts yielded encouraging results, suggesting the viability of digging deeper into the intricacies of this interdisciplinary task. Encouraged by these early indications, we decided to extend our exploration, leveraging custom annotations to fine-tune a model. However, as the previous sections have detailed, these subsequent efforts were met with considerable obstacles and ultimately did not live up to the promise suggested by our initial forays. This, in turn, strongly suggests that further in-depth investigation is required in this area.</p>
        <p>Attempts to harness state-of-the-art computer vision models revealed a distinct lack of generalisability to the idiosyncratic nature of early modern etchings. These unsuccessful attempts underscore the unique challenges presented by these unconventional early modern images. The ‘rendering’ or hatching, i.e. the style of our images, could be what thwarts the algorithms. They may also have issues with granularity due to the etchings’ visual similarity, because all the objects to be analyzed are early modern book illustrations characterized by cross-hatching and strong black lines. The model may simply recognize them all as parts of books or book pages but does not realize that it is the difference between those particular illustrations that we are interested in.⁷</p>
        <p>
          Going forward, we propose to explore one- or few-shot approaches, although such methods are not extensively supported for object detection. We might try reducing our object detection problem (which is more complex than classification and for which there are also fewer readily available frameworks) to a classification problem by working with cropped images. The complex nature of the image data at hand suggests a need for more comprehensive annotation, or potentially attempting to leverage style transfer to enhance our outcomes. We had initially tested this approach with the InstructPix2Pix model, which can convert hatching into realistic shading, but unfortunately, this led to the loss of crucial visually distinctive details in the images and was ultimately unhelpful for our object detection task (Figure 9).
⁷This may be because the training data it was trained on probably did not contain many images like ours and, when it did, those may simply have been labelled as ‘book’ or ‘book page’ by annotators who, unlike us, were not interested in their particular details. At least, indications of this were witnessed when we first tried out the Distant Viewing Toolkit [5] as an out-of-the-box tool (cf. Figure 1).
Leveraging the classification capabilities of large Vision-Language Models (VLMs) such as BLIP-2 [36] would be very interesting as well; however, the object localization issue needs to be solved first, maybe
by using OWL-ViT [
          <xref ref-type="bibr" rid="ref40">39</xref>
          ] or Segment Anything (Figure 2) only for bounding box estimation but not for classification [
          <xref ref-type="bibr" rid="ref53">52</xref>
          ].
        </p>
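<p>The proposed reduction of detection to classification via cropping can be sketched as follows; the crop_candidates helper and the padding value are our illustrative assumptions.</p>

```python
from PIL import Image

def crop_candidates(page, boxes, pad=8):
    """Cut class-agnostic candidate boxes (e.g. from OWL-ViT or Segment
    Anything) out of a page image, so that a separate image classifier
    can then label each crop. `boxes` are (x0, y0, x1, y1) pixel coords."""
    w, h = page.size
    crops = []
    for x0, y0, x1, y1 in boxes:
        # pad each box slightly and clamp it to the page bounds
        box = (max(0, x0 - pad), max(0, y0 - pad),
               min(w, x1 + pad), min(h, y1 + pad))
        crops.append(page.crop(box))
    return crops

page = Image.new("L", (200, 300), color=255)  # blank stand-in for a book page
crops = crop_candidates(page, [(20, 40, 80, 120)])
```

<p>Each crop can then be fed to a standard image classifier, for which far more frameworks and pretrained models are readily available than for full object detection.</p>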
        <p>In conclusion, despite the growing enthusiasm for Distant Viewing in the DH, the application of recent computer vision methods in the context of early modern print illustrations requires more nuanced approaches. The models’ failure to recognize and classify early modern etchings of chemical apparatus serves as a sobering reminder of the gap that still exists between the out-of-the-box availability of state-of-the-art technology and the challenges in its DH application on historical data.</p>
        <p>Table 1: IconClass — Description — Visual Representation.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>[1] G. Agricola. De Re Metallica. Basileae / Basel: In Officina Frobeniana, Per Hier. Frobenivm Et Nic. Episcopivm, 1561. url: http://data.onb.ac.at/rep/10B29699.</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Agricola</surname>
          </string-name>
          . De Re Metallica. Basileae / Basel: König, Emanuel,
          <volume>1657</volume>
          . url: http://data.onb.ac.at/rep/10B29680.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>G.</given-names>
            <surname>Agricola</surname>
          </string-name>
          . Vom Bergkwerck. Basel: Jeronymus Froben and Niclausen Bischoff,
          <volume>1557</volume>
          . url: http://data.onb.ac.at/rep/108C988B.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>U.</given-names>
            <surname>Aldrovandi</surname>
          </string-name>
          . Musaeum Metallicum. Bononiae / Bologna: Joan. Bapt. Ferronij,
          <volume>1648</volume>
          . url: http://data.onb.ac.at/rep/10B29761.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>T. B.</given-names>
            <surname>Arnold</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Tilton</surname>
          </string-name>
          . “
          <article-title>Distant Viewing: Analyzing Large Visual Corpora”</article-title>
          . In: Digital Scholarship in the Humanities (
          <year>2019</year>
          ). doi: http://dx.doi.org/10.1093/digitalsh/fqz013.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V.</given-names>
            <surname>Biringuccio</surname>
          </string-name>
          . Pirotechnia. Vinegia / Venice: Giouan Padoano,
          <volume>1550</volume>
          . url: http://data.onb.ac.at/rep/108CB82E.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>V.</given-names>
            <surname>Biringuccio</surname>
          </string-name>
          . Pirotechnia. Venetia / Venice: Gironimo Giglio,
          <volume>1559</volume>
          . url: http://data.onb.ac.at/rep/1085DDED.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.</given-names>
            <surname>Biringuccio</surname>
          </string-name>
          . Pirotechnia. Vinegia / Venice: Comin da Trino di Monferrato: Navo, Curzio Troiano de,
          <volume>1559</volume>
          . url: http://data.onb.ac.at/rep/10727526.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>V.</given-names>
            <surname>Biringuccio</surname>
          </string-name>
          . Pirotechnia. Bologna: Longhi,
          <volume>1678</volume>
          . url: http://data.onb.ac.at/rep/106F01F6.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Brunschwig</surname>
          </string-name>
          . Distilierbuch der rechten Kunst.
          <source>Franckfurt am Mayn / Frankfurt am Main: Weygand Hanen Erben</source>
          ,
          <volume>1565</volume>
          . url: http://data.onb.ac.at/rep/1084C642.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Brunschwig</surname>
          </string-name>
          . Liber de arte distillandi. Straßburg / Strasbourg: Johann Grüninger,
          <volume>1512</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>url: http://data.onb.ac.at/rep/109559D1.</mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>H.</given-names>
            <surname>Brunschwig</surname>
          </string-name>
          .
          <article-title>The vertuose boke of distyllacyon of the waters of all maner of herbes</article-title>
          .
          <source>London: Laurens Andrewe</source>
          ,
          <volume>1527</volume>
          . url: https://archive.org/details/mobot31753000816063/page/n30/mode/thumb.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>H.</given-names>
            <surname>Brunschwig</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Ryff</surname>
          </string-name>
          . New Vollkommen Distillierbuch.
          <source>Franckfurt a. M. / Frankfurt am Main: Christian Egenolffs Erben</source>
          ,
          <volume>1597</volume>
          . url:http://data.onb.ac.at/rep/10A5871B.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Büttner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Martinetz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>El-Hajj</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Valleriani</surname>
          </string-name>
          . “
          <article-title>CorDeep and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents”</article-title>
          . In:
          <source>Journal of Imaging</source>
          <volume>8</volume>
          .10 (Oct. 15,
          <year>2022</year>
          ), p.
          <fpage>285</fpage>
          . doi: 10.3390/jimaging8100285.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>T.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kornblith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Norouzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Hinton</surname>
          </string-name>
          .
          <article-title>A Simple Framework for Contrastive Learning of Visual Representations</article-title>
          . June 30,
          <year>2020</year>
          . url: http://arxiv.org/abs/2002.05709.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>He</surname>
          </string-name>
          .
          <source>Exploring Simple Siamese Representation Learning</source>
          . Nov. 20,
          <year>2020</year>
          . url: http://arxiv.org/abs/2011.10566.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cormier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Park</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Beck</surname>
          </string-name>
          . “
          <article-title>Automatic Classification of Woodcuts and Copperplate Engravings”</article-title>
          .
          <source>In: 2020 17th Conference on Computer and Robot Vision</source>
          (CRV) (
          <year>2020</year>
          ), pp.
          <fpage>85</fpage>
          -
          <lpage>92</lpage>
          . url: https://api.semanticscholar.org/CorpusID:219548487.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J. v.</given-names>
            <surname>Cuba</surname>
          </string-name>
          . Gart der Gesuntheit. Straßburg / Strasbourg: Mathia Apiario,
          <volume>1536</volume>
          . url: http://data.onb.ac.at/rep/108C96EA.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J. v.</given-names>
            <surname>Cuba</surname>
          </string-name>
          . Ortus Sanitatis
          . Venetiis / Venice: Joannes de Cereto de Tridino,
          <volume>1538</volume>
          . url: http://data.onb.ac.at/rep/109E61DA.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G.</given-names>
            <surname>Della Porta</surname>
          </string-name>
          . De Distillatione. Romae / Rome: Reu. Camerae Apostolicae,
          <volume>1608</volume>
          . url: http://data.onb.ac.at/rep/1058CD41.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dutta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Bergel</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A.</given-names>
            <surname>Zisserman</surname>
          </string-name>
          . “
          <article-title>Visual Analysis of Chapbooks Printed in Scotland”</article-title>
          .
          <source>In: The 6th International Workshop on Historical Document Imaging and Processing (HIP '21), September 5-6</source>
          ,
          <year>2021</year>
          , Lausanne, Switzerland (
          <year>2021</year>
          ). doi: https://doi.org/10.1145 /3476887.3476893.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ercker</surname>
          </string-name>
          . Aula Subterranea.
          <source>Franckfurt am Mayn / Frankfurt am Main: Johan Feyerabendt</source>
          ,
          <volume>1598</volume>
          . url: http://data.onb.ac.at/rep/104A2CBD.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ercker</surname>
          </string-name>
          .
          <article-title>Aula subterranea</article-title>
          .
          <source>Frankfurt (Main): Zunner</source>
          ,
          <volume>1672</volume>
          . url: https://www.deutschestextarchiv.de/book/show/ercker_aula01_1672.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ficinus</surname>
          </string-name>
          . Liber de arte distulandi. Straßburg / Strasbourg: Johan Grüniger,
          <volume>1509</volume>
          . url: http://data.onb.ac.at/rep/10A81A42.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>H.</given-names>
            <surname>Fors</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Principe</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H. O.</given-names>
            <surname>Sibum</surname>
          </string-name>
          . “
          <article-title>From the Library to the Laboratory and Back Again: Experiment as a Tool for Historians of Science”</article-title>
          .
          <source>In: Ambix 63/2</source>
          (
          <year>2016</year>
          ), pp.
          <fpage>85</fpage>
          -
          <lpage>97</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>U.</given-names>
            <surname>Frietsch</surname>
          </string-name>
          .
          <article-title>IconClass alchemy notation</article-title>
          .
          <source>Herzog August Bibliothek Wolfenbüttel</source>
          ,
          <year>2017</year>
          . url: http://alchemie.hab.de/bilder.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>M.</given-names>
            <surname>Giesecke</surname>
          </string-name>
          . Der Buchdruck in der frühen Neuzeit.
          <article-title>Eine historische Fallstudie über die Durchsetzung neuer Informations- und Kommunikationstechnologien</article-title>
          .
          <source>Frankfurt am Main</source>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>G.</given-names>
            <surname>Götzelmann</surname>
          </string-name>
          . “Bilderschätze,
          <article-title>Bildersuchen: Digitale Auswertung von Illustrationswiederverwendungen im Buchdruck des 16. Jahrhunderts”</article-title>
          . In: Wissen und Buchgestalt. Ed. by
          <string-name>
            <given-names>P.</given-names>
            <surname>Hegel</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Krewet</surname>
          </string-name>
          . Wiesbaden: Harrassowitz Verlag,
          <year>2022</year>
          , pp.
          <fpage>323</fpage>
          -
          <lpage>340</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>M. M. A.</given-names>
            <surname>Hendriksen</surname>
          </string-name>
          . “
          <article-title>Rethinking Performative Methods in the History of Science”</article-title>
          .
          <source>In: Berichte zur Wissenscha昀琀sgeschichte</source>
          <volume>43</volume>
          (
          <year>2020</year>
          ), pp.
          <fpage>313</fpage>
          -
          <lpage>322</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kaoua</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Durr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lazaris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Picard</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Aubry</surname>
          </string-name>
          . “Image Collation:
          <article-title>Matching illustrations in manuscripts”</article-title>
          .
          <source>In: CoRR abs/2108.08109</source>
          (
          <year>2021</year>
          ). url: https://arxiv.org/abs/2108.08109.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>P.</given-names>
            <surname>Kerzenmacher</surname>
          </string-name>
          . Alle Farben Wasser. München / Munich, Frankfurt am Mayn / Frankfurt am Main: Egenolff, 1589. url: https://digital.deutsches-museum.de/de/digital-catalogue/library-object/BV020856683/#1.2
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kircher</surname>
          </string-name>
          .
          <article-title>Mundus subterraneus 1</article-title>
          . Amstelodami / Amsterdam: Janssonius &amp; Weyerstraten, 1665. url: http://data.onb.ac.at/rep/1047DA01.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kircher</surname>
          </string-name>
          .
          <article-title>Mundus subterraneus 2</article-title>
          . Amstelodami / Amsterdam: Janssonius &amp; Weyerstraten, 1665. url: http://data.onb.ac.at/rep/1047D9FF.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>S.</given-names>
            <surname>Lang</surname>
          </string-name>
          . “
          <article-title>Alchemical Laboratories: Texts, Practices, Material Relics. An Introduction”</article-title>
          . In: Alchemistische Labore. Praktiken, Texte und materielle Hinterlassenschaften / Alchemical Laboratories. Practices, texts, material relics. Ed. by
          <string-name>
            <given-names>S.</given-names>
            <surname>Lang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fröstl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Fiska</surname>
          </string-name>
          . Graz: Grazer Universitätsverlag,
          <year>2023</year>
          , pp.
          <fpage>13</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>S.</given-names>
            <surname>Laube</surname>
          </string-name>
          . “
          <article-title>Am Anfang ist Gestaltung. Bemerkungen zu Titelblättern bei Destilliertraktaten des 16. Jahrhunderts”</article-title>
          . In: Wissen und Buchgestalt. Ed. by
          <string-name>
            <given-names>P.</given-names>
            <surname>Hegel</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Krewet</surname>
          </string-name>
          . Wiesbaden: Harrassowitz Verlag,
          <year>2022</year>
          , pp.
          <fpage>275</fpage>
          -
          <lpage>298</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          et al. “
          <article-title>BLIP-2: Bootstrapping Language-Image Pre-Training with Frozen Image Encoders and Large Language Models”</article-title>
          .
          <source>In: arXiv preprint arXiv:2301.12597</source>
          (
          <year>2023</year>
          ). url: http://arxiv.org/abs/2301.12597.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>M.</given-names>
            <surname>Malaspina</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhong</surname>
          </string-name>
          . “
          <article-title>Image-matching technology applied to Fifteenth-century printed book illustration”</article-title>
          .
          <source>In: Lett Mat Int</source>
          <volume>5</volume>
          (
          <year>2017</year>
          ), pp.
          <fpage>287</fpage>
          -
          <lpage>292</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>M.</given-names>
            <surname>Martinón-Torres</surname>
          </string-name>
          .
          <article-title>“Some recent developments in the historiography of alchemy”</article-title>
          .
          <source>In: Ambix 58/3</source>
          (
          <year>2011</year>
          ), pp.
          <fpage>215</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>M.</given-names>
            <surname>Minderer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gritsenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Stone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Neumann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Weissenborn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dosovitskiy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mahendran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Arnab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dehghani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Kipf</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Houlsby</surname>
          </string-name>
          . “
          <article-title>Simple Open-Vocabulary Object Detection with Vision Transformers”</article-title>
          .
          <source>In: ECCV 2022</source>
          .
          <year>2022</year>
          . doi: https://doi.org/10.48550/arXiv.2205.06230.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>P. J. T.</given-names>
            <surname>Morris</surname>
          </string-name>
          . “
          <article-title>The history of chemical laboratories: a thematic approach”</article-title>
          . In: ChemTexts 7/21 (
          <year>2021</year>
          ). doi: https://doi.org/10.1007/s40828-021-00146-x.
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>P. J. T.</given-names>
            <surname>Morris</surname>
          </string-name>
          .
          <article-title>The Matter Factory: A History of the Chemistry Laboratory</article-title>
          . London: Reaktion Books,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>H.</given-names>
            <surname>Pflüger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Thom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schütz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bohde</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ertl</surname>
          </string-name>
          . “
          <article-title>VeCHArt: Visually Enhanced Comparison of Historic Art Using an Automated Line-Based Synchronization Technique”</article-title>
          .
          <source>In: IEEE Transactions on Visualization and Computer Graphics</source>
          <volume>26</volume>
          .10 (
          <year>2020</year>
          ), pp.
          <fpage>3063</fpage>
          -
          <lpage>3076</lpage>
          . doi: 10.1109/tvcg.2019.2908166.
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>M.</given-names>
            <surname>Piorko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hendriksen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Werrett</surname>
          </string-name>
          . “Alchemical Practice:
          <article-title>Looking Towards the Chemical Humanities”</article-title>
          .
          <source>In:Ambix</source>
          <volume>69</volume>
          :
          <issue>1</issue>
          (
          <year>2022</year>
          ), pp.
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          . doi: 10.1080/00026980.2022.2035572.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>G. D.</given-names>
            <surname>Porta</surname>
          </string-name>
          . De Distillationibus libri IX.
          <source>Strasbourg: Zetzner</source>
          ,
          <year>1609</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Principe</surname>
          </string-name>
          and
          <string-name>
            <given-names>W. R.</given-names>
            <surname>Newman</surname>
          </string-name>
          .
          <article-title>“Some Problems with the Historiography of Alchemy”</article-title>
          . In:
          <article-title>Secrets of Nature: Astrology and Alchemy in Early Modern Europe</article-title>
          . Ed. by
          <string-name>
            <given-names>W. R.</given-names>
            <surname>Newman</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Grafton</surname>
          </string-name>
          . Cambridge/Massachusetts: MIT Press,
          <year>2001</year>
          , pp.
          <fpage>385</fpage>
          -
          <lpage>432</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>S.</given-names>
            <surname>Reardon</surname>
          </string-name>
          . “
          <article-title>The Alchemical Revolution”</article-title>
          .
          <source>In: Science 332</source>
          (
          <year>2011</year>
          ), pp.
          <fpage>914</fpage>
          -
          <lpage>915</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>O.</given-names>
            <surname>Ronneberger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fischer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Brox</surname>
          </string-name>
          .
          <article-title>U-Net: Convolutional Networks for Biomedical Image Segmentation</article-title>
          .
          <source>May 18</source>
          ,
          <year>2015</year>
          . url: http://arxiv.org/abs/1505.04597.
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>E.</given-names>
            <surname>Rößlin</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. v.</given-names>
            <surname>Cuba</surname>
          </string-name>
          . Kreuterbuch. Franckenfurt am Meyn / Frankfurt am Main: Egenolff, 1536. url: http://data.onb.ac.at/rep/109E61B8.
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>E.</given-names>
            <surname>Rublee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Rabaud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Konolige</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Bradski</surname>
          </string-name>
          . “
          <article-title>ORB: An efficient alternative to SIFT or SURF”</article-title>
          .
          <source>In: 2011 IEEE International Conference on Computer Vision (ICCV)</source>
          . Barcelona, Spain: IEEE, Nov.
          <year>2011</year>
          , pp.
          <fpage>2564</fpage>
          -
          <lpage>2571</lpage>
          . doi: 10.1109/ICCV.2011.6126544.
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Ryff</surname>
          </string-name>
          .
          <article-title>Das new groß Distillier Buch</article-title>
          .
          <source>Franckfurt / Frankfurt am Main: Christ. Egenolff</source>
          ,
          <year>1545</year>
          . url: http://data.onb.ac.at/rep/107BF132.
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          [51]
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Ryff</surname>
          </string-name>
          .
          <article-title>New groß distillier-Buch</article-title>
          .
          <source>Franckfort / Frankfurt am Main: Christian Egenolff's Erben</source>
          ,
          <year>1556</year>
          . url: http://data.onb.ac.at/rep/105FA8A2.
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          [52]
          <string-name>
            <given-names>P.</given-names>
            <surname>Saranrittichai</surname>
          </string-name>
          et al. “
          <article-title>Zero-Shot Visual Classification with Guided Cropping”</article-title>
          .
          <source>In: arXiv preprint arXiv:2309.06581</source>
          (
          <year>2023</year>
          ). url: http://arxiv.org/abs/2309.06581.
        </mixed-citation>
      </ref>
      <ref id="ref54">
        <mixed-citation>
          [53]
          <string-name>
            <given-names>K.</given-names>
            <surname>Sechidis</surname>
          </string-name>
          et al. “
          <article-title>On the Stratification of Multi-Label Data”</article-title>
          .
          <source>In: Machine Learning and Knowledge Discovery in Databases</source>
          . Ed. by
          <string-name>
            <given-names>D.</given-names>
            <surname>Gunopulos</surname>
          </string-name>
          et al. Vol.
          <volume>6913</volume>
          . Springer Berlin Heidelberg,
          <year>2011</year>
          , pp.
          <fpage>145</fpage>
          -
          <lpage>158</lpage>
          . doi: https://doi.org/10.1007/978-3-642-23808-6_10.
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          [54]
          <string-name>
            <given-names>P.</given-names>
            <surname>Smith</surname>
          </string-name>
          .
          <article-title>The Making and Knowing Project (2015-2022)</article-title>
          .
          <year>2020</year>
          . url: makingandknowing.org.
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          [55]
          <string-name>
            <given-names>P. H.</given-names>
            <surname>Smith</surname>
          </string-name>
          . “Laboratories”.
          <source>In: The Cambridge History of Science</source>
          <volume>3</volume>
          / Early Modern Science. Ed. by
          <string-name>
            <given-names>K.</given-names>
            <surname>Park</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Daston</surname>
          </string-name>
          . Cambridge: Cambridge University Press,
          <year>2006</year>
          , pp.
          <fpage>290</fpage>
          -
          <lpage>305</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          [56]
          <string-name>
            <given-names>L.</given-names>
            <surname>Thurneysser zum Thurn</surname>
          </string-name>
          .
          <article-title>Historia sive Descriptio Plantarum Omnium</article-title>
          . Berlini / Berlin: ?, 1578. url: http://data.onb.ac.at/rep/108C96EA.
        </mixed-citation>
      </ref>
      <ref id="ref58">
        <mixed-citation>
          [57]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ulsted</surname>
          </string-name>
          and
          <string-name>
            <surname>J. A.</surname>
          </string-name>
          et al.
          <article-title>Coelum philosophorum</article-title>
          . Lugduni / Lyon: Guilielmus Rovillius,
          <year>1572</year>
          . url: http://data.onb.ac.at/rep/1086E9DB.
        </mixed-citation>
      </ref>
      <ref id="ref59">
        <mixed-citation>
          [58]
          <string-name>
            <given-names>M.</given-names>
            <surname>Valleriani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Kräutli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lockhorst</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Shlomi</surname>
          </string-name>
          . “
          <article-title>Vision on Vision: Defining Similarities Among Early Modern Illustrations on Cosmology”</article-title>
          .
          <source>In: Scientific Visual Representations in History</source>
          . Ed. by
          <string-name>
            <given-names>M.</given-names>
            <surname>Valleriani</surname>
          </string-name>
          , G. Giannini, and
          <string-name>
            <given-names>E.</given-names>
            <surname>Giannetto</surname>
          </string-name>
          . Cham: Springer International Publishing,
          <year>2023</year>
          , pp.
          <fpage>99</fpage>
          -
          <lpage>137</lpage>
          . doi: https://doi.org/10.1007/978-3-031-11317-8_4.
        </mixed-citation>
      </ref>
      <ref id="ref60">
        <mixed-citation>
          [59]
          <string-name>
            <given-names>E.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Anandkumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Alvarez</surname>
          </string-name>
          , and P. Luo.
          <article-title>SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers</article-title>
          .
          Oct. 28,
          <year>2021</year>
          . url: http://arxiv.org/abs/2105.15203.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>