<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Decryption Through the Likelihood of Frequency of Letters</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Barbara</forename><forename type="middle">Sánchez</forename><surname>Rinza</surname></persName>
							<email>brinza@cs.buap.mx</email>
							<affiliation key="aff0">
								<orgName type="department">Computer Science</orgName>
								<orgName type="institution">Benemérita Universidad Autónoma de Puebla</orgName>
								<address>
									<addrLine>14 Sur y Av. San Claudio</addrLine>
									<postCode>72000</postCode>
									<settlement>Puebla</settlement>
									<region>Pue</region>
									<country key="MX">México</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Zacarias</forename><surname>Fernando</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Science</orgName>
								<orgName type="institution">Benemérita Universidad Autónoma de Puebla</orgName>
								<address>
									<addrLine>14 Sur y Av. San Claudio</addrLine>
									<postCode>72000</postCode>
									<settlement>Puebla</settlement>
									<region>Pue</region>
									<country key="MX">México</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Luna</forename><forename type="middle">Pérez</forename><surname>Flores</surname></persName>
							<email>fzflores@yahoo.com.mx</email>
							<affiliation key="aff0">
								<orgName type="department">Computer Science</orgName>
								<orgName type="institution">Benemérita Universidad Autónoma de Puebla</orgName>
								<address>
									<addrLine>14 Sur y Av. San Claudio</addrLine>
									<postCode>72000</postCode>
									<settlement>Puebla</settlement>
									<region>Pue</region>
									<country key="MX">México</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Mart</forename><surname>Mauricio</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Science</orgName>
								<orgName type="institution">Benemérita Universidad Autónoma de Puebla</orgName>
								<address>
									<addrLine>14 Sur y Av. San Claudio</addrLine>
									<postCode>72000</postCode>
									<settlement>Puebla</settlement>
									<region>Pue</region>
									<country key="MX">México</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Marco</forename><surname>Ínez Cortés</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Science</orgName>
								<orgName type="institution">Benemérita Universidad Autónoma de Puebla</orgName>
								<address>
									<addrLine>14 Sur y Av. San Claudio</addrLine>
									<postCode>72000</postCode>
									<settlement>Puebla</settlement>
									<region>Pue</region>
									<country key="MX">México</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><surname>Antonio</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Science</orgName>
								<orgName type="institution">Benemérita Universidad Autónoma de Puebla</orgName>
								<address>
									<addrLine>14 Sur y Av. San Claudio</addrLine>
									<postCode>72000</postCode>
									<settlement>Puebla</settlement>
									<region>Pue</region>
									<country key="MX">México</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Decryption Through the Likelihood of Frequency of Letters</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">3A884FBF00B3DDC3022DB5B80BB80394</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T09:13+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>Probability, Decrypt</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The method to decrypt the information using probability leads to a more thorough job, because you have to know the percentage of each of the letters of the language that is being analyzed here is Spanish. You can consider not only the probabilities of the letters also syllables, set of three, four letters and even words. Then you have this thing to do is make comparisons of the frequencies of cipher text and the frequencies of the language to begin to replace by a correspondence. And finally passing a scanner and find the decrypted text.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Cryptography is the science that alters the linguistic representations of a message <ref type="bibr" target="#b0">[1]</ref>. For this there are different methods, where the most common is encryption. This science masking the original references of the information by a conversion method governed by an algorithm that allows the reverse or decryption of information. Use of this or other techniques, allowing for an exchange of messages that can only be read by the intended beneficiaries as 'consistent'. A consistent recipient is the person to whom the message is directed with the intention of the sender. Thus, the recipient knows the discrete coherent used for masking the message. So either have the means to bring the message to the reverse process cryptographic, or can infer the process that becomes a message to the public. The original information to be protected is called plaintext or cleartext. Encryption is the process of converting plain text into unreadable gibberish called ciphertext or cryptogram. In general, the concrete implementation of the encryption algorithm (also called figure) is based on the existence of key secret information that fits the encryption algorithm for each different use <ref type="bibr" target="#b1">[2]</ref>.</p><p>Decryption is the reverse process to recover the plaintext from the ciphertext and key. Cryptographic protocol specifies the details of how to use algorithms and keys (and other primitive operations) to achieve the desired effect. The set of protocols, encryption algorithms, key management processes and actions of the users, which together constitute a cryptosystem, which is what the end user works and interacts. In this work, we must first have a ciphertext which must meet certain requirements, such a text should be bijective so that each element of the domain carries a single element of the condominium. In addition we must also take account of the rules of Kerckhoff <ref type="bibr" target="#b2">[3]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Development work</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Frequencies in Spanish</head><p>Is required to decrypt text using the odds as to how often they used certain letters in the alphabet, for this work only considered the Spanish language <ref type="bibr" target="#b4">[5]</ref>.</p><p>The frequencies of Spanish, which were used for this study were:</p><p>1. Frequency triglyphs 2. Frequency of digraphs 3. Most common words 4. Frequency of letters at the beginning of words 5. Frequency of letters in Spanish 6. Frequency Words</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Triglyphs Frequencies</head><p>The letter frequency statistics may vary from one to another depending on the corpus author has chosen to develop them. Usually differences when the corpus is literary or consists of texts of different origins. Table <ref type="table" target="#tab_0">1</ref> shows the frequency of each of the Spanish alphabet with their respective percentage. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Most Frequent words</head><p>The vowels make up about 46.38% of the text. The high frequency letters account for 67.56% of the text. Mid-frequency points accounting for 25% of the text <ref type="bibr" target="#b3">[4]</ref>.</p><p>In the dictionary the most common vowel is A, but in written texts is the E because of prepositions, conjunctions, verbs, etc. The most common consonants are L, S, N, D, with about 30%. The less frequent six letters: V, N, J, Z, X and K (just over 1%). The average frequency of a Spanish word is 5.9 letters. The coincidence index for Spanish is 0.0775. In addition to solving the encryption table <ref type="table" target="#tab_1">2</ref> we mentioned that we most frequently used words in a text of 10 000 words. Next, table <ref type="table" target="#tab_2">3</ref> shows the frequencies of the 4-letter words.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Most common words</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Frequency digraphs</head><p>The size of the corpus is 60,115 letters. The frequencies are absolute. The digraphs are read by row and column in that order. Below in table <ref type="table" target="#tab_3">4</ref> shows the union digraphs are letters from letters.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5">Most common initial letter</head><p>The most frequent letters in Spanish that start a word are listed in Table <ref type="table" target="#tab_4">5</ref> 3 Results</p><p>The ciphertext is used as said it had to be bijective and have Kerckhoff rules and the decrypted text shown in Figure <ref type="figure" target="#fig_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Four-letter words</head><p>Distribution of letters in literary texts Word Frequency E -16,78% R -4,94% Y -1,54% J -0,30% PARA 67 A -11,96% U -4,80% Q - </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions</head><p>We conclude that this method of decryption is good however would have to tweak a little more due to it depends on the text we have and how much text to decrypt was also observed that only decrypts an encrypted bijective. In this work, as seen in the results of Figure <ref type="figure" target="#fig_0">1</ref>, which apply various processes, first see the probability of the lyrics in Spanish that are more frequent, then seen with the syllables that are more frequent in Spanish, and then with the last word and you miss the information, text analyzer, as shown in Figure <ref type="figure" target="#fig_0">1</ref> a large percentage of the information is decoded, but as mentioned in the top, this will depend have that much information to process it.  </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. with each of the texts worked, 01 encrypted text, 02 text one pass, 03 second pass the text, either original text decrypted</figDesc><graphic coords="6,138.47,216.95,338.40,301.94" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Frequency triglyphs</figDesc><table><row><cell cols="7">High frequency letters Medium frequency letters Low frequency letters Frequencies 0.5%</cell></row><row><cell>letter</cell><cell>freq.%</cell><cell>letter</cell><cell>freq.%</cell><cell>letter</cell><cell>freq.%</cell><cell>G, F, V, W</cell></row><row><cell>E</cell><cell>16,78</cell><cell>R</cell><cell>4,94</cell><cell>Y</cell><cell>1,54</cell><cell></cell></row><row><cell>A</cell><cell>11,96</cell><cell>U</cell><cell>4,80</cell><cell>Q</cell><cell>1,53</cell><cell></cell></row><row><cell>O</cell><cell>8,69</cell><cell>I</cell><cell>4,15</cell><cell>B</cell><cell>0,92</cell><cell></cell></row><row><cell>L</cell><cell>8,37</cell><cell>T</cell><cell>3,31</cell><cell>H</cell><cell>0,89</cell><cell></cell></row><row><cell>S</cell><cell>7,88</cell><cell>C</cell><cell>2,92</cell><cell></cell><cell></cell><cell>J, Z, X, K, N</cell></row><row><cell>N</cell><cell>7,01</cell><cell>P</cell><cell>2,76</cell><cell></cell><cell></cell><cell></cell></row><row><cell>D</cell><cell>6,87</cell><cell>M</cell><cell>2,12</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Most frequent words of one, two and three letter</figDesc><table><row><cell></cell><cell></cell><cell cols="3">Two-letter words Three-letter words</cell></row><row><cell cols="4">Word Frequency Frequency Word</cell><cell>Frequency</cell></row><row><cell>DE</cell><cell>778</cell><cell>778</cell><cell>QUE</cell><cell>289</cell></row><row><cell>LA</cell><cell>460</cell><cell>460</cell><cell>LOS</cell><cell>196</cell></row><row><cell>El</cell><cell>339</cell><cell>339</cell><cell>DEL</cell><cell>156</cell></row><row><cell>EN</cell><cell>302</cell><cell>302</cell><cell>LAS</cell><cell>114</cell></row><row><cell>QUE</cell><cell>289</cell><cell>119</cell><cell>POR</cell><cell>110</cell></row><row><cell>Y</cell><cell>226</cell><cell>98</cell><cell>CON</cell><cell>82</cell></row><row><cell>A</cell><cell>213</cell><cell>74</cell><cell>UNA</cell><cell>78</cell></row><row><cell>LOS</cell><cell>196</cell><cell>64</cell><cell>MAS</cell><cell>36</cell></row><row><cell>DEL</cell><cell>156</cell><cell>63</cell><cell>SUS</cell><cell>27</cell></row><row><cell>SE</cell><cell>119</cell><cell>47</cell><cell>HAN</cell><cell>19</cell></row><row><cell>LAS</cell><cell>114</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Frequency with four letters</figDesc><table><row><cell>1,53%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 .</head><label>4</label><figDesc>Frequency of digraphs</figDesc><table><row><cell></cell><cell></cell><cell cols="4">A B C D E F G H I J K L M</cell></row><row><cell></cell><cell cols="5">A 12 14 54 64 15 5 8 4 10 8 41 30</cell></row><row><cell></cell><cell cols="2">B 11</cell><cell></cell><cell>5</cell><cell>14 1 12</cell></row><row><cell></cell><cell cols="2">C 39</cell><cell>5</cell><cell>17</cell><cell>8 80</cell><cell>3</cell></row><row><cell></cell><cell cols="2">D 32</cell><cell cols="2">1 2 84</cell><cell>1 30</cell></row><row><cell></cell><cell cols="5">E 20 5 47 26 17 8 21 6 9 3 44 26</cell></row><row><cell></cell><cell cols="2">F 2</cell><cell></cell><cell>9</cell><cell>12</cell><cell>1</cell></row><row><cell></cell><cell cols="2">G 12</cell><cell></cell><cell>12</cell><cell>5</cell><cell>1</cell></row><row><cell></cell><cell cols="2">H 15</cell><cell></cell><cell>3</cell><cell>5</cell></row><row><cell></cell><cell cols="4">I 43 8 42 29 40 5 8</cell><cell>1 14 16</cell></row><row><cell></cell><cell cols="2">J 4</cell><cell></cell><cell>5</cell></row><row><cell></cell><cell>K</cell><cell></cell><cell></cell><cell>1</cell></row><row><cell></cell><cell cols="2">L 44</cell><cell cols="2">5 5 35 1 3</cell><cell>28</cell><cell>9 5</cell></row><row><cell></cell><cell cols="2">M 32 10</cell><cell></cell><cell>42</cell><cell>30</cell></row><row><cell></cell><cell cols="5">N 41 2 33 37 41 10 6 2 28 1</cell><cell>5 4</cell></row><row><cell></cell><cell cols="5">O 19 17 28 26 16 6 5 5 4 1 22 33</cell></row><row><cell></cell><cell cols="2">P 30</cell><cell>1</cell><cell>16</cell><cell>5</cell><cell>8</cell></row><row><cell></cell><cell>Q</cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell cols="5">R 74 1 12 10 94 1 12 45 1 1 6 15</cell></row><row><cell></cell><cell cols="5">S 32 2 18 15 57 3 2 4 41 1</cell><cell>5 7</cell></row><row><cell></cell><cell cols="2">T 60</cell><cell>1</cell><cell>67</cell><cell>35</cell></row><row><cell></cell><cell cols="4">U 13 6 11 5 52 1 3</cell><cell>9</cell><cell>9 6</cell></row><row><cell></cell><cell cols="2">V 12</cell><cell></cell><cell>1 15</cell><cell>15</cell></row><row><cell></cell><cell cols="2">W 1</cell><cell></cell><cell>1</cell></row><row><cell></cell><cell>X</cell><cell></cell><cell>1</cell><cell>4</cell></row><row><cell></cell><cell cols="4">Y 5 1 3 2 5 1 1</cell><cell>1 1</cell></row><row><cell>letter</cell><cell>P</cell><cell>C</cell><cell>D</cell><cell cols="2">E S A L R M N T</cell></row><row><cell cols="6">frequency 1.1128 1.081 1.012 989 789 761 435 425 403 346 298</cell></row><row><cell>letter</cell><cell>Q</cell><cell>I</cell><cell cols="3">H U G V F O B J Y W Z K</cell></row><row><cell cols="6">frequency 286 281 230 219 206 183 177 169 124 47 27 19 2 1</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5 .</head><label>5</label><figDesc>Frequency of initial letters</figDesc><table /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Liddell and Scott&apos;s Greek-English Lexicon</title>
		<imprint>
			<date type="published" when="1984">1984</date>
			<publisher>Oxford University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Codigos Y Claves Secretas: Programas En Basic, Basado A Su Vez En Un Estudio Lexicogrfico Del Diario</title>
		<author>
			<persName><forename type="first">Anaya</forename><surname>Multimedia</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1986">1986</date>
			<pubPlace>El Pas; Mexico</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">William</forename><forename type="middle">F</forename><surname>Friedman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lambros</forename><forename type="middle">D</forename><surname>Callimahos</surname></persName>
		</author>
		<title level="m">Military Cryptanalytics, Cryptographic Series</title>
				<imprint>
			<date type="published" when="1962">1962</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m">Part I</title>
				<meeting><address><addrLine>Laguna Hills, Ca</addrLine></address></meeting>
		<imprint>
			<publisher>Aegean Park Press</publisher>
			<date type="published" when="1985">1985</date>
			<biblScope unit="volume">2</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Cryptograms In Spanish</title>
		<author>
			<persName><forename type="first">Wayne</forename><forename type="middle">G</forename><surname>Barker</surname></persName>
		</author>
		<imprint>
			<publisher>Aegean Park Press</publisher>
			<pubPlace>Laguna Hills, Ca</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
