How to transcribe:
Using Microsoft Excel® (the prefered method):
-
Since all the contributed transcriptions will be merged into the
same database, it is very important that the same format be used by
all. Please follow the instructions to the letter. If you are a new
volunteer please e-mail the results after you have completed the first
page so that we may check for any problems before continuing.
-
Please transcribe the entries using Microsoft Excel® following the template shown in the Sample below. If you do not have Microsoft Excel®, you can use any other spreadsheet
program as long as I can read it in Microsoft Excel (for example you
can save your spreadsheet in comma delimited value format - please
be sure to send me a test file before doing a lot of work). You can
also use any word processing program - but please see the format instructions
below.
-
Please follow the following instructions exactly (see the sample
transcription at the bottom of the page):
-
Use the template shown in the sample at the bottom of the page.
It is not necessary to title the columns unless you feel it helps
you (I will erase the column titles when I merge your contribution
into the data base).
-
Do NOT use any accents (we don't want to sort "á"
separately from "a").
-
Do NOT use the word "y" ("and") in front of the
2nd Surname (we don't want to sort on this letter).
-
Do NOT use dieresis/umlaut (two dots) over a "u".
Write "Aguero" rather than "Agüero".
-
DO use the letter "ñ". You can enter it
by holding down the "ALT" key and entering the sequence "164"
on the numeric pad at the end of your keyboard (if this doesn't
work try "0241").
-
Use one row of cells for each entry.
-
If there is no data to be entered in a cell, just leave the cell
blank.
-
If the entry says "idem" or "id." DO NOT type the word "idem". The word "idem" means "same
as above". Please go up the column until you find the first real
(non "idem") data and type that data in. (when the data
gets sorted "idem" won't mean much).
-
Do not enter the data from the last three archive reference number
columns ("libro, folio, número"), nor the remarks ("observaciones")
shown on the extreme right column of some names.
-
Enter the date of entry as 5-Abr-97. Use the first 3-letters
of the Spanish month (dont translate the month names
- use "Ene, Feb, Mar, Abr, May, Jun,
Jul, Ago, Sep, Oct, Nov, Dic"). You
do not need to enter any leading zeroes (such as 01 for 1).
-
In the Column titled "Corps ID" please enter our code
number assigned to the corresponding Regimental Unit. These can
be found on our List of Regiments
If the first group of soldiers on the first page of your set
does not have a unit name, please ask us and we will search the
prior page and let you know the Regiment.
If you find a Regiment or other Corps organization which is not
on our list let us know and we will assign an ID code number to
it.
-
One of the nice things about using Excel®,
is that once you have typed data in a cell it tries to automatically
fill out the same information on the next cell down the column.
This can help save some time. Also you can copy and paste data
if the same data such as Corps ID is repeated for a long series
of names. Be aware, however, that this feature can work against
you - If you have typed the name "Jose Antonio" and
you next try to type the name "Jose", Excel will add
the "Antonio" automatically and you must go back and
manually erase it. Be alert to this behavior to avoid errors.
Using a Word Processor:
-
Please refer to the sample transcription at the bottom of this page.
-
Do not use any fancy fonts, bold lettering, underlines, superscripts
or subscripts. We need a plain text file in order to import
it into the database.
-
Don't try to enter the data into a word processor table format. We
need plain text!
-
Type the entries one line to each soldier. Don't insert any blank
lines.
-
Use the same format rules for each data field as described for Excel® above.
-
Separate each data field within each line by means of a single comma.
If there is no data for a column, just enter the comma separator (the
data base requires every line to have the same number of columns -
every line should therefore end up with exactly 8 commas separating
the 9 columns, no more and no less).
How to send in your completed transcription
- Please save your work under the following file name:
- Xnnnnn.xls (if an Excel® spread
sheet), or
Xnnnnn.txt (if a word processor text file - be
sure to save as a plain text ".txt" file)
where:
- "X"; is the first letter of the surnames you are transcribing.
"nnnnn"; is the number of the first soldier entry on the
page.
Thus if Sargent Ballate, in the sample at the bottom of the page,
was the first name on the page, the Excel® file should
be called B5309.xls and the word processor file should be called B5309.txt. This will help keep our heads straight when we start
receiving files from many different volunteers.
- After you have completed each page, please e-mail them to both of the following addresses:
Please put the file name in the "Subject:" of the message,
so we wont miss it.
-
Please DO NOT mail the transcription to the CUBA-L list.
We will check the format, convert it to database format and post
it on the web as soon as possible.
A few last notes:
-
Work at your own pace. If you are unable to continue for any reason,
please let us know as soon as possible so we can reassign the remaining
work to another volunteer.
-
The data will be posted at the following URL: http://www.cubagenweb.org/mil/mambi/
-
We plan to post the names of all volunteers unless you tell us not
to. Please let us know if you prefer to remain anonymous.
Thank you on behalf of all people interested in Cuban Genealogy!
Sample Transcription
Original Data:
Cuartel General del General en Jefe
5309 |
Ballate Mantecon,
Manuel |
Bernardo, Concepcion |
Sargento 1o |
20 |
Abril |
95 |
1 |
99 |
134 |
5310 |
Brizuela, Ismael |
|
Idem |
12 |
Junio |
95 |
1 |
103 |
158 |
|
|
|
|
|
|
|
|
|
|
Excel® Transcription:
# |
Surname
1 |
Surname
2 |
Name(s) |
Father |
Mother |
Rank |
Date
of Entry |
Corps
ID |
5309 |
Ballate |
Mantecon |
Manuel |
Bernardo |
Concepcion |
Sargento 1o |
20-Abr-95 |
C-HQ |
5310 |
Brizuela |
|
Ismael |
|
|
Sargento 1o |
12-Jun-95 |
C-HQ |
|
|
|
|
|
|
|
|
|
Word Processor Transcription:
5309,Ballate,Mantecon,Manuel,Bernardo,Concepcion,Sargento 1o,20-Abr-95,C-HQ
5310,Brizuela,,Ismael,,,Sargento 1o,12-Jun-95,C-HQ
(notice the same number of commas, 8 in each line)
Please note:
-
Surname 1 =first surname listed
-
Surname 2 =second surname (if present) (always appears before
the comma in the original list)
-
Name(s) =given name(s) (include initials if any) (always appears
after the comma)
-
Father =Father's first name (first name listed - before the
comma)
-
Mother =Mother's first name (after the comma)
(if only one name is listed take your best guess - if it ends in "o"
it's usually the Father's, if it ends in "a" it's usually the Mother's).
-
Grado =rank (do not translate and do not enter "idem").
Enter the Sargeant ranks as "Sargento 1o" and "Sargento
2o" Don't use superscripts.
-
Date =date of entry into War (type as dd-Mon-yy)
-
Corps ID =our code corresponding to the Regiment which appears
as a section heading above the group of names. Get the code from our Regiment List, or ask us if you can't find
it.)
Questions and Answers
We received a few questions and thought it might help if we posted the
questions and answers.
First let me explain that I am going to take your Excel spreadsheet and
convert it to a comma delimited text file, very similar to the "word processor"
transcription shown on the instructions. I just think that using Excel
makes things easier keeping the information aligned in the right columns,
facilitating copying repetitive information such as regiment names, and
preventing people from using fancy fonts (the software requires plain
text).
Here is the first batch of questions and answers:
-
On the sample Excel transcription, under "rank" you have "Sargento"
and "1o" on different lines. Could they be on one line?
A: The database software requires all the data pertaining to each
soldier to be on a single line. (In the example the data wrapped to
fit the screen). It doesn't matter if the text wraps around in an
Excel cell - it all straightens out when I convert it to text. Don't
use the "Enter" key to generate two lines in a cell though!
BTW - The "o" is really a superscript and extremely difficult to
read - I originally thought it was an "a" and referred to "Sargento
de 1a Clase" (Sergeant First Class), but I guess it is
probably "Sargento de 1o Grado" (Sergeant of First Degree)
- Please do not use superscripts, just enter it as "Sargento
1o" or "Sargento 2o" using the letter "o",
not the number "0"
-
I understand that we should send in the first page we complete to
check for problems but should we send in every other page we complete
AS we complete them or only when we have completed our whole remaining
nine or so pages?
A: After I check the first page, you can do whatever you prefer.
I prefer to receive large files since I have to process each file
individually, but you can send one page at a time or send one or more
files each containing several pages. You can even break in the middle
of a page if you like - it's up to you . I'm just going to merge it
all into one big file for each letter of the alphabet. The reason
for requesting a test page is that I did not want someone to go through
all the work to transcribe 700 names and do it in an incompatible
way due to some misunderstanding of my instructions. I can do a lot
of batch corrections with my program editor but I can't correct all
the possible errors.
-
And if we send them in as a nine page block should it have nine different
file names or just one?
A: Either is OK. Always use a different name for each file, regardless
of the number of pages. Always use the first letter and number of
the first soldier in the first page of the batch as the file name
- this will hopefully keep the file names unique and make the files
easier to merge into the main data base.
Please DO NOT send the same information more than once. This will
just confuse us and can lead to errors. If you spot an error at a
later time just send us an e-mail mentioning the soldier number and
error and we will fix it in the master data base.
-
Will you inform us if our first page is OK? (or not!)
A: Yes, I will notify each volunteer individually as I get the transcriptions
and check them.
-
Do you want us to enclose each "cell" (that is, a name or a rank
or a date, whatever) in a frame, or I think Excel calls it a "border",
or do you want just plain text, all aligned neatly?
A: Plain text please - it's all going to be converted to plain text
anyway. Alignment within the cell doesn't matter, just let it free
flow and wrap (select the entire spreadsheet and turn on text wrap
under "format ->cell ->alignment). Don't try to make it neat within
a cell.
-
Will you inform all of the volunteers personally if there are any
updates to the instructions or the list of regiments, etc.?
A: Good idea, I just made a batch mailing list so I can notify the
volunteers directly (this way I won't clutter up the CUBA-L list with
administrative stuff). I'm sure I'll have to make clarifications as
questions like these come up. I had to add 2 new regiment names as
I was transcribing the data from an appendix. I also found that some
entries in this appendix did not have an assigned regiment so I put
"M" for "miscellaneous" as a placeholder so I can go back later and
try to fix it if I can find additional data.
-
On my list I have for example - Aguilera (ó Aguila) Mariano
- I guess ó means "or". How do you want us to deal with this?
A: There are two ways to deal with this - one is to enter it the
same as the original (except leave out the accent). Another one would
be to make two entries, one under each surname but indicating the
alternate form also. That way a search would find it under either
surname. In this instance, I think it is better to do two lines as
follows:
9999,Aguilera (o Aguila),.,Mariano....(etc)
9999,Aguila (o Aguilera),.,Mariano....(etc)
-
How should we show the superscript letters? I don't seem to be able
to make them in Excel.
A: Don't use superscripts - use "1o" for "1o". After we
are all done, I can, with my editor, globally restore the superscripts
in the html language.
-
A few of my names have the day part of the date missing. For example,
one has only Abril 95. How should we indicate that?
A: Just substitute a "?" for any missing data as follows: "?-Abr-95".
-
Are there any clues you can throw our way as to whether a parent's
name is male or female? I think that if it ends in o it's probably
male and if it ends in a it's probably female. Is that a reliable
assumption? A person can also look at the other names on his or her
list and if the name in question appears as one of a pair (i.e. both
the mother and father's names are given) it will be apparent whether
it's male or female. As a last resort (rather than guessing) Maria
doesn't seem to mind answering questions.
A: Yes, that is the general rule and fairly reliable. Ed or Maria
will be glad to answer questions.
-
What should we do if we aren't sure of the spelling of a name? I
have number 286 which looks like Leoncio Arias but I'm not 100% sure
and hate to guess.
A: Send us an e-mail mentioning the number - we may be able to decipher
it by consulting a better copy or the original book. Just make a note
of the problem and send it to me in an e-mail so we can try to fix
it later.
-
What should we do if we think there is an possible error in the original?
For example, my number 78 says - Alfonso Madan, Pedro with the comma
after the Madan. Could this guy's first surname be Alfonso?
A: Yes, Alfonso can also be a surname. I am sure we will find real
errors, but let's minimize creating new ones - Try to make an effort
to transcribe it exactly as written. We have already found several
soldiers with female names. I have never heard of female soldiers
in this war but there might have been nurses, etc.
-
You have probably read about the regiment I found that wasn't on
your list. It was Cuartel General del Deparamento Occidental, which
includes soldiers numbered 73 through 87. Maria said to make up a
code and I figured W-HQ was probably appropriate. Did I guess right?
A: Yes, you decoded my system! - Just let me know if you invent a
new code so I can remember to add it to the list.
-
I hope that people aren't typing "idem". I know you mentioned it
but it wouldn't hurt to emphasize how to deal with it.
So far nobody has done this. "idem" and "id." means "same as above".
Obviously you don't want to use these expressions if the data is to
be re-sorted. You may have noted that the original entries are not
in exact alphabetical order, even within the same unit. One of the
real benefits of doing an electronic transcription like this is that
you can sort all the names alphabetically. The software is
currently set up to do this automatically which is why it takes a
few seconds to show names in the grid.
-
When I enter dates, they behave funny - some align to the left in
the cell and some align to the right of the cell. What's happening
and what can I do about it?.
A: What's happening is that Excel is recognizing the date you are
typing in as a real date (rather than text), but only when the months
use the same abbreviation as in English. The dates with Spanish months
such as 15-Ene-95 are not recognized and are handled as text. When
I export the Excel data to a comma delimited file for the database
both get converted to text so it really doesn't matter. If it really
bothers you, you can stop Excel from doing this by selecting the "Date"
column (or the entire spreadsheet) and in the menu "Format"->"Cell"->"Number"
select "Text". You have to do this BEFORE you enter any data, however.
If you try doing this after you have entered data, the English dates
will be converted to a serial number corresponding to days elapsed
from January 1900 (1904 for the Mac - don't ask me why). If you accidentally
do this, just go to the "Edit" menu and select "Undo Cell Formatting"
to get your dates back.
|