Extract lists from a tab-delimited spreadsheet
Created: 2008-10-17 15:49:50
Last updated: 2008-10-18 21:20:52
This workflow takes a spreadsheet exported as a tab delimited file, such as an Excel spreadsheet. The style is expected as a matrix of data with the first row containing the column names and the first column containing the row names (see below). The top left cell is discarded.
The workflow outputs three lists. A list of column names, a list of row names, and a two deep list of data points. The first list is a list of the rows and the sublist is a list of values of the cells from the columns.
|
A |
B |
C |
D |
p |
1 |
2 |
3 |
4 |
q |
5 |
6 |
7 |
8 |
r |
9 |
10 |
11 |
12 |
s |
13 |
14 |
15 |
16 |
Preview
Run
Run this Workflow in the Taverna Workbench...
Workflow Components
Inputs (1)
Name |
Description |
TabDelimitedExcelFile |
|
Processors (6)
Name |
Type |
Description |
SplitColumnNamesIntoList |
local |
Splits the header row of the Excel Sheet by tab |
SplitRows |
local |
Splits the Excel Spreadsheet by newlines, giving a list of strings, each string being a row |
MakeColumnNameList |
beanshell |
For some reason, the previous processor returns a list of lists. This beanshell simply converts it back to a 1-deep list. |
SplitDataRows |
beanshell |
Separates the header row from the rest of the rows for separate processing |
SplitDataPoints |
local |
Splits each row from the excel spreadsheet by tab regex. Each iteration produces a list of the values from the cells of each row. Overall a list of lists is produced. |
SplitRowNames |
beanshell |
Filters out the first element of each row to build the list of row names and the 2-deel list of data points by row |
Beanshells (3)
Name |
Description |
Inputs |
Outputs |
MakeColumnNameList |
For some reason, the previous processor returns a list of lists. This beanshell simply converts it back to a 1-deep list. |
listoflists
|
list
|
SplitDataRows |
Separates the header row from the rest of the rows for separate processing |
rows
|
datarows
headerrow
|
SplitRowNames |
Filters out the first element of each row to build the list of row names and the 2-deel list of data points by row |
datarows
|
datarows
rownameslist
|
Outputs (3)
Name |
Description |
ColumnNameList |
|
DataPoints |
|
RowNameList |
|
Links (9)
Source |
Sink |
TabDelimitedExcelFile |
SplitRows:string |
SplitColumnNamesIntoList:split |
MakeColumnNameList:listoflists |
SplitDataPoints:split |
SplitRowNames:datarows |
SplitDataRows:datarows |
SplitDataPoints:string |
SplitDataRows:headerrow |
SplitColumnNamesIntoList:string |
MakeColumnNameList:list |
ColumnNameList |
SplitRowNames:datarows |
DataPoints |
SplitRowNames:rownameslist |
RowNameList |
SplitRows:split |
SplitDataRows:rows |
Uploader
License
All versions of this Workflow are
licensed under:
Version 1
(of 1)
Credits (1)
(People/Groups)
Attributions (0)
(Workflows/Files)
None
Shared with Groups (0)
None
Featured In Packs (1)
Log in to add to one of your Packs
Attributed By (0)
(Workflows/Files)
None
Favourited By (0)
No one
Statistics
Other workflows that use similar services
(0)
There are no workflows in myExperiment that use similar services to this Workflow.
Comments (0)
No comments yet
Log in to make a comment