Description
WoS Query Partitioner is a tool that interactively splits a Web of Science query which returns more than 100,000 results into smaller queries to allow to easily obtain an exact result count.
It does work by splitting the query using the Source
field (SO
) in a kind of Divide and Conquer recursive strategy.
In addition the application offers two different kinds of graphs which depict the partitioning process as well as a LaTeX table output of the executed queries.
The details of its inner workings will be posted here once the theoretical work in which it is based is published.
Application
Please refer to the instructions & troubleshooting section if you have any problems or questions about the application use.
Graphs and Table
Once the application has finished partitioning a query two different graphs will be shown in this section. The first one is a tree graph structure that depicts how the original query has been splitted. The second one is a partition graph in which the number of results of each subquery is represented in a proportional way.
In addition, and to ease the incorporation of results to other works, the application also generates a LaTeX table code with all the subqueries that have been generated along with their number of results and total results count.
Queries Tree
Partition Graph
LaTeX Queries Table
Instructions & Troubleshooting
The use of the WoS Query Partitioner is quite simple:
Insert the WoS Query to split in the
Query
field and press theBegin Query Partitioning
button. For example, you can input the queryPY=2007 and CU=USA
.A new window will open asking to execute a particular query to the WoS interface and requesting to input the number of results obtained in the blank field. To help the copy and paste mechanism into the Web of Science web page, the query to execute is automatically inserted into the system clipboard. If the results count of the executed query is greater than 100,000 (>100000), the field must be left blank. The
Next Iteration
button should then be pressed.Step 2 will be repeated several times until a final results count is obtained. Once this count is shown, the Graphs and LaTeX Table will be generated.
Troubleshooting
The WoS Query Partitioner has been deployed in form of a Java Applet that is run directly in a Web Browser. Thus, it is necessary to have the latest Java Plugin in order to run it. It can be freely downloaded for a great variety of platforms (including GNU/Linux, MacOS and even Windows) from the Java Webpage. The application should work in almost any platform, but it has only been thoroughly tested in a GNU/Linux environment.
The generated graphs use the SVG standard for vectorial graphics. Thus, a compatible browser will be needed in order to view those images. At the time of he writing, all the major browsers support SVG with the exception of Internet Explorer. In the deployment of the WoS Query Partitioner Firefox has been used and tested to correctly show the generated SVG images.
Finally, it is important to mention that as the application copies some information to the system clipboard (in order to ease the copy & paste to the Web of Science interface) a security warning will be shown (see below).
You should accept it in order to make the application work. Please note that the application has been deployed taking the highest care to avoid any security problems and that it is freely available in the hope that it will be useful, but WITHOUT ANY WARRANTY.
If you have any additional question about the WoS Query Partitioner, please do not hesitate to contact S. Alonso.
Acknoledgements and Citation
If you use this tool in any reasearch, please cite it in the following way:
WoS Query Partitioner, http://sci2s.ugr.es/software/WoSQP/
The web template used has been created with the help of the Web 2.0 Generator.
Sample Results
In the following we show the results of the application of the first Divide & Conquer approach to obtain the results count for the query TS=cancer
. Note that this results where obtained in October 2009 and will have probably changed since then. Summing up, a total of 32
queries had to be executed to obtain the final count of 908155
.
Query | Items | Sum | |
---|---|---|---|
#1 | TS=cancer | >100000 | |
#2 | TS=cancer AND (SO=J* OR SO=C* OR SO=I* OR SO=S* OR SO=E* OR SO=N* OR SO=F* OR SO=H* OR SO=O* OR SO=Z* OR SO=W* OR SO=U* OR SO=Y* OR SO=X* OR SO=3* OR SO=5* OR SO=7* OR SO=9*) | >100000 | |
#3 | TS=cancer AND (SO=JOURNAL * OR SO=I* OR SO=E* OR SO=F* OR SO=O* OR SO=W* OR SO=JA* OR SO=JU* OR SO=JE* OR SO=JI* OR SO=JOURNALS* OR SO=JN* OR SO=JOI* OR SO=3* OR SO=7* OR SO=JM* OR SO=JS* OR SO=JOG* OR SO=JOK*) | >100000 | |
#4 | TS=cancer AND (SO=JOURNAL O* OR SO=E* OR SO=O* OR SO=JA* OR SO=JOURNAL D* OR SO=JE* OR SO=JOURNALS* OR SO=JOI* OR SO=3* OR SO=JM* OR SO=JOG* OR SO=JOURNAL I*) | >100000 | |
#5 | TS=cancer AND (SO=E* OR SO=O* OR SO=JOURNAL OF A* OR SO=JOURNAL OF M* OR SO=JOURNAL OF S* OR SO=JOURNAL OF B* OR SO=JOURNAL OF F* OR SO=JA* OR SO=JOURNAL OF R* OR SO=JOURNAL OF L* OR SO=JOURNAL OF D* OR SO=JOURNAL D* OR SO=JOURNAL OF K* OR SO=JOURNAL OF Z* OR SO=JOURNAL OF Q* OR SO=JOURNALS* OR SO=JOURNAL OF X* OR SO=JM* OR SO=JOURNAL I*) | >100000 | |
#6 | TS=cancer AND (SO=E* OR SO=JOURNAL OF A* OR SO=JOURNAL OF S* OR SO=JOURNAL OF F* OR SO=JOURNAL OF R* OR SO=JOURNAL OF D* OR SO=JOURNAL OF K* OR SO=JOURNAL OF Q* OR SO=JOURNAL OF X* OR SO=JOURNAL I*) | 69181 | 69181 |
#7 | TS=cancer AND (SO=O* OR SO=JOURNAL OF M* OR SO=JOURNAL OF B* OR SO=JA* OR SO=JOURNAL OF L* OR SO=JOURNAL D* OR SO=JOURNAL OF Z* OR SO=JOURNALS* OR SO=JM*) NOT #6 | 53647 | 122828 |
#8 | TS=cancer AND (SO=JOURNAL OF T* OR SO=JOURNAL OF C* OR SO=JOURNAL OF P* OR SO=JOURNAL OF E* OR SO=JOURNAL OF N* OR SO=JOURNAL OF I* OR SO=JOURNAL OF H* OR SO=JOURNAL OF G* OR SO=JOURNAL OF O* OR SO=JOURNAL OF V* OR SO=JOURNAL OF W* OR SO=JOURNAL OF J* OR SO=JOURNAL OF U* OR SO=JE* OR SO=JOURNAL OF Y* OR SO=JOI* OR SO=3* OR SO=JOG*) NOT #6 NOT #7 | >100000 | |
#9 | TS=cancer AND (SO=JOURNAL OF THE * OR SO=JOURNAL OF P* OR SO=JOURNAL OF N* OR SO=JOURNAL OF H* OR SO=JOURNAL OF O* OR SO=JOURNAL OF W* OR SO=JOURNAL OF J* OR SO=JOURNAL OF THER* OR SO=JOURNAL OF TA* OR SO=JE* OR SO=JOURNAL OF TO* OR SO=JOI* OR SO=JOURNAL OF TU* OR SO=3*) NOT #6 NOT #7 | 41732 | 164560 |
#10 | TS=cancer AND (SO=JOURNAL OF C* OR SO=JOURNAL OF E* OR SO=JOURNAL OF I* OR SO=JOURNAL OF G* OR SO=JOURNAL OF V* OR SO=JOURNAL OF TR* OR SO=JOURNAL OF U* OR SO=JOURNAL OF TE* OR SO=JOURNAL OF THEO* OR SO=JOURNAL OF Y* OR SO=JOURNAL OF THO* OR SO=JOURNAL OF TI* OR SO=JOURNAL OF THR* OR SO=JOG*) NOT #6 NOT #7 NOT #9 | 64133 | 228693 |
#11 | TS=cancer AND (SO=I* OR SO=F* OR SO=W* OR SO=JOURNAL F* OR SO=JU* OR SO=JI* OR SO=JN* OR SO=JOURNAL A* OR SO=7* OR SO=JS* OR SO=JOK* OR SO=JOURNAL N*) NOT #6 NOT #7 NOT #9 NOT #10 | 73448 | 302141 |
#12 | TS=cancer AND (SO=C* OR SO=S* OR SO=N* OR SO=H* OR SO=Z* OR SO=U* OR SO=Y* OR SO=X* OR SO=JC* OR SO=JOURNALI* OR SO=JB* OR SO=JOA* OR SO=JOR* OR SO=5* OR SO=9* OR SO=JP* OR SO=JOE* OR SO=JOH* OR SO=JOM*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 | >100000 | |
#13 | TS=cancer AND (SO=S* OR SO=H* OR SO=CA* OR SO=Z* OR SO=CL* OR SO=CE* OR SO=CI* OR SO=CY* OR SO=CZ* OR SO=CM* OR SO=JOURNALI* OR SO=JB* OR SO=JOR* OR SO=5* OR SO=JP* OR SO=JOH* OR SO=CB* OR SO=CT*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 | >100000 | |
#14 | TS=cancer AND (SO=S* OR SO=CA* OR SO=CL* OR SO=CI* OR SO=CZ* OR SO=JOURNALI* OR SO=JOR* OR SO=JP* OR SO=CB*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 | >100000 | |
#15 | TS=cancer AND (SO=CA* OR SO=SC* OR SO=CL* OR SO=SU* OR SO=CI* OR SO=SI* OR SO=SH* OR SO=SL* OR SO=SB* OR SO=SW* OR SO=SV* OR SO=SN* OR SO=CB* OR SO=SD* OR SO=SJ* OR SO=SZ*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 | >100000 | |
#16 | TS=cancer AND (SO=CL* OR SO=SCI* OR SO=CI* OR SO=CAR* OR SO=SH* OR SO=SCH* OR SO=SL* OR SO=CAH* OR SO=SCR* OR SO=SW* OR SO=SV* OR SO=SN* OR SO=SD* OR SO=SZ* OR SO=CAB* OR SO=CAO* OR SO=CAV* OR SO=SCU*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 | 34401 | 336542 |
#17 | TS=cancer AND (SO=CAN* OR SO=SU* OR SO=SI* OR SO=SCA* OR SO=CAT* OR SO=CAL* OR SO=SCO* OR SO=CAM* OR SO=SB* OR SO=CAD* OR SO=CAS* OR SO=CB* OR SO=SJ* OR SO=CA-* OR SO=CAI* OR SO=CAP* OR SO=SCE*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 | 85724 | 422266 |
#18 | TS=cancer AND (SO=SO* OR SO=ST* OR SO=SE* OR SO=SP* OR SO=SA* OR SO=SY* OR SO=SM* OR SO=SK* OR SO=CZ* OR SO=JOURNALI* OR SO=JOR* OR SO=JP* OR SO=S * OR SO=SG* OR SO=SR*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 | 16526 | 438792 |
#19 | TS=cancer AND (SO=H* OR SO=Z* OR SO=CE* OR SO=CY* OR SO=CM* OR SO=JB* OR SO=5* OR SO=JOH* OR SO=CT*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 | 28768 | 467560 |
#20 | TS=cancer AND (SO=N* OR SO=CO* OR SO=CH* OR SO=CU* OR SO=U* OR SO=CR* OR SO=Y* OR SO=X* OR SO=JC* OR SO=CN* OR SO=CC* OR SO=JOA* OR SO=CF* OR SO=9* OR SO=JOE* OR SO=JOM* OR SO=CS* OR SO=CW*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 | 57764 | 525324 |
#21 | TS=cancer AND (SO=A* OR SO=P* OR SO=B* OR SO=M* OR SO=R* OR SO=T* OR SO=G* OR SO=D* OR SO=L* OR SO=V* OR SO=K* OR SO=Q* OR SO=2* OR SO=1* OR SO=4* OR SO=6* OR SO=8* OR SO=0*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 | >100000 | |
#22 | TS=cancer AND (SO=A* OR SO=B* OR SO=R* OR SO=G* OR SO=L* OR SO=K* OR SO=2* OR SO=4* OR SO=8*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 | >100000 | |
#23 | TS=cancer AND (SO=B* OR SO=G* OR SO=AC* OR SO=AR* OR SO=K* OR SO=AU* OR SO=AP* OR SO=AG* OR SO=AT* OR SO=AB* OR SO=AV* OR SO=AA* OR SO=AK* OR SO=A * OR SO=8*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 | >100000 | |
#24 | TS=cancer AND (SO=G* OR SO=BU* OR SO=AR* OR SO=BR* OR SO=AU* OR SO=AP* OR SO=AG* OR SO=BM* OR SO=BL* OR SO=AA* OR SO=AK* OR SO=BS* OR SO=BF* OR SO=BT* OR SO=8*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 | 98550 | 623874 |
#25 | TS=cancer AND (SO=AC* OR SO=BI* OR SO=K* OR SO=BO* OR SO=BE* OR SO=BA* OR SO=AT* OR SO=AB* OR SO=AV* OR SO=BY* OR SO=B * OR SO=A * OR SO=BJ* OR SO=BW* OR SO=B-*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 | 39606 | 663480 |
#26 | TS=cancer AND (SO=R* OR SO=AN* OR SO=L* OR SO=AM* OR SO=AD* OR SO=AS* OR SO=AL* OR SO=AF* OR SO=AQ* OR SO=AI* OR SO=2* OR SO=AE* OR SO=AJ* OR SO=4* OR SO=AX*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 | >100000 | |
#27 | TS=cancer AND (SO=RE* OR SO=L* OR SO=AD* OR SO=AL* OR SO=RU* OR SO=RO* OR SO=AQ* OR SO=RH* OR SO=AE* OR SO=RN* OR SO=RY* OR SO=AX* OR SO=R\&* OR SO=RS*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 | 35432 | 698912 |
#28 | TS=cancer AND (SO=AN* OR SO=AM* OR SO=AS* OR SO=RA* OR SO=AF* OR SO=RI* OR SO=AI* OR SO=2* OR SO=AJ* OR SO=RL* OR SO=4* OR SO=R * OR SO=RB*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 | 87603 | 786515 |
#29 | TS=cancer AND (SO=P* OR SO=M* OR SO=T* OR SO=D* OR SO=V* OR SO=Q* OR SO=1* OR SO=6* OR SO=0*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 | >100000 | |
#30 | TS=cancer AND (SO=M* OR SO=D* OR SO=PH* OR SO=PA* OR SO=PE* OR SO=PL* OR SO=PU* OR SO=PF* OR SO=PT* OR SO=1* OR SO=0* OR SO=PC* OR SO=PP*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 | 70364 | 856879 |
#31 | TS=cancer AND (SO=T* OR SO=PR* OR SO=V* OR SO=PO* OR SO=PS* OR SO=Q* OR SO=PI* OR SO=PM* OR SO=PY* OR SO=6* OR SO=P * OR SO=PN*) NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 NOT #30 | 51130 | 908009 |
#32 | TS=cancer NOT #6 NOT #7 NOT #9 NOT #10 NOT #11 NOT #16 NOT #17 NOT #18 NOT #19 NOT #20 NOT #24 NOT #25 NOT #27 NOT #28 NOT #30 NOT #31 | 146 | 908155 |