NGrams

NGrams are succeeding string. finding ngrams is common in many disciplines - as you can see from figure 1 (source: Wikpedia). Here, finding ngrams is used to bring the words back into their context (and then show the context).

Example Analysis "Polymer Casting", 1976 - 1993, Found NGrams (2 Strings = Bigrams)

  • Total: how often a specific BiGram is found in the whole document collection
  • # Documents: how many documents contain a specific NGram


Attribute
Total
# Documents
acid_anhydrid
239.0
68.0
acid_copolym
120.0
49.0
acid_est
354.0
93.0
acid_g
157.0
47.0
acid_group
189.0
54.0
acid_mal
70.0
46.0
acid_methacryl
108.0
60.0
acid_polym
167.0
54.0
acid_solut
150.0
64.0
air_bubbl
119.0
51.0
cast_film
833.0
163.0
cast_form
111.0
54.0
cast_glass
285.0
76.0
cast_membran
203.0
55.0
cast_mold
110.0
54.0
cast_polym
293.0
68.0
cast_sheet
170.0
44.0
cast_solut
141.0
59.0
dry_nitrogen
94.0
46.0
film_cast
379.0
141.0
film_composit
130.0
57.0
film_compris
172.0
72.0
film_contain
119.0
45.0
film_dri
90.0
57.0
film_exampl
129.0
57.0
film_film
188.0
92.0
film_form
703.0
201.0
film_heat
79.0
43.0
film_membran
169.0
59.0
film_obtain
133.0
58.0
film_polym
188.0
75.0
film_prepar
134.0
71.0
film_produc
91.0
53.0
film_said
113.0
45.0
film_support
165.0
48.0
film_surfac
156.0
57.0
film_thick
484.0
159.0
film_us
69.0
51.0
films_cast
140.0
80.0
films_invent
116.0
46.0
films_prepar
94.0
63.0
films_produc
79.0
47.0
flat_sheet
154.0
65.0
flow_rat
451.0
85.0
form_cast
127.0
76.0
form_composit
83.0
48.0
form_film
193.0
113.0
form_membran
101.0
61.0
form_polym
111.0
67.0
form_solut
108.0
66.0
form_thin
138.0
61.0
glass_plat
654.0
121.0
glass_transit
536.0
119.0
group_carbon
362.0
64.0
inert_atmospher
133.0
48.0
inert_atmospher
133.0
48.0
layer_cast
106.0
51.0
layer_compris
109.0
50.0
layer_form
227.0
84.0
layer_lay
77.0
55.0
layer_said
110.0
45.0
layer_surfac
71.0
45.0
layer_thick
111.0
55.0
low_dens
292.0
58.0
low_molecular
432.0
129.0
low_temperatur
168.0
93.0
low_viscos
216.0
75.0
lower_alkyl
183.0
58.0
lower_molecular
87.0
47.0
lower_temperatur
82.0
48.0
metal_oxid
333.0
51.0
metal_salt
146.0
51.0
metal_surfac
102.0
55.0
mm_diamet
106.0
61.0
mm_hg
148.0
58.0
mm_mm
185.0
54.0
mm_thick
201.0
79.0
n_butyl
127.0
63.0
n_dimethyl
126.0
54.0
n_integ
150.0
58.0
n_methyl
166.0
79.0
n_methylpyrrolidon
430.0
49.0
non_por
104.0
59.0
non_solv
1091.
113.0
non_woven
102.0
56.0
p_phenylen
168.0
43.0
r_alkyl
143.0
60.0
vinyl_acet
662.0
135.0
vinyl_alcohol
300.0
62.0
vinyl_chlorid
258.0
64.0
vinyl_est
81.0
43.0
vinyl_eth
141.0
47.0
vinyl_monom
300.0
43.0
water_ad
120.0
62.0
water_bath
221.0
91.0
water_cont
146.0
45.0
water_dri
71.0
49.0
water_exampl
56.0
48.0
water_form
113.0
69.0
water_insolubl
215.0
51.0
water_mixtur
89.0
52.0
water_remov
101.0
63.0
water_resist
338.0
59.0
water_result
122.0
44.0
water_solubl
985.0
132.0
water_solv
64.0
46.0
water_wash
225.0
60.0
water_wat
93.0
58.0