Commit Graph

162 Commits

Author SHA1 Message Date
Gregory Soutade
bde91ca936 Move reverse DNS core management into iwla.py + Add robot_domains configuration 2024-10-27 09:16:01 +01:00
Gregory Soutade
70de0d3aca Add no_merge_feeds_parsers_list conf value 2024-10-27 09:15:39 +01:00
Gregory Soutade
9939922c31 Move feeds and reverse_dns plugins from post_analysis to pre_analysis 2024-10-02 08:27:53 +02:00
Gregory Soutade
6d46ac4461 Robots: Improve compatible keyword detection for robots 2024-07-28 09:25:40 +02:00
Gregory Soutade
46c9ae4f15 Feeds: Add domain and number of subscribers for feed parser.
Set correct date for merged feed parsers
Remove bad BAD_FEED_PARSER state
2024-07-28 09:25:06 +02:00
Gregory Soutade
e2210f3eab Update geo ip misc plugin 2024-02-15 10:55:59 +01:00
Gregory Soutade
9db72f41fd Don't analyze referer for non viewed hits/pages 2024-02-15 10:55:38 +01:00
Gregory Soutade
d78739157b Remove all trailing slashs of URL before starting analyze 2024-02-03 09:02:55 +01:00
Gregory Soutade
974d355dd4 Add no_referrer_domains list to defaut_conf for website that defines this policy 2024-01-30 11:24:52 +01:00
Gregory Soutade
83275a8db4 Rework filtered_users output to have full location in a column 2023-08-06 13:25:42 +02:00
Gregory Soutade
16cd817fec Increase not modified page threshold for robot detection 2023-07-05 09:15:48 +02:00
Gregory Soutade
d32b2440ee Bugfix: flags management for feeds display 2023-06-14 09:21:51 +02:00
Gregory Soutade
9c688e1545 Display visitor IP is now a filter 2023-05-21 11:06:16 +02:00
Gregory Soutade
7ef0911fa7 Main key for visits is now remote_ip and not remote_addr 2023-05-21 11:04:40 +02:00
Gregory Soutade
7507b8e77f WIP 2023-04-28 16:17:47 +02:00
Gregory Soutade
de79f526dd Add IP type plugin 2023-04-18 20:34:45 +02:00
Gregory Soutade
4b58048198 Update browsers with msie and Opera 2023-04-18 20:33:09 +02:00
Gregory Soutade
71d8ee2113 Forgot Firefox icon 2023-03-25 08:11:57 +01:00
Gregory Soutade
440f51ddd1 Remove robot rule 1 page for phones 2023-03-23 21:17:52 +01:00
Gregory Soutade
cad3467c25 Remove detection from awstats dataset for browser 2023-03-23 21:16:54 +01:00
Gregory Soutade
adc04bf753 Update iwla :
* Rework arg variable management
  * Manage dry run at top level
  * 'robot' property is now None by default (allow to do analysis only once)
  * Add --disable-display option
2023-03-11 20:51:44 +01:00
Gregory Soutade
a0a1f42df4 Update robot detection plugin :
* Do analyze only one time by month
  * Reactivate rule : no page view if count_hit_only_visitors is False
  * Add exception for "Less than 1 hit per page" rule if a phone is used
  * Check for all error codes in 400..499, not only 403 and 404
  * Referer '-' now counted as null
2023-03-11 20:48:17 +01:00
Gregory Soutade
31bc67ceba Replace feed referers by feed user agent 2023-03-11 20:42:56 +01:00
Gregory Soutade
3fdbc282c8 Remove feed parser detection by referer 2023-03-11 20:42:37 +01:00
Gregory Soutade
c8dfdd17f7 Add "compatible" as a criteria for robot 2023-02-18 08:49:14 +01:00
Gregory Soutade
a5bef4ece6 Search for "compatible" in all requests, not only the first one 2023-02-18 08:48:57 +01:00
Gregory Soutade
21a21cd68f Add a new rule for robots : 1 page and 1 hit, but not from the same source 2023-02-04 08:40:04 +01:00
Gregory Soutade
185664850d Add subdomains plugin 2023-01-28 09:44:43 +01:00
Gregory Soutade
6a4fd4e9c8 New rule for robot : more than 10 not modified pages in a row 2023-01-28 09:40:26 +01:00
Gregory Soutade
ac246eabe2 Find robot name in 'compatible' string and group them 2023-01-28 09:38:59 +01:00
Gregory Soutade
9c57ad3ece Feeds : display last access date for merged feed parsers 2023-01-28 09:36:48 +01:00
Gregory Soutade
3a8c667fdc Feeds display: Add "*" after a space in order to have flags 2023-01-28 09:35:48 +01:00
Gregory Soutade
3117aebac0 Add all_visits_enlight display plugin 2022-11-20 10:15:13 +01:00
Gregory Soutade
82993afbce Show last access information for feeds parsers 2022-11-20 10:12:16 +01:00
Gregory Soutade
975cc66bd5 Don't launch robot analysis rules for feed parsers 2022-11-16 21:10:11 +01:00
Gregory Soutade
f8b37a625c Rework feed merge algorithm. Allow to merge feeds based on name regular expression with merge_feeds_parsers_list conf value 2022-11-16 21:09:50 +01:00
Gregory Soutade
242bb6cabe Fix error in None handling of unknown city/country in filtered users 2022-11-16 21:09:36 +01:00
Gregory Soutade
cce8c75118 Display viewed and no viewed data for feeds (can be set as a robot too late) 2022-11-16 21:09:23 +01:00
Gregory Soutade
ad01b48898 Change merge_one_hit_only_feeds_parsers by merge_feeds_parsers and set it to False by default 2022-11-12 19:00:14 +01:00
928e4eea5a Move iptogeo.py util into misc 2022-11-10 20:14:18 +01:00
Gregory Soutade
41432046a1 Add Anonymize IP plugin 2022-11-10 20:06:00 +01:00
Gregory Soutade
b93c4bf470 Remove browser condition for filter users Geo IP 2022-11-10 20:04:56 +01:00
Gregory Soutade
2aad389e2b Fix message for all visits and top countries 2022-11-10 20:04:45 +01:00
Gregory Soutade
6ce0374fe7 Display external referers in filter_users plugin 2022-11-07 21:13:49 +01:00
310fbcdb07 Remove deprecated track users plugin 2022-11-04 20:14:22 +01:00
Gregory Soutade
c0fc5a0131 Add user agent in filter users display and limit filtered users to 10 in index 2022-11-04 20:04:08 +01:00
Gregory Soutade
7ec7e0d644 Add Geo IP location in filter users 2022-11-04 20:03:52 +01:00
Gregory Soutade
c84373b2d7 Filter users filter can now be a function defined in conf 2022-11-04 20:03:33 +01:00
Gregory Soutade
4d3c2107f0 Don't save all visitors requests into database (save space and computing). Can be changed in deufalt_conf.py with keep_requests value 2022-06-23 21:16:30 +02:00
Gregory Soutade
9cee1fa71c Move Filter Users plugin into pre_analysis (and keep display part) 2022-06-23 21:14:58 +02:00